Zhiting Hu
Treating a language model as one component in a larger planning system — instead of asking it to do reasoning end-to-end in its own head.
The dominant frame for LLM reasoning is to keep everything inside the language model: prompt it to produce reasoning steps, sample from the resulting distribution, hope the right answer falls out. RAP (Reasoning as Planning, 2023, with Hu as senior author) makes a different bet — use the LLM as a world model that predicts what happens after a candidate action, and combine it with classical Monte Carlo Tree Search to choose which action to actually take. The architectural consequence is that reasoning becomes auditable in a way pure prompting isn't: you can inspect the search tree, see which states the model evaluated as promising, and trace why the final answer was chosen.