v1v2 (latest)

Cost-Awareness in Tree-Search LLM Planning: A Systematic Study

20 May 2025

Zihao Zhang

Fei Liu

Kenan Jiang

Shijia Pan

Shu Kai

Fei Liu

ArXiv (abs)PDF HTML

Main:7 Pages

5 Figures

Bibliography:3 Pages

4 Tables

Appendix:3 Pages

Abstract

Planning under resource constraints is central to real-world decision making, yet most large language model (LLM) planners assume uniform action costs. We systematically analyze whether tree-search LLM planners are cost-aware and whether they efficiently generate budget-feasible plans. In contrast to black-box prompting, explicit search trees expose intermediate decisions, node evaluations, and failure modes, which allows for controlled ablations of planner behavior. We study depth-first search, breadth-first search, Monte Carlo Tree Search, and bidirectional search within a unified framework. Our experiments show that existing tree-based LLM planners often struggle to find cost-optimal plans, and that additional search computation does not reliably improve optimality. Among the methods evaluated, bidirectional search achieves the best overall efficiency and success rate. MCTS achieves the highest optimality on short-horizon tasks. Tree-search planners are especially valuable for studying LLM planning because their reasoning steps are explicit, in contrast to plain LLMs that internalize planning dynamics through post-training trajectories. Our findings suggest that improving LLM planning under resource constraints will likely require new search algorithms, rather than solely scaling inference-time compute.

View on arXiv

Comments on this paper