296
v1v2 (latest)

AMAP Agentic Planning Technical Report

AMAP AI Agent Team
Yulan Hu
Xiangwen Zhang
Sheng Ouyang
Hao Yi
Lu Xu
Qinglin Lang
Lide Tan
Xiang Cheng
Tianchen Ye
Zhicong Li
Ge Chen
Wenjin Yang
Zheng Pan
Shaopan Xiong
Siran Yang
Ju Huang
Yan Zhang
Jiamang Wang
Yong Liu
Yinfeng Huang
Ning Wang
Tucheng Lin
Xin Li
Ning Guo
Main:14 Pages
8 Figures
Bibliography:3 Pages
4 Tables
Appendix:11 Pages
Abstract

We present STAgent, an agentic large language model tailored for spatio-temporal understanding, designed to solve complex tasks such as constrained point-of-interest discovery and itinerary planning. STAgent is a specialized model capable of interacting with ten distinct tools within spatio-temporal scenarios, enabling it to explore, verify, and refine intermediate steps during complex reasoning. Notably, STAgent effectively preserves its general capabilities. We empower STAgent with these capabilities through three key contributions: (1) a stable tool environment that supports over ten domain-specific tools, enabling asynchronous rollout and training; (2) a hierarchical data curation framework that identifies high-quality data like a needle in a haystack, curating high-quality queries by retaining less than 1\% of the raw data, emphasizing both diversity and difficulty; and (3) a cascaded training recipe that starts with a seed SFT stage acting as a guardian to measure query difficulty, followed by a second SFT stage fine-tuned on queries with high certainty, and an ultimate RL stage that leverages data of low certainty. Initialized with Qwen3-30B-A3B to establish a strong SFT foundation and leverage insights into sample difficulty, STAgent yields promising performance on TravelBench while maintaining its general capabilities across a wide range of general benchmarks, thereby demonstrating the effectiveness of our proposed agentic model.

View on arXiv
Comments on this paper