Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.15298
Cited By
v1
v2
v3 (latest)
AgentThink: A Unified Framework for Tool-Augmented Chain-of-Thought Reasoning in Vision-Language Models for Autonomous Driving
21 May 2025
Kangan Qian
Sicong Jiang
Yang Zhong
Ziang Luo
Zilin Huang
Tianze Zhu
Kun Jiang
Mengmeng Yang
Zheng Fu
Jinyu Miao
Yining Shi
He Zhe Lim
Li Liu
Tianbao Zhou
Hongyi Wang
Huang Yu
Yifei Hu
Guang Li
Guang Chen
Hao Ye
Lijun Sun
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"AgentThink: A Unified Framework for Tool-Augmented Chain-of-Thought Reasoning in Vision-Language Models for Autonomous Driving"
34 / 34 papers shown
Title
Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future Prospects
Yixin Cui
Haotian Lin
Shuo Yang
Yixiao Wang
Yanjun Huang
Hong Chen
LM&Ro
LRM
ELM
121
0
0
26 May 2025
LightEMMA: Lightweight End-to-End Multimodal Model for Autonomous Driving
Zhijie Qiao
Haowei Li
Zhong Cao
Henry X. Liu
VLM
171
16
0
01 May 2025
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
Jingyi Zhang
Jiaxing Huang
Huanjin Yao
Shunyu Liu
Xikun Zhang
Shijian Lu
Dacheng Tao
LRM
145
73
0
17 Mar 2025
DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding
Ayesha Ishaq
Jean Lahoud
Ketan More
Omkar Thawakar
Ritesh Thawkar
...
Fahad Shahbaz Khan
Hisham Cholakkal
Ivan Laptev
Rao Muhammad Anwer
Salman Khan
LRM
123
4
0
13 Mar 2025
FASIONAD++ : Integrating High-Level Instruction and Information Bottleneck in FAt-Slow fusION Systems for Enhanced Safety in Autonomous Driving with Adaptive Feedback
Kangan Qian
Ziang Luo
Sicong Jiang
Zilin Huang
Jinyu Miao
...
Jiangbo Yu
Xinyu Jiao
Mengmeng Yang
Kun Jiang
Ke Wang
105
2
0
11 Mar 2025
AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning
Bo Jiang
Shaoyu Chen
Qian Zhang
Wenyu Liu
Xinggang Wang
OffRL
LRM
VLM
159
12
0
10 Mar 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM
VLM
OffRL
AI4TS
LRM
392
2,024
0
22 Jan 2025
DriveLM: Driving with Graph Visual Question Answering
Chonghao Sima
Katrin Renz
Kashyap Chitta
Lawrence Yunliang Chen
Hanxue Zhang
Chengen Xie
Jens Beißwenger
Ping Luo
Andreas Geiger
Hongyang Li
291
207
0
17 Jan 2025
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs
Omkar Thawakar
Dinura Dissanayake
Ketan More
Ritesh Thawkar
Ahmed Heakl
...
Hisham Cholakkal
Ivan Laptev
Mubarak Shah
Fahad Shahbaz Khan
Salman Khan
VLM
LRM
127
58
0
10 Jan 2025
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives
Shaoyuan Xie
Lingdong Kong
Yuhao Dong
Chonghao Sima
Wenwei Zhang
Qi Alfred Chen
Ziwei Liu
Liang Pan
379
20
0
08 Jan 2025
VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving
Zilin Huang
Zihao Sheng
Yansong Qu
Junwei You
Sikai Chen
VLM
149
11
0
20 Dec 2024
Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning
Wenwen Zhuang
Xin Huang
Xiantao Zhang
Jin Zeng
LRM
123
31
0
16 Aug 2024
LLaVA-OneVision: Easy Visual Task Transfer
Bo Li
Yuanhan Zhang
Dong Guo
Renrui Zhang
Feng Li
Hao Zhang
Kaichen Zhang
Yanwei Li
Ziwei Liu
Chunyuan Li
MLLM
SyDa
VLM
169
865
0
06 Aug 2024
NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking
D. Dauner
Marcel Hallgarten
Tianyu Li
Xinshuo Weng
Zhiyu Huang
...
Igor Gilitschenski
Boris Ivanovic
Marco Pavone
Andreas Geiger
Kashyap Chitta
113
63
0
21 Jun 2024
Ovis: Structural Embedding Alignment for Multimodal Large Language Model
Shiyin Lu
Yang Li
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
Han-Jia Ye
VLM
MLLM
138
55
0
31 May 2024
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning
Shihao Wang
Zhiding Yu
Xiaohui Jiang
Shiyi Lan
Min Shi
Nadine Chang
Jan Kautz
Ying Li
Jose M. Alvarez
LRM
99
48
0
02 May 2024
PLUTO: Pushing the Limit of Imitation Learning-based Planning for Autonomous Driving
Jie Cheng
Ying-Cong Chen
Qifeng Chen
VLM
111
32
0
22 Apr 2024
DriveCoT: Integrating Chain-of-Thought Reasoning with End-to-End Driving
Tianqi Wang
Enze Xie
Ruihang Chu
Zhenguo Li
Ping Luo
LRM
87
20
0
25 Mar 2024
DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models
Xiaoyu Tian
Junru Gu
Bailin Li
Yicheng Liu
Yang Wang
Chenxu Hu
Kun Zhan
Peng Jia
Xianpeng Lang
Hang Zhao
VLM
206
164
0
19 Feb 2024
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Zhihong Shao
Peiyi Wang
Qihao Zhu
Runxin Xu
Jun-Mei Song
...
Haowei Zhang
Mingchuan Zhang
Yiming Li
Yu-Huan Wu
Daya Guo
ReLM
LRM
200
1,288
0
05 Feb 2024
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models
Xinpeng Ding
Jinahua Han
Hang Xu
Xiaodan Liang
Wei Zhang
Xiaomeng Li
108
47
0
02 Jan 2024
DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving
Wenhai Wang
Jiangwei Xie
ChuanYang Hu
Haoming Zou
Jianan Fan
...
Lewei Lu
Xizhou Zhu
Xiaogang Wang
Yu Qiao
Jifeng Dai
92
146
0
14 Dec 2023
Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving
Ming-Jun Nie
Renyuan Peng
Chunwei Wang
Xinyue Cai
Jianhua Han
Hang Xu
Li Zhang
LRM
104
60
0
06 Dec 2023
Dolphins: Multimodal Language Model for Driving
Yingzi Ma
Yulong Cao
Jiachen Sun
Marco Pavone
Chaowei Xiao
MLLM
109
64
0
01 Dec 2023
A Survey on Multimodal Large Language Models for Autonomous Driving
Can Cui
Yunsheng Ma
Xu Cao
Wenqian Ye
Yang Zhou
...
Xinrui Yan
Shuqi Mei
Jianguo Cao
Ziran Wang
Chao Zheng
169
290
0
21 Nov 2023
A Language Agent for Autonomous Driving
Jiageng Mao
Junjie Ye
Yuxi Qian
Marco Pavone
Yue Wang
LM&Ro
LRM
99
109
0
17 Nov 2023
Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving
Long Chen
Oleg Sinavski
Jan Hünermann
Alice Karnsund
Andrew James Willmott
Danny Birch
Daniel Maund
Jamie Shotton
MLLM
125
210
0
03 Oct 2023
GPT-Driver: Learning to Drive with GPT
Jiageng Mao
Yuxi Qian
Junjie Ye
Hang Zhao
Yue Wang
LRM
84
242
0
02 Oct 2023
DriveGPT4: Interpretable End-to-end Autonomous Driving via Large Language Model
Zhenhua Xu
Yujia Zhang
Enze Xie
Zhen Zhao
Yong Guo
Kwan-Yee. K. Wong
Zhenguo Li
Hengshuang Zhao
MLLM
130
307
0
02 Oct 2023
Drive Like a Human: Rethinking Autonomous Driving with Large Language Models
Daocheng Fu
Xin Li
Licheng Wen
Min Dou
Pinlong Cai
Botian Shi
Yu Qiao
83
174
0
14 Jul 2023
NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario
Tianwen Qian
Jingjing Chen
Linhai Zhuo
Yang Jiao
Yueping Jiang
93
158
0
24 May 2023
StreamingFlow: Streaming Occupancy Forecasting with Asynchronous Multi-modal Data Streams via Neural Ordinary Differential Equation
Yining Shi
Kun Jiang
Ke Wang
Jiusi Li
Yunlong Wang
Mengmeng Yang
Diange Yang
AI4TS
139
3
0
19 Feb 2023
Hello, It's GPT-2 -- How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems
Paweł Budzianowski
Ivan Vulić
72
310
0
12 Jul 2019
Textual Explanations for Self-Driving Vehicles
Jinkyu Kim
Anna Rohrbach
Trevor Darrell
John F. Canny
Zeynep Akata
79
348
0
30 Jul 2018
1