2312.09245
DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving
14 December 2023
Wenhai Wang
Jiangwei Xie
ChuanYang Hu
Haoming Zou
Jianan Fan
Wenwen Tong
Yang Wen
Silei Wu
Hanming Deng
Zhiqi Li
Hao Tian
Lewei Lu
Xizhou Zhu
Xiaogang Wang
Yu Qiao
Jifeng Dai
Papers citing
"DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving"
50 / 100 papers shown
Title
Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Zongchuang Zhao
Haoyu Fu
Dingkang Liang
Xin Zhou
Dingyuan Zhang
Hongwei Xie
Bing Wang
Xiang Bai
MLLM
VLM
30
0
0
13 May 2025
PADriver: Towards Personalized Autonomous Driving
Genghua Kou
Fan Jia
Weixin Mao
Y. Liu
Yucheng Zhao
Ziheng Zhang
Osamu Yoshie
Tiancai Wang
Y. Li
X. Zhang
38
0
0
08 May 2025
LightEMMA: Lightweight End-to-End Multimodal Model for Autonomous Driving
Zhijie Qiao
Haowei Li
Zhong Cao
Henry X. Liu
VLM
73
2
0
01 May 2025
Circinus: Efficient Query Planner for Compound ML Serving
Banruo Liu
Wei-Yu Lin
Minghao Fang
Yihan Jiang
Fan Lai
LRM
24
0
0
23 Apr 2025
Manipulating Multimodal Agents via Cross-Modal Prompt Injection
Le Wang
Zonghao Ying
Tianyuan Zhang
Siyuan Liang
Shengshan Hu
Mingchuan Zhang
A. Liu
Xianglong Liu
AAML
31
1
0
19 Apr 2025
LangCoop: Collaborative Driving with Language
Xiangbo Gao
Yuheng Wu
Rujia Wang
Chenxi Liu
Yang Zhou
Zhengzhong Tu
VLM
34
0
0
18 Apr 2025
CAFE-AD: Cross-Scenario Adaptive Feature Enhancement for Trajectory Planning in Autonomous Driving
Junrui Zhang
Chenjie Wang
Jie Peng
Haoyu Li
Jianmin Ji
Yu Zhang
Y. Zhang
31
0
0
09 Apr 2025
Planning Safety Trajectories with Dual-Phase, Physics-Informed, and Transportation Knowledge-Driven Large Language Models
Rui Gan
Pei Li
Keke Long
Bocheng An
Junwei You
Keshu Wu
Bin Ran
26
0
0
06 Apr 2025
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and Roadmap
Tong Nie
Jian-jun Sun
Wei Ma
58
1
0
27 Mar 2025
ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation
Haoyu Fu
Diankun Zhang
Zongchuang Zhao
Jianfeng Cui
Dingkang Liang
Chong Zhang
Dingyuan Zhang
Hongwei Xie
Bing Wang
Xiang Bai
38
1
0
25 Mar 2025
LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?
Kexian Tang
Junyao Gao
Yanhong Zeng
Haodong Duan
Yanan Sun
Zhening Xing
Wenran Liu
Kaifeng Lyu
Kai-xiang Chen
ELM
LRM
51
1
0
25 Mar 2025
Predicting the Road Ahead: A Knowledge Graph based Foundation Model for Scene Understanding in Autonomous Driving
Hongkuan Zhou
Stefan Schmid
Yicong Li
Lavdim Halilaj
Xiangtong Yao
Wei Cao
52
0
0
24 Mar 2025
AutoDrive-QA: Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models
Boshra Khalili
Andrew W. Smyth
ELM
52
0
0
20 Mar 2025
Aligning Multimodal LLM with Human Preference: A Survey
Tao Yu
Y. Zhang
Chaoyou Fu
Junkang Wu
Jinda Lu
...
Qingsong Wen
Z. Zhang
Yan Huang
Liang Wang
T. Tan
69
2
0
18 Mar 2025
RAD: Retrieval-Augmented Decision-Making of Meta-Actions with Vision-Language Models in Autonomous Driving
Yujin Wang
Quanfeng Liu
Zhengxin Jiang
Tianyi Wang
Junfeng Jiao
Hongqing Chu
B. Gao
Hong Chen
56
1
0
18 Mar 2025
Hydra-MDP++: Advancing End-to-End Driving via Expert-Guided Hydra-Distillation
Kailin Li
Zhenxin Li
Shiyi Lan
Yuan Xie
Zhizhong Zhang
J. Liu
Zuxuan Wu
Zhiding Yu
Jose M. Alvarez
44
1
0
17 Mar 2025
Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training
Zhenxin Li
Shihao Wang
Shiyi Lan
Zhiding Yu
Zuxuan Wu
Jose M. Alvarez
43
1
0
15 Mar 2025
Unlock the Power of Unlabeled Data in Language Driving Model
Chaoqun Wang
Jie-jin Yang
Xiaobin Hong
Ruimao Zhang
40
0
0
13 Mar 2025
SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment
Katrin Renz
Long Chen
Elahe Arani
Oleg Sinavski
MLLM
62
0
0
12 Mar 2025
A Cascading Cooperative Multi-agent Framework for On-ramp Merging Control Integrating Large Language Models
Miao Zhang
Zhenlong Fang
Tianyi Wang
Q. Zhang
Shuai Lu
Junfeng Jiao
Tianyu Shi
AI4CE
48
4
0
11 Mar 2025
CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving
Changxing Liu
Genjia Liu
Z. Wang
Jinchang Yang
Siheng Chen
62
0
0
11 Mar 2025
Advancing Autonomous Vehicle Intelligence: Deep Learning and Multimodal LLM for Traffic Sign Recognition and Robust Lane Detection
Chandan Kumar Sah
Ankit Kumar Shaw
Xiaoli Lian
Arsalan Shahid Baig
Tuopu Wen
Kun Jiang
Mengmeng Yang
D. Yang
26
1
0
08 Mar 2025
BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving
Katharina Winter
Mark Azer
Fabian B. Flohr
53
0
0
05 Mar 2025
Towards Effective and Efficient Context-aware Nucleus Detection in Histopathology Whole Slide Images
Zhongyi Shui
Ruizhe Guo
Honglin Li
Yuxuan Sun
Yunlong Zhang
Chenglu Zhu
Jiatong Cai
Pingyi Chen
Yanzhou Su
Lin Yang
44
0
0
04 Mar 2025
SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models
J. Zhang
Xuan Yang
T. Wang
Yu Yao
Aleksandr Petiushko
B. Li
33
0
0
28 Feb 2025
HazardNet: A Small-Scale Vision Language Model for Real-Time Traffic Safety Detection at Edge Devices
M. Tami
Mohammed Elhenawy
Huthaifa I. Ashqar
36
0
0
27 Feb 2025
VLM-E2E: Enhancing End-to-End Autonomous Driving with Multimodal Driver Attention Fusion
Pei Liu
Haipeng Liu
Haichao Liu
Xin Liu
Jinxin Ni
Jun Ma
53
0
0
25 Feb 2025
Traffic Scene Generation from Natural Language Description for Autonomous Vehicles with Large Language Model
Bo-Kai Ruan
Hao-Tang Tsui
Yung-Hui Li
Hong-Han Shuai
LM&Ro
76
4
0
20 Feb 2025
Vision-Integrated LLMs for Autonomous Driving Assistance: Human Performance Comparison and Trust Evaluation
Namhee Kim
Woojin Park
38
0
0
06 Feb 2025
DriveLM: Driving with Graph Visual Question Answering
Chonghao Sima
Katrin Renz
Kashyap Chitta
L. Chen
Hanxue Zhang
Chengen Xie
Jens Beißwenger
Ping Luo
Andreas Geiger
Hongyang Li
73
159
0
17 Jan 2025
Visual Large Language Models for Generalized and Specialized Applications
Yifan Li
Zhixin Lai
Wentao Bao
Zhen Tan
Anh Dao
Kewei Sui
Jiayi Shen
Dong Liu
Huan Liu
Yu Kong
VLM
83
10
0
06 Jan 2025
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Yue Zhang
Ziqiao Ma
Jialu Li
Yanyuan Qiao
Zun Wang
J. Chai
Qi Wu
Mohit Bansal
Parisa Kordjamshidi
LRM
51
17
0
31 Dec 2024
doScenes: An Autonomous Driving Dataset with Natural Language Instruction for Human Interaction and Vision-Language Navigation
Parthib Roy
Srinivasa Perisetla
Shashank Shriram
Harsha Krishnaswamy
Aryan Keskar
Ross Greer
VGen
72
2
0
08 Dec 2024
PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation
Ao Wang
Hui Chen
Jianchao Tan
K. Zhang
Xunliang Cai
Zijia Lin
J. Han
Guiguang Ding
VLM
77
3
0
04 Dec 2024
FASIONAD: FAst and Slow FusION Thinking Systems for Human-Like Autonomous Driving with Adaptive Feedback
Kangan Qian
Zhikun Ma
Yangfan He
Ziang Luo
Tianyu Shi
...
Zheng Fu
Xinyu Jiao
Kun Jiang
D. Yang
Takafumi Matsumaru
AI4CE
64
0
0
27 Nov 2024
Generating Out-Of-Distribution Scenarios Using Language Models
Erfan Aasi
Phat Nguyen
Shiva Sreeram
Guy Rosman
S. Karaman
Daniela Rus
OODD
76
4
0
25 Nov 2024
DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation
Tianyi Yan
Dongming Wu
Wencheng Han
Junpeng Jiang
Xia Zhou
Kun Zhan
Cheng-Zhong Xu
Jianbing Shen
30
3
0
18 Nov 2024
CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM
Jingwei Xu
Chenyu Wang
Zibo Zhao
Wen Liu
Yi-An Ma
Shenghua Gao
48
11
0
07 Nov 2024
Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving
Bo Jiang
Shaoyu Chen
Bencheng Liao
Xingyu Zhang
Wei Yin
Qian Zhang
Chang Huang
W. Liu
X. Wang
VLM
MLLM
LRM
35
11
0
29 Oct 2024
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance
Zhangwei Gao
Zhe Chen
Erfei Cui
Yiming Ren
Weiyun Wang
...
Lewei Lu
Tong Lu
Yu Qiao
Jifeng Dai
Wenhai Wang
VLM
62
22
0
21 Oct 2024
LASER: Script Execution by Autonomous Agents for On-demand Traffic Simulation
Hao Gao
Jingyue Wang
Wenyang Fang
Jingwei Xu
Yunpeng Huang
Taolue Chen
Xiaoxing Ma
27
1
0
21 Oct 2024
Robust RL with LLM-Driven Data Synthesis and Policy Adaptation for Autonomous Driving
Sihao Wu
Jiaxu Liu
Xiangyu Yin
Guangliang Cheng
Xingyu Zhao
Meng Fang
Xinping Yi
Xiaowei Huang
20
0
0
16 Oct 2024
FTII-Bench: A Comprehensive Multimodal Benchmark for Flow Text with Image Insertion
Jiacheng Ruan
Yebin Yang
Zehao Lin
Feiyu Xiong
Zeyun Tang
Z. Li
Zhiyu Li
VLM
31
3
0
16 Oct 2024
MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation
Chenxi Wang
Xiang Chen
N. Zhang
Bozhong Tian
Haoming Xu
Shumin Deng
H. Chen
MLLM
LRM
21
4
0
15 Oct 2024
A Generalized Control Revision Method for Autonomous Driving Safety
Zehang Zhu
Yuning Wang
Tianqi Ke
Zeyu Han
Shaobing Xu
Qing Xu
John M. Dolan
Jianqiang Wang
23
0
0
23 Sep 2024
Enhancing LLM-based Autonomous Driving Agents to Mitigate Perception Attacks
Ruoyu Song
Muslum Ozgur Ozmen
Hyungsub Kim
Antonio Bianchi
Z. Berkay Celik
AAML
24
5
0
22 Sep 2024
LFP: Efficient and Accurate End-to-End Lane-Level Planning via Camera-LiDAR Fusion
Guoliang You
Xiaomeng Chu
Yifan Duan
Xingchen Li
Sha Zhang
Jianmin Ji
Yanyong Zhang
39
0
0
21 Sep 2024
From Words to Wheels: Automated Style-Customized Policy Generation for Autonomous Driving
Xu Han
Xianda Chen
Zhenghan Cai
Pinlong Cai
Meixin Zhu
Xiaowen Chu
31
1
0
18 Sep 2024
Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving
Kairui Ding
Boyuan Chen
Yuchen Su
Huan-ang Gao
Bu Jin
...
Wuqiang Zhang
Xiaohui Li
Paul Barsch
Hongyang Li
Hao Zhao
42
3
0
10 Sep 2024
Multimodal Large Language Model Driven Scenario Testing for Autonomous Vehicles
Qiujing Lu
Xuanhan Wang
Yiwei Jiang
Guangming Zhao
Mingyue Ma
Shuo Feng
33
5
0
10 Sep 2024