ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.14150
  4. Cited By
DriveLM: Driving with Graph Visual Question Answering

DriveLM: Driving with Graph Visual Question Answering

17 January 2025
Chonghao Sima
Katrin Renz
Kashyap Chitta
L. Chen
Hanxue Zhang
Chengen Xie
Jens Beißwenger
Ping Luo
Andreas Geiger
Hongyang Li
ArXivPDFHTML

Papers citing "DriveLM: Driving with Graph Visual Question Answering"

50 / 136 papers shown
Title
DriveAgent: Multi-Agent Structured Reasoning with LLM and Multimodal Sensor Fusion for Autonomous Driving
DriveAgent: Multi-Agent Structured Reasoning with LLM and Multimodal Sensor Fusion for Autonomous Driving
Xinmeng Hou
Wuqi Wang
Long Yang
Hao Lin
Jinglun Feng
Haigen Min
Xiangmo Zhao
19
0
0
04 May 2025
CaRL: Learning Scalable Planning Policies with Simple Rewards
CaRL: Learning Scalable Planning Policies with Simple Rewards
Bernhard Jaeger
D. Dauner
Jens Beißwenger
Simon Gerstenecker
Kashyap Chitta
Andreas Geiger
36
0
0
24 Apr 2025
LangCoop: Collaborative Driving with Language
LangCoop: Collaborative Driving with Language
Xiangbo Gao
Yuheng Wu
Rujia Wang
Chenxi Liu
Yang Zhou
Zhengzhong Tu
VLM
29
0
0
18 Apr 2025
Explainable Scene Understanding with Qualitative Representations and Graph Neural Networks
Explainable Scene Understanding with Qualitative Representations and Graph Neural Networks
Nassim Belmecheri
A. Gotlieb
Nadjib Lazaar
Helge Spieker
GNN
32
0
0
17 Apr 2025
ReasonDrive: Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models
ReasonDrive: Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models
Amirhosein Chahe
Lifeng Zhou
LRM
28
0
0
14 Apr 2025
Planning Safety Trajectories with Dual-Phase, Physics-Informed, and Transportation Knowledge-Driven Large Language Models
Planning Safety Trajectories with Dual-Phase, Physics-Informed, and Transportation Knowledge-Driven Large Language Models
Rui Gan
Pei Li
Keke Long
Bocheng An
Junwei You
Keshu Wu
Bin Ran
21
0
0
06 Apr 2025
NuScenes-SpatialQA: A Spatial Understanding and Reasoning Benchmark for Vision-Language Models in Autonomous Driving
NuScenes-SpatialQA: A Spatial Understanding and Reasoning Benchmark for Vision-Language Models in Autonomous Driving
Kexin Tian
Jingrui Mao
Y. Zhang
Jiwan Jiang
Yang Zhou
Zhengzhong Tu
CoGe
60
0
0
04 Apr 2025
NuGrounding: A Multi-View 3D Visual Grounding Framework in Autonomous Driving
NuGrounding: A Multi-View 3D Visual Grounding Framework in Autonomous Driving
Fuhao Li
Huan Jin
Bin-Bin Gao
Liaoyuan Fan
Lihui Jiang
Long Zeng
57
0
0
28 Mar 2025
Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving
Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving
Yue Li
Meng Tian
Zhenyu Lin
Jiangtong Zhu
Dechang Zhu
Haiqiang Liu
Zining Wang
Yueyi Zhang
Zhiwei Xiong
Xinhai Zhao
CoGe
VLM
73
0
0
27 Mar 2025
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and Roadmap
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and Roadmap
Tong Nie
Jian-jun Sun
Wei Ma
50
1
0
27 Mar 2025
LangBridge: Interpreting Image as a Combination of Language Embeddings
LangBridge: Interpreting Image as a Combination of Language Embeddings
Jiaqi Liao
Yuwei Niu
Fanqing Meng
Hao Li
Changyao Tian
...
Dianqi Li
X. Zhu
Li Yuan
Jifeng Dai
Yu Cheng
MLLM
67
0
0
25 Mar 2025
AutoDrive-QA- Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models
AutoDrive-QA- Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models
Boshra Khalili
Andrew W.Smyth
ELM
50
0
0
20 Mar 2025
Can Large Vision Language Models Read Maps Like a Human?
Can Large Vision Language Models Read Maps Like a Human?
Shuo Xing
Zezhou Sun
Shuangyu Xie
Kaiyuan Chen
Yanjia Huang
Yuping Wang
Jiachen Li
Dezhen Song
Zhengzhong Tu
48
2
0
18 Mar 2025
Tracking Meets Large Multimodal Models for Driving Scenario Understanding
Tracking Meets Large Multimodal Models for Driving Scenario Understanding
Ayesha Ishaq
Jean Lahoud
F. Khan
Salman Khan
Hisham Cholakkal
Rao Muhammad Anwer
49
0
0
18 Mar 2025
RAD: Retrieval-Augmented Decision-Making of Meta-Actions with Vision-Language Models in Autonomous Driving
RAD: Retrieval-Augmented Decision-Making of Meta-Actions with Vision-Language Models in Autonomous Driving
Yujin Wang
Quanfeng Liu
Zhengxin Jiang
Tianyi Wang
Junfeng Jiao
Hongqing Chu
B. Gao
Hong Chen
51
1
0
18 Mar 2025
InsightDrive: Insight Scene Representation for End-to-End Autonomous Driving
InsightDrive: Insight Scene Representation for End-to-End Autonomous Driving
Ruiqi Song
Xianda Guo
Hangbin Wu
Qinggong Wei
Long Chen
50
1
0
17 Mar 2025
Road Rage Reasoning with Vision-language Models (VLMs): Task Definition and Evaluation Dataset
Yibing Weng
Yu Gu
Fuji Ren
54
0
0
14 Mar 2025
Centaur: Robust End-to-End Autonomous Driving with Test-Time Training
Chonghao Sima
Kashyap Chitta
Zhiding Yu
Shiyi Lan
Ping Luo
Andreas Geiger
H. Li
Jose M. Alvarez
46
1
0
14 Mar 2025
A Framework for a Capability-driven Evaluation of Scenario Understanding for Multimodal Large Language Models in Autonomous Driving
Tin Stribor Sohn
Philipp Reis
Maximilian Dillitzer
Johannes Bach
Jason J. Corso
Eric Sax
ELM
LRM
39
0
0
14 Mar 2025
Unlock the Power of Unlabeled Data in Language Driving Model
Unlock the Power of Unlabeled Data in Language Driving Model
Chaoqun Wang
Jie-jin Yang
Xiaobin Hong
Ruimao Zhang
35
0
0
13 Mar 2025
DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding
Ayesha Ishaq
Jean Lahoud
Ketan More
Omkar Thawakar
Ritesh Thawkar
...
F. Khan
Hisham Cholakkal
Ivan Laptev
Rao Muhammad Anwer
Salman Khan
LRM
57
0
0
13 Mar 2025
SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment
Katrin Renz
Long Chen
Elahe Arani
Oleg Sinavski
MLLM
57
0
0
12 Mar 2025
CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving
Changxing Liu
Genjia Liu
Z. Wang
Jinchang Yang
Siheng Chen
59
0
0
11 Mar 2025
Combating Partial Perception Deficit in Autonomous Driving with Multimodal LLM Commonsense
Yuting Hu
Chenhui Xu
Ruiyang Qin
Dancheng Liu
Amir Nassereldine
Yiyu Shi
Jinjun Xiong
32
0
0
10 Mar 2025
AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning
Bo Jiang
Shaoyu Chen
Qian Zhang
Wenyu Liu
Xinggang Wang
OffRL
LRM
VLM
61
2
0
10 Mar 2025
AutoSpatial: Visual-Language Reasoning for Social Robot Navigation through Efficient Spatial Reasoning Learning
Yangzhe Kong
Daeun Song
Jing Liang
Dinesh Manocha
Ziyu Yao
Xuesu Xiao
LRM
53
1
0
10 Mar 2025
Evaluation of Safety Cognition Capability in Vision-Language Models for Autonomous Driving
Enming Zhang
Peizhe Gong
Xingyuan Dai
Yisheng Lv
Q. Miao
MLLM
ELM
60
0
0
09 Mar 2025
BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving
Katharina Winter
Mark Azer
Fabian B. Flohr
48
0
0
05 Mar 2025
Towards Effective and Efficient Context-aware Nucleus Detection in Histopathology Whole Slide Images
Zhongyi Shui
Ruizhe Guo
Honglin Li
Yuxuan Sun
Yunlong Zhang
Chenglu Zhu
Jiatong Cai
Pingyi Chen
Yanzhou Su
Lin Yang
39
0
0
04 Mar 2025
SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models
J. Zhang
Xuan Yang
T. Wang
Yu Yao
Aleksandr Petiushko
B. Li
28
0
0
28 Feb 2025
HazardNet: A Small-Scale Vision Language Model for Real-Time Traffic Safety Detection at Edge Devices
HazardNet: A Small-Scale Vision Language Model for Real-Time Traffic Safety Detection at Edge Devices
M. Tami
Mohammed Elhenawy
Huthaifa I. Ashqar
31
0
0
27 Feb 2025
InVDriver: Intra-Instance Aware Vectorized Query-Based Autonomous Driving Transformer
InVDriver: Intra-Instance Aware Vectorized Query-Based Autonomous Driving Transformer
Bo-Wen Zhang
Heye Huang
Chunyang Liu
Yaqin Zhang
Zhenhua Xu
67
0
0
25 Feb 2025
VLM-E2E: Enhancing End-to-End Autonomous Driving with Multimodal Driver Attention Fusion
VLM-E2E: Enhancing End-to-End Autonomous Driving with Multimodal Driver Attention Fusion
Pei Liu
Haipeng Liu
Haichao Liu
Xin Liu
Jinxin Ni
Jun Ma
48
0
0
25 Feb 2025
Traffic Scene Generation from Natural Language Description for Autonomous Vehicles with Large Language Model
Traffic Scene Generation from Natural Language Description for Autonomous Vehicles with Large Language Model
Bo-Kai Ruan
Hao-Tang Tsui
Yung-Hui Li
Hong-Han Shuai
LM&Ro
68
4
0
20 Feb 2025
Fully Exploiting Vision Foundation Model's Profound Prior Knowledge for Generalizable RGB-Depth Driving Scene Parsing
Sicen Guo
Tianyou Wen
Chuang-Wei Liu
Qijun Chen
Rui Fan
50
0
0
10 Feb 2025
Vision-Integrated LLMs for Autonomous Driving Assistance : Human Performance Comparison and Trust Evaluation
Vision-Integrated LLMs for Autonomous Driving Assistance : Human Performance Comparison and Trust Evaluation
Namhee Kim
Woojin Park
33
0
0
06 Feb 2025
VLM-Assisted Continual learning for Visual Question Answering in Self-Driving
VLM-Assisted Continual learning for Visual Question Answering in Self-Driving
Yuxin Lin
Mengshi Qi
Liang Liu
Huadong Ma
CLL
30
1
0
02 Feb 2025
Social-LLaVA: Enhancing Robot Navigation through Human-Language Reasoning in Social Spaces
Social-LLaVA: Enhancing Robot Navigation through Human-Language Reasoning in Social Spaces
Amirreza Payandeh
Daeun Song
Mohammad Nazeri
Jing Liang
Praneel Mukherjee
Amir Hossain Raj
Yangzhe Kong
Dinesh Manocha
Xuesu Xiao
LM&Ro
LRM
67
5
0
17 Jan 2025
Embodied Scene Understanding for Vision Language Models via MetaVQA
Embodied Scene Understanding for Vision Language Models via MetaVQA
Weizhen Wang
Chenda Duan
Zhenghao Peng
Yuxin Liu
Bolei Zhou
LM&Ro
34
0
0
17 Jan 2025
Visual Large Language Models for Generalized and Specialized Applications
Yifan Li
Zhixin Lai
Wentao Bao
Zhen Tan
Anh Dao
Kewei Sui
Jiayi Shen
Dong Liu
Huan Liu
Yu Kong
VLM
83
10
0
06 Jan 2025
FASIONAD : FAst and Slow FusION Thinking Systems for Human-Like
  Autonomous Driving with Adaptive Feedback
FASIONAD : FAst and Slow FusION Thinking Systems for Human-Like Autonomous Driving with Adaptive Feedback
Kangan Qian
Zhikun Ma
Yangfan He
Ziang Luo
Tianyu Shi
...
Zheng Fu
Xinyu Jiao
Kun Jiang
D. Yang
Takafumi Matsumaru
AI4CE
59
0
0
27 Nov 2024
Generating Out-Of-Distribution Scenarios Using Language Models
Generating Out-Of-Distribution Scenarios Using Language Models
Erfan Aasi
Phat Nguyen
Shiva Sreeram
Guy Rosman
S. Karaman
Daniela Rus
OODD
71
2
0
25 Nov 2024
Monocular Lane Detection Based on Deep Learning: A Survey
Monocular Lane Detection Based on Deep Learning: A Survey
Xin He
Haiyun Guo
Kuan Zhu
Bingke Zhu
Xu Zhao
Jianwu Fang
J. T. Wang
85
0
0
25 Nov 2024
LaVida Drive: Vision-Text Interaction VLM for Autonomous Driving with Token Selection, Recovery and Enhancement
LaVida Drive: Vision-Text Interaction VLM for Autonomous Driving with Token Selection, Recovery and Enhancement
Siwen Jiao
Yangyi Fang
Baoyun Peng
Wangqun Chen
Bharadwaj Veeravalli
66
3
0
20 Nov 2024
MME-Finance: A Multimodal Finance Benchmark for Expert-level
  Understanding and Reasoning
MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning
Ziliang Gan
Yu Lu
D. Zhang
Haohan Li
Che Liu
...
Haipang Wu
Chaoyou Fu
Z. Xu
Rongjunchen Zhang
Yong Dai
39
0
0
05 Nov 2024
Precise Drive with VLM: First Prize Solution for PRCV 2024 Drive LM
  challenge
Precise Drive with VLM: First Prize Solution for PRCV 2024 Drive LM challenge
Bin Huang
Siyu Wang
Yuanpeng Chen
Yidan Wu
Hui Song
...
Jing Leng
Chengpeng Liang
Peng Xue
Junliang Zhang
Tiankun Zhao
AILaw
19
0
0
05 Nov 2024
Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map
Driving by the Rules: A Benchmark for Integrating Traffic Sign Regulations into Vectorized HD Map
Xinyuan Chang
Maixuan Xue
Xinran Liu
Zheng Pan
Xing Wei
32
1
0
31 Oct 2024
EMMA: End-to-End Multimodal Model for Autonomous Driving
EMMA: End-to-End Multimodal Model for Autonomous Driving
Jyh-Jing Hwang
Runsheng Xu
Hubert Lin
Wei-Chih Hung
Jingwei Ji
...
Benjamin Sapp
Yin Zhou
James Guo
Dragomir Anguelov
Mingxing Tan
VLM
LM&Ro
23
25
0
30 Oct 2024
Senna: Bridging Large Vision-Language Models and End-to-End Autonomous
  Driving
Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving
Bo Jiang
Shaoyu Chen
Bencheng Liao
Xingyu Zhang
Wei Yin
Qian Zhang
Chang Huang
W. Liu
X. Wang
VLM
MLLM
LRM
30
11
0
29 Oct 2024
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5%
  Parameters and 90% Performance
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance
Zhangwei Gao
Zhe Chen
Erfei Cui
Yiming Ren
Weiyun Wang
...
Lewei Lu
Tong Lu
Yu Qiao
Jifeng Dai
Wenhai Wang
VLM
54
16
0
21 Oct 2024
123
Next