ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.14607
  4. Cited By
Can Large Vision Language Models Read Maps Like a Human?

Can Large Vision Language Models Read Maps Like a Human?

18 March 2025
Shuo Xing
Zezhou Sun
Shuangyu Xie
Kaiyuan Chen
Yanjia Huang
Yuping Wang
Jiachen Li
Dezhen Song
Zhengzhong Tu
ArXiv (abs)PDFHTMLHuggingFace (10 upvotes)

Papers citing "Can Large Vision Language Models Read Maps Like a Human?"

18 / 18 papers shown
Ariadne: A Controllable Framework for Probing and Extending VLM Reasoning Boundaries
Ariadne: A Controllable Framework for Probing and Extending VLM Reasoning Boundaries
Minghe shen
Zhuo Zhi
Chonghan Liu
Shuo Xing
Zhengzhong Tu
Che Liu
LRM
315
0
0
01 Nov 2025
RDD: Retrieval-Based Demonstration Decomposer for Planner Alignment in Long-Horizon Tasks
RDD: Retrieval-Based Demonstration Decomposer for Planner Alignment in Long-Horizon Tasks
Mingxuan Yan
Yuping Wang
Zechun Liu
Jiachen Li
165
2
0
16 Oct 2025
Q-Router: Agentic Video Quality Assessment with Expert Model Routing and Artifact Localization
Q-Router: Agentic Video Quality Assessment with Expert Model Routing and Artifact Localization
Shuo Xing
Soumik Dey
Mingyang Wu
Ashirbad Mishra
Naveen Ravipati
Binbin Li
Hansi Wu
Zhengzhong Tu
233
2
0
09 Oct 2025
RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning
RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning
Sicheng Feng
Kaiwen Tuo
Song Wang
Lingdong Kong
Jianke Zhu
Huan Wang
LRM
289
9
0
02 Oct 2025
MapIQ: Evaluating Multimodal Large Language Models for Map Question Answering
MapIQ: Evaluating Multimodal Large Language Models for Map Question Answering
Varun Srivastava
Fan Lei
Srija Mukhopadhyay
Vivek Gupta
Ross Maciejewski
199
1
0
15 Jul 2025
Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence
Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence
Yining Hong
Rui Sun
B. Li
Xingcheng Yao
Maxine Wu
Alexander Chien
Da Yin
Ying Nian Wu
Zhecan Wang
Kai-Wei Chang
LM&Ro
578
10
0
18 Jun 2025
Demystifying the Visual Quality Paradox in Multimodal Large Language Models
Demystifying the Visual Quality Paradox in Multimodal Large Language Models
Shuo Xing
Lanqing guo
Hongyuan Hua
Seoyoung Lee
Peiran Li
Yufei Wang
Zinan Lin
Zhengzhong Tu
VLM
408
1
0
18 Jun 2025
VLM@school -- Evaluation of AI image understanding on German middle school knowledge
VLM@school -- Evaluation of AI image understanding on German middle school knowledge
René Peinl
Vincent Tischler
CoGeVLM
354
1
0
13 Jun 2025
SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems
SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems
Peiran Li
Xinkai Zou
Zhuohang Wu
Ruifeng Li
Shuo Xing
...
Yuping Wang
Haoxi Li
Qin Yuan
Yingmo Zhang
Zhengzhong Tu
421
16
0
09 Jun 2025
ReasonMap: Towards Fine-Grained Visual Reasoning from Transit Maps
ReasonMap: Towards Fine-Grained Visual Reasoning from Transit Maps
Sicheng Feng
Song Wang
Shuyi Ouyang
Lingdong Kong
Zikai Song
Jianke Zhu
Huan Wang
Xinchao Wang
LRM
414
16
0
24 May 2025
The Role of Open-Source LLMs in Shaping the Future of GeoAI
The Role of Open-Source LLMs in Shaping the Future of GeoAI
Xiao Shi Huang
Zhengzhong Tu
X. Ye
Michael Goodchild
361
2
0
24 Apr 2025
UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving
UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving
Yuping Wang
Xiangyu Huang
Xiaokang Sun
Mingxuan Yan
Shuo Xing
Zhengzhong Tu
Jiachen Li
453
20
0
31 Mar 2025
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
Yang Sui
Yu-Neng Chuang
Guanchu Wang
Jiamu Zhang
Tianyi Zhang
...
Andrew Wen
Shaochen
Zhong
Hanjie Chen
Helen Zhou
OffRLReLMLRM
803
327
0
20 Mar 2025
Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization
Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization
Shuo Xing
Peiran Li
Peiran Li
Ruizheng Bai
Longji Xu
Chan-wei Hu
Chengxuan Qian
Huaxiu Yao
Zhengzhong Tu
624
23
0
18 Feb 2025
DriveLM: Driving with Graph Visual Question Answering
DriveLM: Driving with Graph Visual Question AnsweringEuropean Conference on Computer Vision (ECCV), 2023
Chonghao Sima
Katrin Renz
Kashyap Chitta
Lawrence Yunliang Chen
Hanxue Zhang
Chengen Xie
Jens Beißwenger
Ping Luo
Andreas Geiger
Hongyang Li
906
426
0
17 Jan 2025
Hallucination of Multimodal Large Language Models: A Survey
Hallucination of Multimodal Large Language Models: A Survey
Zechen Bai
Pichao Wang
Tianjun Xiao
Tong He
Zongbo Han
Zheng Zhang
Mike Zheng Shou
VLMLRM
785
330
0
29 Apr 2024
EqDrive: Efficient Equivariant Motion Forecasting with Multi-Modality for Autonomous Driving
EqDrive: Efficient Equivariant Motion Forecasting with Multi-Modality for Autonomous DrivingInternational Conference Robotics and Automation Engineering (ICRAE), 2023
Yuping Wang
Jier Chen
257
9
0
26 Oct 2023
Equivariant Map and Agent Geometry for Autonomous Driving Motion Prediction
Equivariant Map and Agent Geometry for Autonomous Driving Motion Prediction
Yuping Wang
Jier Chen
226
7
0
21 Oct 2023
1
Page 1 of 1