Evaluating Spatial Understanding of Large Language Models

23 October 2023
Yutaro Yamada
Yihan Bao
Andrew Kyle Lampinen
Jungo Kasai
Ilker Yildirim
    LRM

Papers citing "Evaluating Spatial Understanding of Large Language Models"

24 / 24 papers shown
CartoMapQA: A Fundamental Benchmark Dataset Evaluating Vision-Language Models on Cartographic Map Understanding
H. Ung
Guillaume Habault
Yasutaka Nishimura
Hao Niu
Roberto Legaspi
...
Ryoichi Kojima
Masato Taya
Chihiro Ono
A. Minamikawa
Y. Liu
223
0
0
03 Dec 2025
SpatialBench: Benchmarking Multimodal Large Language Models for Spatial Cognition
Peiran Xu
Sudong Wang
Yao Zhu
Jianing Li
Yunjian Zhang
LRM
449
13
0
26 Nov 2025
SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards
Hunar Batra
Haoqin Tu
Hardy Chen
Yuanze Lin
Cihang Xie
Ronald Clark
OffRL, ReLM, LRM
417
7
0
10 Nov 2025
GOPLA: Generalizable Object Placement Learning via Synthetic Augmentation of Human Arrangement
Yao Zhong
Hanzhi Chen
Simon Schaefer
Anran Zhang
Stefan Leutenegger
325
0
0
16 Oct 2025
See it. Say it. Sorted: Agentic System for Compositional Diagram Generation
Hantao Zhang
Jingyang Liu
Ed Li
138
0
0
21 Aug 2025
Scene Graph-Guided Proactive Replanning for Failure-Resilient Embodied Agent
Che Rin Yu
Daewon Chae
Dabin Seo
Sangwon Lee
Hyeongwoo Im
Jinkyu Kim
194
1
0
15 Aug 2025
Evaluating the Ability of Large Language Models to Reason about Cardinal Directions, Revisited
Anthony G Cohn
Robert E Blackwell
LRM, ELM
195
2
0
16 Jul 2025
FloorplanQA: A Benchmark for Spatial Reasoning in LLMs using Structured Representations
Fedor Rodionov
Abdelrahman Eldesokey
Michael Birsak
John C. Femiani
Bernard Ghanem
Peter Wonka
LRM
342
8
0
10 Jul 2025
GIQ: Benchmarking 3D Geometric Reasoning of Vision Foundation Models with Simulated and Real Polyhedra
Mateusz Michalkiewicz
Anekha Sokhal
Tadeusz Michalkiewicz
Piotr Pawlikowski
Mahsa Baktashmotlagh
Varun Jampani
Guha Balakrishnan
351
2
0
09 Jun 2025
The World As Large Language Models See It: Exploring the reliability of LLMs in representing geographical features
Omid Reza Abbasi
Franz Welscher
Georg Weinberger
Johannes Scholz
288
2
0
30 May 2025
Seeing Isn't Orienting: A Cognitively Grounded Benchmark Reveals Systematic Orientation Failures in MLLMs Supplementary
Keanu Nichols
Nazia Tasnim
Yuting Yan
Nicholas Ikechukwu
Elva Zou
Deepti Ghadiyaram
Bryan A. Plummer
501
1
0
27 May 2025
AlphaSpace: Enabling Robotic Actions through Semantic Tokenization and Symbolic Reasoning
Alan Dao
Dinh Bach Vu
Bui Quang Huy
438
0
0
24 Mar 2025
Navigating Motion Agents in Dynamic and Cluttered Environments through LLM Reasoning
Yubo Zhao
Qi Wu
Yifan Wang
Yu-Wing Tai
Chi-Keung Tang
LLMAG, LRM
1.0K
0
0
10 Mar 2025
Factorio Learning Environment
Jack Hopkins
Mart Bakler
Akbir Khan
LRM, AI4CE, LLMAG
231
3
0
06 Mar 2025
BIG-Bench Extra Hard
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Mehran Kazemi
Bahare Fatemi
Hritik Bansal
John Palowitch
Chrysovalantis Anastasiou
...
Kate Olszewska
Yi Tay
Vinh Q. Tran
Quoc V. Le
Orhan Firat
ELM, LRM
783
85
0
26 Feb 2025
GRASP: A Grid-Based Benchmark for Evaluating Commonsense Spatial Reasoning
Zhisheng Tang
Mayank Kejriwal
LRM
373
8
0
20 Jan 2025
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces
Computer Vision and Pattern Recognition (CVPR), 2024
Jihan Yang
Shusheng Yang
Anjali W. Gupta
Rilyn Han
Li Fei-Fei
Saining Xie
LRM
582
471
0
18 Dec 2024
Evaluating Vision-Language Models as Evaluators in Path Planning
Computer Vision and Pattern Recognition (CVPR), 2024
Mohamed Aghzal
Xiang Yue
Erion Plaku
Ziyu Yao
LRM
725
7
0
27 Nov 2024
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics
Computer Vision and Pattern Recognition (CVPR), 2024
Chan Hee Song
Valts Blukis
Jonathan Tremblay
Stephen Tyree
Yu-Chuan Su
Stan Birchfield
1.1K
123
0
25 Nov 2024
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
International Conference on Learning Representations (ICLR), 2024
Davide Paglieri
Bartłomiej Cupiał
Samuel Coward
Ulyana Piterbarg
Maciej Wolczyk
...
Lerrel Pinto
Rob Fergus
Jakob Foerster
Jack Parker-Holder
Tim Rocktaschel
LLMAG, LRM
761
88
0
20 Nov 2024
Evaluating the Ability of Large Language Models to Reason about Cardinal Directions
Anthony G Cohn
Robert E Blackwell
335
17
0
24 Jun 2024
Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities
Sachit Menon
Richard Zemel
Carl Vondrick
LRM
439
11
0
20 Jun 2024
CityGPT: Empowering Urban Spatial Cognition of Large Language Models
Jie Feng
Tianhui Liu
Junbo Yan
Siqi Guo
Yuming Lin
Yong Li
473
40
0
20 Jun 2024
Scaffolding Language Learning via Multi-modal Tutoring Systems with Pedagogical Instructions
Conference on Algebraic Informatics (CAI), 2024
Zhengyuan Liu
Stella Xin Yin
Carolyn Lee
Nancy F. Chen
AI4Ed
201
33
0
04 Apr 2024