v1v2v3 (latest)

Evaluating Spatial Understanding of Large Language Models

23 October 2023

ArXiv (abs)PDF HTML Github (16★)

Papers citing "Evaluating Spatial Understanding of Large Language Models"

24 / 24 papers shown

CartoMapQA: A Fundamental Benchmark Dataset Evaluating Vision-Language Models on Cartographic Map Understanding

...

223

03 Dec 2025

SpatialBench: Benchmarking Multimodal Large Language Models for Spatial Cognition

449

26 Nov 2025

SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards

417

10 Nov 2025

GOPLA: Generalizable Object Placement Learning via Synthetic Augmentation of Human Arrangement

325

16 Oct 2025

See it. Say it. Sorted: Agentic System for Compositional Diagram Generation

Hantao Zhang

Jingyang Liu

Ed Li

138

21 Aug 2025

Scene Graph-Guided Proactive Replanning for Failure-Resilient Embodied Agent

194

15 Aug 2025

Evaluating the Ability of Large Language Models to Reason about Cardinal Directions, Revisited

Anthony G Cohn

Robert E Blackwell

LRM ELM

195

16 Jul 2025

FloorplanQA: A Benchmark for Spatial Reasoning in LLMs using Structured Representations

Fedor Rodionov

Abdelrahman Eldesokey

342

10 Jul 2025

GIQ: Benchmarking 3D Geometric Reasoning of Vision Foundation Models with Simulated and Real Polyhedra

Mateusz Michalkiewicz

Anekha Sokhal

Tadeusz Michalkiewicz

351

09 Jun 2025

The World As Large Language Models See It: Exploring the reliability of LLMs in representing geographical features

288

30 May 2025

Seeing Isn't Orienting: A Cognitively Grounded Benchmark Reveals Systematic Orientation Failures in MLLMs Supplementary

501

27 May 2025

AlphaSpace: Enabling Robotic Actions through Semantic Tokenization and Symbolic Reasoning

Alan Dao

Dinh Bach Vu

Bui Quang Huy

438

24 Mar 2025

Navigating Motion Agents in Dynamic and Cluttered Environments through LLM Reasoning

1.0K

10 Mar 2025

Factorio Learning Environment

231

06 Mar 2025

BIG-Bench Extra HardAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Chrysovalantis Anastasiou

...

783

26 Feb 2025

GRASP: A Grid-Based Benchmark for Evaluating Commonsense Spatial Reasoning

Zhisheng Tang

Mayank Kejriwal

LRM

373

20 Jan 2025

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall SpacesComputer Vision and Pattern Recognition (CVPR), 2024

582

471

18 Dec 2024

Evaluating Vision-Language Models as Evaluators in Path PlanningComputer Vision and Pattern Recognition (CVPR), 2024

725

27 Nov 2024

RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for RoboticsComputer Vision and Pattern Recognition (CVPR), 2024

1.1K

123

25 Nov 2024

BALROG: Benchmarking Agentic LLM and VLM Reasoning On GamesInternational Conference on Learning Representations (ICLR), 2024

...

761

20 Nov 2024

Evaluating the Ability of Large Language Models to Reason about Cardinal Directions

Anthony G Cohn

Robert E Blackwell

335

24 Jun 2024

Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities

Sachit Menon

Richard Zemel

Carl Vondrick

LRM

439

20 Jun 2024

CityGPT: Empowering Urban Spatial Cognition of Large Language Models

473

20 Jun 2024

Scaffolding Language Learning via Multi-modal Tutoring Systems with Pedagogical InstructionsConference on Algebraic Informatics (CAI), 2024

Nancy F. Chen

201

04 Apr 2024