ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.03135
  4. Cited By
OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models
v1v2 (latest)

OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models

3 June 2025
Mengdi Jia
Zekun Qi
Shaochen Zhang
Wenyao Zhang
Xinqiang Yu
Jiawei He
He Wang
L. Yi
    LRMVLM
ArXiv (abs)PDFHTMLHuggingFace (37 upvotes)

Papers citing "OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models"

18 / 18 papers shown
Reasoning Path and Latent State Analysis for Multi-view Visual Spatial Reasoning: A Cognitive Science Perspective
Reasoning Path and Latent State Analysis for Multi-view Visual Spatial Reasoning: A Cognitive Science Perspective
Qiyao Xue
Weichen Liu
Shiqi Wang
Haoming Wang
Yuyang Wu
Wei Gao
LRM
93
0
0
02 Dec 2025
Geometrically-Constrained Agent for Spatial Reasoning
Geometrically-Constrained Agent for Spatial Reasoning
Zeren Chen
Xiaoya Lu
Zhijie Zheng
Pengrui Li
Lehan He
Yijin Zhou
Jing Shao
Bohan Zhuang
Lu Sheng
LRM
121
0
0
27 Nov 2025
DualVLA: Building a Generalizable Embodied Agent via Partial Decoupling of Reasoning and Action
DualVLA: Building a Generalizable Embodied Agent via Partial Decoupling of Reasoning and Action
Zhen Fang
Zhuoyang Liu
Jiaming Liu
Hao Chen
Y. Zeng
Shiting Huang
Zehui Chen
L. Chen
Shanghang Zhang
Feng Zhao
LRM
112
3
0
27 Nov 2025
G$^2$VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
G2^22VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
Wenbo Hu
Jingli Lin
Yilin Long
Yunlong Ran
Lihan Jiang
Y. Wang
Chenming Zhu
Runsen Xu
Tai Wang
Jiangmiao Pang
VLM
295
0
0
26 Nov 2025
Video2Layout: Recall and Reconstruct Metric-Grounded Cognitive Map for Spatial Reasoning
Yibin Huang
Wang Xu
Wanyue Zhang
Helu Zhi
JingJing Huang
Yangbin Xu
Yangang Sun
Conghui Zhu
Tiejun Zhao
205
0
0
20 Nov 2025
FlexiCup: Wireless Multimodal Suction Cup with Dual-Zone Vision-Tactile Sensing
FlexiCup: Wireless Multimodal Suction Cup with Dual-Zone Vision-Tactile Sensing
Junhao Gong
Shoujie Li
Kit-Wa Sou
Changqing Guo
Hourong Huang
...
Yifan Xie
Chenxin Liang
Chuqiao Lyu
Xiaojun Liang
Wenbo Ding
153
2
0
18 Nov 2025
Spatial Reasoning in Multimodal Large Language Models: A Survey of Tasks, Benchmarks and Methods
Weichen Liu
Qiyao Xue
Haoming Wang
Xiangyu Yin
Boyuan Yang
Wei Gao
117
1
0
14 Nov 2025
Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark
Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark
Ziyu Guo
Xinyan Chen
Renrui Zhang
Ruichuan An
Yu Qi
Dongzhi Jiang
Xiangtai Li
M. Zhang
Jiaming Song
Pheng-Ann Heng
VGenLRM
202
14
0
30 Oct 2025
Pelican-VL 1.0: A Foundation Brain Model for Embodied Intelligence
Pelican-VL 1.0: A Foundation Brain Model for Embodied Intelligence
Yi Zhang
Che Liu
Xiancong Ren
Hanchu Ni
Shuai Zhang
...
Z. Xu
Bin Shen
Qifan Wang
Jian Tang
Xiaozhu Ju
160
1
0
30 Oct 2025
Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks
Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks
Xu Zheng
Zihao Dongfang
Lutao Jiang
Boyuan Zheng
Yulong Guo
...
L. Zhang
Danda Pani Paudel
Nicu Sebe
Luc Van Gool
Xuming Hu
LRMVLM
731
5
0
29 Oct 2025
Seeing Across Views: Benchmarking Spatial Reasoning of Vision-Language Models in Robotic Scenes
Seeing Across Views: Benchmarking Spatial Reasoning of Vision-Language Models in Robotic Scenes
Zhiyuan Feng
Zhaolu Kang
Qijie Wang
Zhiying Du
Jiongrui Yan
...
Shawn Chen
Sicheng Xu
Yaobo Liang
Jiaolong Yang
B. Guo
160
1
0
22 Oct 2025
Spatial-DISE: A Unified Benchmark for Evaluating Spatial Reasoning in Vision-Language Models
Spatial-DISE: A Unified Benchmark for Evaluating Spatial Reasoning in Vision-Language Models
Xinmiao Huang
Qisong He
Zhenglin Huang
Boxuan Wang
Zhuoyun Li
Guangliang Cheng
Yi Dong
Xiaowei Huang
CoGe
282
0
0
15 Oct 2025
Automated Repeatable Adversary Threat Emulation with Effects Language (EL)
Automated Repeatable Adversary Threat Emulation with Effects Language (EL)
Suresh Damodaran
Paul D. Rowe
AAML
141
10
0
07 Oct 2025
How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective
How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective
S. Yu
Yuxin Chen
Hao Ju
Lianjie Jia
Fuxi Zhang
...
Lin Song
Lijun Wang
Yanwei Li
Y. Shan
Huchuan Lu
LRM
324
12
0
23 Sep 2025
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Weiyun Wang
Zhangwei Gao
Lixin Gu
Hengjun Pu
Long Cui
...
Bowen Zhou
Kai Chen
Yu Qiao
Wenhai Wang
Gen Luo
MLLMLRM
306
298
0
25 Aug 2025
Holistic Evaluation of Multimodal LLMs on Spatial Intelligence
Holistic Evaluation of Multimodal LLMs on Spatial Intelligence
Zhongang Cai
Yubo Wang
Qingping Sun
Ruisi Wang
Chenyang Gu
...
Quan-ding Wang
Dahua Lin
Lei Yang
Dahua Lin
L. Yang
ELM
272
0
0
18 Aug 2025
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
Wenyao Zhang
Hongsi Liu
Zekun Qi
Yunnan Wang
X. Yu
...
He Wang
Dongbin Zhao
Li Yi
Wenjun Zeng
Xin Jin
VLM
230
51
0
06 Jul 2025
Positional Prompt Tuning for Efficient 3D Representation Learning
Positional Prompt Tuning for Efficient 3D Representation Learning
Shaochen Zhang
Zekun Qi
Runpei Dong
Xiuxiu Bai
Xing Wei
403
10
0
21 Aug 2024
1
Page 1 of 1