Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2505.20728
Cited By
v1
v2
v3
v4 (latest)
Jigsaw-Puzzles: From Seeing to Understanding to Reasoning in Vision-Language Models
27 May 2025
Zesen Lyu
Dandan Zhang
Wei Ye
Fangdi Li
Zhihang Jiang
Yao Yang
ReLM
VLM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Jigsaw-Puzzles: From Seeing to Understanding to Reasoning in Vision-Language Models"
4 / 4 papers shown
GRAID: Enhancing Spatial Reasoning of VLMs Through High-Fidelity Data Generation
Karim Elmaaroufi
Liheng Lai
Justin Svegliato
Yutong Bai
Sanjit A. Seshia
Matei A. Zaharia
203
0
0
25 Oct 2025
Why Reinforcement Fine-Tuning Enables MLLMs Preserve Prior Knowledge Better: A Data Perspective
Zhihao Zhang
Qiaole Dong
Qi Zhang
Jun Zhao
Enyu Zhou
...
Yanwei Fu
Changzhi Sun
Tao Gui
Xuanjing Huang
Kai Chen
CLL
233
0
0
30 Jun 2025
CAPTURe: Evaluating Spatial Reasoning in Vision Language Models via Occluded Object Counting
Atin Pothiraj
Elias Stengel-Eskin
Jaemin Cho
Joey Tianyi Zhou
401
19
0
21 Apr 2025
LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?
Kexian Tang
Junyao Gao
Yanhong Zeng
Haodong Duan
Yanan Sun
Zhening Xing
Wenran Liu
Kaifeng Lyu
Kai-xiang Chen
ELM
LRM
447
31
0
25 Mar 2025
1
Page 1 of 1