ResearchTrend.AI
Mind's Eye of LLMs: Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models

4 April 2024
Wenshan Wu
Shaoguang Mao
Yadong Zhang
Yan Xia
Li Dong
Lei Cui
Furu Wei
    LRM

Papers citing "Mind's Eye of LLMs: Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models"

18 / 18 papers shown
Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation
Phillip Y. Lee
Jihyeon Je
Chanho Park
Mikaela Angelina Uy
Leonidas J. Guibas
Minhyuk Sung
LRM
41
0
0
24 Apr 2025
Collaborating Action by Action: A Multi-agent LLM Framework for Embodied Reasoning
Isadora White
Kolby Nottingham
Ayush Maniar
Max Robinson
Hansen Lillemark
Mehul Maheshwari
Lianhui Qin
Prithviraj Ammanabrolu
LLMAG
LM&Ro
101
0
0
24 Apr 2025
Research on Navigation Methods Based on LLMs
Anlong Zhang
Jianmin Ji
29
0
0
22 Apr 2025
A Call for New Recipes to Enhance Spatial Reasoning in MLLMs
Huanyu Zhang
Chengzu Li
Wenshan Wu
Shaoguang Mao
Yan Xia
Ivan Vulić
Z. Zhang
Liang Wang
T. Tan
Furu Wei
LRM
34
1
0
21 Apr 2025
GraphicBench: A Planning Benchmark for Graphic Design with Language Agents
Dayeon Ki
Tianyi Zhou
Marine Carpuat
Gang Wu
Puneet Mathur
Viswanathan Swaminathan
LLMAG
LM&Ro
48
0
0
15 Apr 2025
SafeMLRM: Demystifying Safety in Multi-modal Large Reasoning Models
Junfeng Fang
Y. Wang
Ruipeng Wang
Zijun Yao
Kun Wang
An Zhang
X. Wang
Tat-Seng Chua
AAML
LRM
60
2
0
09 Apr 2025
Text Speaks Louder than Vision: ASCII Art Reveals Textual Biases in Vision-Language Models
Zhaochen Wang
Yujun Cai
Zi Huang
Bryan Hooi
Yiwei Wang
Ming Yang
CoGe
VLM
67
0
0
02 Apr 2025
Geometric Reasoning in the Embedding Space
Jan Hůla
David Mojžíšek
Jiří Janeček
David Herel
Mikoláš Janota
31
0
0
02 Apr 2025
CrossWordBench: Evaluating the Reasoning Capabilities of LLMs and LVLMs with Controllable Puzzle Generation
Jixuan Leng
Chengsong Huang
Langlin Huang
Bill Yuchen Lin
William W. Cohen
Haohan Wang
Jiaxin Huang
LRM
34
0
0
30 Mar 2025
AlphaSpace: Enabling Robotic Actions through Semantic Tokenization and Symbolic Reasoning
Alan Dao
Dinh Bach Vu
Bui Quang Huy
55
0
0
24 Mar 2025
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Y. Wang
Shengqiong Wu
Y. Zhang
William Yang Wang
Ziwei Liu
Jiebo Luo
Hao Fei
LRM
74
7
0
16 Mar 2025
Dynamic Path Navigation for Motion Agents with LLM Reasoning
Yubo Zhao
Qi Wu
Yifan Wang
Yu-Wing Tai
Chi-Keung Tang
LRM
LLMAG
74
0
0
10 Mar 2025
VisRL: Intention-Driven Visual Perception via Reinforced Reasoning
Zhangquan Chen
Xufang Luo
Dongsheng Li
OffRL
LRM
58
3
0
10 Mar 2025
Do Multimodal Language Models Really Understand Direction? A Benchmark for Compass Direction Reasoning
Hang Yin
Zhifeng Lin
Xin Liu
Bin Sun
Kan Li
LRM
64
1
0
21 Dec 2024
VCBench: A Controllable Benchmark for Symbolic and Abstract Challenges in Video Cognition
Chenglin Li
Qianglong Chen
Zhi Li
Feng Tao
Yin Zhang
29
0
0
14 Nov 2024
Exploring the Potential of Large Language Models for Improving Digital Forensic Investigation Efficiency
Akila Wickramasekara
F. Breitinger
Mark Scanlon
39
7
0
29 Feb 2024
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sébastien Bubeck
Varun Chandrasekaran
Ronen Eldan
J. Gehrke
Eric Horvitz
...
Scott M. Lundberg
Harsha Nori
Hamid Palangi
Marco Tulio Ribeiro
Yi Zhang
ELM
AI4MH
AI4CE
ALM
197
2,232
0
22 Mar 2023
Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information?
Julia Rozanova
Deborah Ferreira
K. Dubba
Weiwei Cheng
Dell Zhang
André Freitas
LM&Ro
20
5
0
17 Sep 2021