ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2401.08743
  4. Cited By
MMToM-QA: Multimodal Theory of Mind Question Answering
v1v2 (latest)

MMToM-QA: Multimodal Theory of Mind Question Answering

Annual Meeting of the Association for Computational Linguistics (ACL), 2024
16 January 2024
Chuanyang Jin
Yutong Wu
Jing Cao
Jiannan Xiang
Yen-Ling Kuo
Zhiting Hu
T. Ullman
Antonio Torralba
Joshua B. Tenenbaum
Tianmin Shu
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "MMToM-QA: Multimodal Theory of Mind Question Answering"

36 / 36 papers shown
Title
Mind the Motions: Benchmarking Theory-of-Mind in Everyday Body Language
Seungbeen Lee
Jinhong Jeong
Donghyun Kim
Yejin Son
Youngjae Yu
74
1
0
19 Nov 2025
Spot The Ball: A Benchmark for Visual Social Inference
Spot The Ball: A Benchmark for Visual Social Inference
Neha Balamurugan
Sarah Wu
Adam Chun
Gabe Gaw
Cristobal Eyzaguirre
Tobias Gerstenberg
LRM
87
0
0
31 Oct 2025
Integrating Machine Learning into Belief-Desire-Intention Agents: Current Advances and Open Challenges
Integrating Machine Learning into Belief-Desire-Intention Agents: Current Advances and Open Challenges
Andrea Agiollo
Andrea Omicini
LM&RoAI4CE
116
0
0
23 Oct 2025
Active Confusion Expression in Large Language Models: Leveraging World Models toward Better Social Reasoning
Active Confusion Expression in Large Language Models: Leveraging World Models toward Better Social Reasoning
Jialu Du
Guiyang Hou
Yihui Fu
Chen Wu
Wenqi Zhang
Yongliang Shen
Weiming Lu
LLMAGLRM
115
0
0
09 Oct 2025
Circuit Distillation
Circuit Distillation
Somin Wadhwa
Silvio Amir
Byron C. Wallace
113
0
0
29 Sep 2025
OnlineMate: An LLM-Based Multi-Agent Companion System for Cognitive Support in Online Learning
OnlineMate: An LLM-Based Multi-Agent Companion System for Cognitive Support in Online Learning
Xian Gao
Zongyun Zhang
Ting Liu
Yuzhuo Fu
LLMAG
167
0
0
18 Sep 2025
LVLMs are Bad at Overhearing Human Referential Communication
LVLMs are Bad at Overhearing Human Referential Communication
Zhengxiang Wang
Weiling Li
Panagiotis Kaliosis
Owen Rambow
Susan E. Brennan
101
1
0
15 Sep 2025
ToM-SSI: Evaluating Theory of Mind in Situated Social Interactions
ToM-SSI: Evaluating Theory of Mind in Situated Social Interactions
Matteo Bortoletto
Constantin Ruhdorfer
Andreas Bulling
103
0
0
05 Sep 2025
HumanPCR: Probing MLLM Capabilities in Diverse Human-Centric Scenes
HumanPCR: Probing MLLM Capabilities in Diverse Human-Centric Scenes
Keliang Li
Hongze Shen
Hao Shi
Ruibing Hou
Hong Chang
...
Wen Wang
Yiling Wu
Shihong Deng
Shiguang Shan
Xilin Chen
LRM
120
1
0
19 Aug 2025
What Do Agents Think One Another Want? Level-2 Inverse Games for Inferring Agents' Estimates of Others' Objectives
What Do Agents Think One Another Want? Level-2 Inverse Games for Inferring Agents' Estimates of Others' Objectives
Hamzah I. Khan
Jingqi Li
David Fridovich-Keil
96
0
0
05 Aug 2025
MOMENTS: A Comprehensive Multimodal Benchmark for Theory of Mind
MOMENTS: A Comprehensive Multimodal Benchmark for Theory of Mind
Emilio Villa-Cueva
Snegha A
Rendi Chevi
Jan Christian Blaise Cruz
Kareem Elzeky
Fermin Cristobal
Alham Fikri Aji
Skyler Wang
Amélie Reymond
Thamar Solorio
128
1
0
06 Jul 2025
SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions
SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions
Xianzhe Fan
Xuhui Zhou
Chuanyang Jin
Kolby Nottingham
Hao Zhu
Maarten Sap
175
2
0
29 Jun 2025
Language-Informed Synthesis of Rational Agent Models for Grounded Theory-of-Mind Reasoning On-The-Fly
Language-Informed Synthesis of Rational Agent Models for Grounded Theory-of-Mind Reasoning On-The-Fly
Lance Ying
Ryan Truong
Katherine M. Collins
Cedegao E. Zhang
Megan Wei
Tyler Brooke-Wilson
Tan Zhi-Xuan
Lionel Wong
J. Tenenbaum
LLMAG
128
5
0
20 Jun 2025
From Black Boxes to Transparent Minds: Evaluating and Enhancing the Theory of Mind in Multimodal Large Language Models
From Black Boxes to Transparent Minds: Evaluating and Enhancing the Theory of Mind in Multimodal Large Language Models
Xinyang Li
Siqi Liu
Bochao Zou
Jiansheng Chen
Huimin Ma
164
1
0
17 Jun 2025
M$^3$FinMeeting: A Multilingual, Multi-Sector, and Multi-Task Financial Meeting Understanding Evaluation Dataset
M3^33FinMeeting: A Multilingual, Multi-Sector, and Multi-Task Financial Meeting Understanding Evaluation Dataset
Jie Zhu
Junhui Li
Yalong Wen
Xiandong Li
Lifan Guo
Feng-Xiang Chen
179
0
0
03 Jun 2025
XToM: Exploring the Multilingual Theory of Mind for Large Language Models
XToM: Exploring the Multilingual Theory of Mind for Large Language Models
Chunkit Chan
Yauwai Yim
Hongchuan Zeng
Zhiying Zou
Xinyuan Cheng
...
Ginny Wong
Helmut Schmid
Hinrich Schütze
Simon See
Yangqiu Song
LRM
168
0
0
03 Jun 2025
Growing Through Experience: Scaling Episodic Grounding in Language Models
Growing Through Experience: Scaling Episodic Grounding in Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Chunhui Zhang
Sirui
Wang
Z. Ouyang
Xiangchi Yuan
Soroush Vosoughi
CLL
176
5
0
02 Jun 2025
Overcoming Multi-step Complexity in Multimodal Theory-of-Mind Reasoning: A Scalable Bayesian Planner
Overcoming Multi-step Complexity in Multimodal Theory-of-Mind Reasoning: A Scalable Bayesian Planner
Chunhui Zhang
Z. Ouyang
Kwonjoon Lee
Nakul Agarwal
Sean Dae Houlihan
Soroush Vosoughi
Shao-Yuan Lo
LRM
161
3
0
02 Jun 2025
TimeHC-RL: Temporal-aware Hierarchical Cognitive Reinforcement Learning for Enhancing LLMs' Social Intelligence
TimeHC-RL: Temporal-aware Hierarchical Cognitive Reinforcement Learning for Enhancing LLMs' Social Intelligence
Guiyang Hou
Xing Gao
Yuchuan Wu
Xiang Huang
Wenqi Zhang
...
Yongliang Shen
Jialu Du
Fei Huang
Yongbin Li
Weiming Lu
160
0
0
30 May 2025
Towards Dynamic Theory of Mind: Evaluating LLM Adaptation to Temporal Evolution of Human States
Towards Dynamic Theory of Mind: Evaluating LLM Adaptation to Temporal Evolution of Human StatesAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Yang Xiao
Jiashuo Wang
Qiancheng Xu
Changhe Song
Chunpu Xu
Yi Cheng
Wenjie Li
Pengfei Liu
383
5
0
23 May 2025
Language Models use Lookbacks to Track Beliefs
Language Models use Lookbacks to Track Beliefs
Nikhil Prakash
Natalie Shapira
Arnab Sen Sharma
Christoph Riedl
Yonatan Belinkov
Tamar Rott Shaham
David Bau
Atticus Geiger
KELM
255
7
0
20 May 2025
Measurement of LLM's Philosophies of Human Nature
Measurement of LLM's Philosophies of Human Nature
Minheng Ni
Ennan Wu
Zidong Gong
Zhiyong Yang
Linjie Li
Chung-Ching Lin
Kevin Qinghong Lin
Lijuan Wang
Wangmeng Zuo
288
0
0
03 Apr 2025
How Well Can Vison-Language Models Understand Humans' Intention? An Open-ended Theory of Mind Question Evaluation Benchmark
How Well Can Vison-Language Models Understand Humans' Intention? An Open-ended Theory of Mind Question Evaluation Benchmark
Ximing Wen
Mallika Mainali
Anik Sen
300
0
0
28 Mar 2025
PersuasiveToM: A Benchmark for Evaluating Machine Theory of Mind in Persuasive Dialogues
PersuasiveToM: A Benchmark for Evaluating Machine Theory of Mind in Persuasive Dialogues
Fangxu Yu
Lai Jiang
Shenyi Huang
Zhen Wu
Xinyu Dai
LLMAG
405
6
0
28 Feb 2025
Time-MQA: Time Series Multi-Task Question Answering with Context Enhancement
Time-MQA: Time Series Multi-Task Question Answering with Context EnhancementAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Yaxuan Kong
Yiyuan Yang
Yoontae Hwang
Wenjie Du
Stefan Zohren
Zhangyang Wang
Ming Jin
Qingsong Wen
AI4TS
284
27
0
26 Feb 2025
MimeQA: Towards Socially-Intelligent Nonverbal Foundation Models
MimeQA: Towards Socially-Intelligent Nonverbal Foundation Models
Hengzhi Li
Megan Tjandrasuwita
Yi R. Fung
Armando Solar-Lezama
Paul Pu Liang
381
5
0
23 Feb 2025
Standard Benchmarks Fail - Auditing LLM Agents in Finance Must Prioritize Risk
Standard Benchmarks Fail - Auditing LLM Agents in Finance Must Prioritize Risk
Zichen Chen
Jiaao Chen
Jianda Chen
Misha Sra
ELM
383
2
0
21 Feb 2025
Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models
Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models
Hyunwoo Kim
Melanie Sclar
Tan Zhi-Xuan
Lance Ying
Sydney Levine
Yang Liu
Joshua B. Tenenbaum
Yejin Choi
LRMLLMAG
214
11
0
17 Feb 2025
UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI
UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI
Fangwei Zhong
Kui Wu
Churan Wang
Hao Chen
Hai Ci
Zhoujun Li
Yizhou Wang
VGen
216
9
0
30 Dec 2024
Mind Your Theory: Theory of Mind Goes Deeper Than Reasoning
Mind Your Theory: Theory of Mind Goes Deeper Than ReasoningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Eitan Wagner
Nitay Alon
J. Barnby
Omri Abend
LRM
413
7
0
18 Dec 2024
Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Under Ambiguities
Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Under AmbiguitiesInternational Conference on Learning Representations (ICLR), 2024
Zheyuan Zhang
Fengyuan Hu
Jayjun Lee
Freda Shi
Parisa Kordjamshidi
Joyce Chai
Ziqiao Ma
330
34
0
22 Oct 2024
MuMA-ToM: Multi-modal Multi-Agent Theory of Mind
MuMA-ToM: Multi-modal Multi-Agent Theory of MindAAAI Conference on Artificial Intelligence (AAAI), 2024
Haojun Shi
Suyu Ye
Xinyu Fang
Chuanyang Jin
Leyla Isik
Yen-Ling Kuo
Tianmin Shu
LLMAG
369
30
0
22 Aug 2024
Understanding Epistemic Language with a Language-augmented Bayesian Theory of Mind
Understanding Epistemic Language with a Language-augmented Bayesian Theory of MindTransactions of the Association for Computational Linguistics (TACL), 2024
Lance Ying
Tan Zhi-Xuan
Lionel Wong
Vikash K. Mansinghka
J. Tenenbaum
209
1
0
21 Aug 2024
Advancing Social Intelligence in AI Agents: Technical Challenges and
  Open Questions
Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions
Leena Mathur
Paul Pu Liang
Louis-Philippe Morency
LLMAG
241
21
0
17 Apr 2024
GOMA: Proactive Embodied Cooperative Communication via Goal-Oriented Mental Alignment
GOMA: Proactive Embodied Cooperative Communication via Goal-Oriented Mental Alignment
Lance Ying
Kunal Jha
Shivam Aarya
Joshua B. Tenenbaum
Antonio Torralba
Tianmin Shu
206
19
0
17 Mar 2024
Language Models, Agent Models, and World Models: The LAW for Machine
  Reasoning and Planning
Language Models, Agent Models, and World Models: The LAW for Machine Reasoning and Planning
Zhiting Hu
Tianmin Shu
LLMAGLM&RoLRM
294
47
0
08 Dec 2023
1