2102.12321
AGENT: A Benchmark for Core Psychological Reasoning
International Conference on Machine Learning (ICML), 2021
24 February 2021
Tianmin Shu
Abhishek Bhandwaldar
Chuang Gan
Kevin A. Smith
Shari Liu
Dan Gutfreund
E. Spelke
J. Tenenbaum
T. Ullman
Papers citing
"AGENT: A Benchmark for Core Psychological Reasoning"
41 / 41 papers shown
SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions
Xianzhe Fan
Xuhui Zhou
Chuanyang Jin
Kolby Nottingham
Hao Zhu
Maarten Sap
315
6
0
29 Jun 2025
From Black Boxes to Transparent Minds: Evaluating and Enhancing the Theory of Mind in Multimodal Large Language Models
Xinyang Li
Siqi Liu
Bochao Zou
Jiansheng Chen
Huimin Ma
296
2
0
17 Jun 2025
Overcoming Multi-step Complexity in Multimodal Theory-of-Mind Reasoning: A Scalable Bayesian Planner
Chunhui Zhang
Z. Ouyang
Kwonjoon Lee
Nakul Agarwal
Sean Dae Houlihan
Soroush Vosoughi
Shao-Yuan Lo
LRM
239
4
0
02 Jun 2025
CHART-6: Human-Centered Evaluation of Data Visualization Understanding in Vision-Language Models
Arnav Verma
Kushin Mukherjee
Christopher Potts
Elisa Kreiss
Judith E. Fan
227
2
0
22 May 2025
Re-evaluating Theory of Mind evaluation in large language models
Philosophical transactions of the Royal Society of London. Series B, Biological sciences (Philos Trans R Soc Lond B Biol Sci), 2025
Jennifer Hu
Felix Sosa
T. Ullman
392
14
0
28 Feb 2025
Few-Shot Task Learning through Inverse Generative Modeling
Neural Information Processing Systems (NeurIPS), 2024
Aviv Netanyahu
Yilun Du
Antonia Bronars
Jyothish Pari
J. Tenenbaum
Tianmin Shu
Pulkit Agrawal
529
5
0
07 Nov 2024
EgoSocialArena: Benchmarking the Social Intelligence of Large Language Models from a First-person Perspective
Guiyang Hou
Wenqi Zhang
Yongliang Shen
Zeqi Tan
Sihao Shen
Weiming Lu
377
0
0
08 Oct 2024
MARPLE: A Benchmark for Long-Horizon Inference
Neural Information Processing Systems (NeurIPS), 2024
Emily Jin
Zhuoyi Huang
Jan-Philipp Fränken
Weiyu Liu
Hannah Cha
Erik Brockbank
Sarah Wu
Ruohan Zhang
Jiajun Wu
Tobias Gerstenberg
319
5
0
02 Oct 2024
Vision Language Models See What You Want but not What You See
Qingying Gao
Yijiang Li
Haiyun Lyu
Haoran Sun
Dezhi Luo
Hokin Deng
LRM
VLM
598
11
0
01 Oct 2024
Pragmatic Embodied Spoken Instruction Following in Human-Robot Collaboration with Theory of Mind
Lance Ying
Xinyi Li
Shivam Aarya
Yizirui Fang
Stefanie Tellex
J. Tenenbaum
Tianmin Shu
LM&Ro
372
3
0
17 Sep 2024
MuMA-ToM: Multi-modal Multi-Agent Theory of Mind
AAAI Conference on Artificial Intelligence (AAAI), 2024
Haojun Shi
Suyu Ye
Xinyu Fang
Chuanyang Jin
Leyla Isik
Yen-Ling Kuo
Tianmin Shu
LLMAG
513
42
0
22 Aug 2024
Explicit Modelling of Theory of Mind for Belief Prediction in Nonverbal Social Interactions
Matteo Bortoletto
Constantin Ruhdorfer
Lei Shi
Andreas Bulling
392
7
0
09 Jul 2024
TimeToM: Temporal Space is the Key to Unlocking the Door of Large Language Models' Theory-of-Mind
Guiyang Hou
Wenqi Zhang
Yongliang Shen
Linjuan Wu
Weiming Lu
LRM
AI4CE
245
19
0
01 Jul 2024
GOMA: Proactive Embodied Cooperative Communication via Goal-Oriented Mental Alignment
Lance Ying
Kunal Jha
Shivam Aarya
Joshua B. Tenenbaum
Antonio Torralba
Tianmin Shu
356
19
0
17 Mar 2024
Language Models Represent Beliefs of Self and Others
Wentao Zhu
Zhining Zhang
Yizhou Wang
MILM
LRM
420
20
0
28 Feb 2024
Towards Unified Alignment Between Agents, Humans, and Environment
Zonghan Yang
An Liu
Zijun Liu
Wenbing Huang
Fangzhou Xiong
...
Zhenhe Zhang
Ziyue Wang
Zhicheng Guo
Peng Li
Yang Liu
373
5
0
12 Feb 2024
BDIQA: A New Dataset for Video Question Answering to Explore Cognitive Reasoning through Theory of Mind
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yuanyuan Mao
Xin Lin
Qin Ni
Liang He
292
6
0
12 Feb 2024
MMToM-QA: Multimodal Theory of Mind Question Answering
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Chuanyang Jin
Yutong Wu
Jing Cao
Jiannan Xiang
Yen-Ling Kuo
Zhiting Hu
T. Ullman
Antonio Torralba
Joshua B. Tenenbaum
Tianmin Shu
429
76
0
16 Jan 2024
Neural Reasoning About Agents' Goals, Preferences, and Actions
AAAI Conference on Artificial Intelligence (AAAI), 2023
Matteo Bortoletto
Lei Shi
Andreas Bulling
295
8
0
12 Dec 2023
Robot Learning in the Era of Foundation Models: A Survey
Xuan Xiao
Jiahang Liu
Zhipeng Wang
Yanmin Zhou
Yong Qi
Qian Cheng
Bin He
Shuo Jiang
AI4CE
LM&Ro
462
51
0
24 Nov 2023
A Brain-inspired Theory of Collective Mind Model for Efficient Social Cooperation
IEEE Transactions on Artificial Intelligence (IEEE TAI), 2023
Zhuoya Zhao
Feifei Zhao
Shiwen Wang
Yinqian Sun
Yi Zeng
298
5
0
06 Nov 2023
Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ziqiao Ma
Jacob Sansom
Run Peng
Joyce Chai
313
32
0
30 Oct 2023
Uncovering the Unseen: Discover Hidden Intentions by Micro-Behavior Graph Reasoning
ACM Multimedia (ACM MM), 2023
Zhuo Zhou
Wenxuan Liu
Danni Xu
Zheng Wang
Jian Zhao
280
9
0
29 Aug 2023
The SocialAI School: Insights from Developmental Psychology Towards Artificial Socio-Cultural Agents
Grgur Kovač
Rémy Portelas
Peter Ford Dominey
Pierre-Yves Oudeyer
202
28
0
15 Jul 2023
The Neuro-Symbolic Inverse Planning Engine (NIPE): Modeling Probabilistic Social Inferences from Linguistic Inputs
Lance Ying
Katherine M. Collins
Megan Wei
Cedegao E. Zhang
Tan Zhi-Xuan
Adrian Weller
J. Tenenbaum
L. Wong
392
22
0
25 Jun 2023
Understanding Social Reasoning in Language Models with Language Models
Neural Information Processing Systems (NeurIPS), 2023
Kanishk Gandhi
Jan-Philipp Fränken
Tobias Gerstenberg
Noah D. Goodman
LRM
450
195
0
21 Jun 2023
A Review on Machine Theory of Mind
Yuanyuan Mao
Shuang Liu
Pengshuai Zhao
Qin Ni
Xin Lin
Liang He
190
12
0
21 Mar 2023
Large Language Models Fail on Trivial Alterations to Theory-of-Mind Tasks
T. Ullman
LRM
504
334
0
16 Feb 2023
Benchmarks for Automated Commonsense Reasoning: A Survey
ACM Computing Surveys (ACM Comput. Surv.), 2023
E. Davis
ELM
LRM
443
82
0
09 Feb 2023
Memory-Augmented Theory of Mind Network
AAAI Conference on Artificial Intelligence (AAAI), 2023
D. Nguyen
Phuoc Nguyen
Hung Le
Kien Do
Svetha Venkatesh
T. Tran
240
6
0
17 Jan 2023
NOPA: Neurally-guided Online Probabilistic Assistance for Building Socially Intelligent Home Assistants
IEEE International Conference on Robotics and Automation (ICRA), 2023
Xavier Puig
Tianmin Shu
J. Tenenbaum
Antonio Torralba
172
27
0
12 Jan 2023
Solving the Baby Intuitions Benchmark with a Hierarchically Bayesian Theory of Mind
Tan Zhi-Xuan
Nishad Gothoskar
Falk Pollok
Dan Gutfreund
J. Tenenbaum
Vikash K. Mansinghka
232
13
0
04 Aug 2022
Learning Latent Traits for Simulated Cooperative Driving Tasks
Jonathan A. DeCastro
Deepak Gopinath
Guy Rosman
Emily S. Sumner
Shabnam Hakimi
Simon Stent
223
0
0
20 Jul 2022
Brain-inspired Graph Spiking Neural Networks for Commonsense Knowledge Representation and Reasoning
H. Fang
Yi Zeng
Jianbo Tang
Yuwei Wang
Yao Liang
Xin Liu
220
3
0
11 Jul 2022
Learning Theory of Mind via Dynamic Traits Attribution
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
D. Nguyen
Phuoc Nguyen
Hung Le
Kien Do
Svetha Venkatesh
T. Tran
163
6
0
17 Apr 2022
A Benchmark for Modeling Violation-of-Expectation in Physical Reasoning Across Event Categories
Arijit Dasgupta
Jiafei Duan
M. Ang
Yi Lin
Su-hua Wang
R. Baillargeon
Cheston Tan
214
10
0
16 Nov 2021
ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind
Yuan-Fang Wang
Fangwei Zhong
Jing Xu
Yizhou Wang
LLMAG
285
99
0
15 Oct 2021
AVoE: A Synthetic 3D Dataset on Understanding Violation of Expectation for Artificial Cognition
Arijit Dasgupta
Jiafei Duan
M. Ang
Cheston Tan
324
5
0
12 Oct 2021
Towards A Measure Of General Machine Intelligence
Gautham Venkatasubramanian
Sibesh Kar
Abhimanyu Singh
Shubham Mishra
Dushyant Yadav
Shreyansh Chandak
ALM
ELM
414
2
0
24 Sep 2021
SPACE: A Simulator for Physical Interactions and Causal Learning in 3D Environments
Jiafei Duan
Samson Yu
Cheston Tan
210
16
0
13 Aug 2021
Baby Intuitions Benchmark (BIB): Discerning the goals, preferences, and actions of others
Neural Information Processing Systems (NeurIPS), 2021
Kanishk Gandhi
Gala Stojnic
Brenden M. Lake
M. Dillon
434
55
0
23 Feb 2021