ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.12321
  4. Cited By
AGENT: A Benchmark for Core Psychological Reasoning
v1v2v3v4 (latest)

AGENT: A Benchmark for Core Psychological Reasoning

International Conference on Machine Learning (ICML), 2021
24 February 2021
Tianmin Shu
Abhishek Bhandwaldar
Chuang Gan
Kevin A. Smith
Shari Liu
Dan Gutfreund
E. Spelke
J. Tenenbaum
T. Ullman
ArXiv (abs)PDFHTML

Papers citing "AGENT: A Benchmark for Core Psychological Reasoning"

41 / 41 papers shown
SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions
SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions
Xianzhe Fan
Xuhui Zhou
Chuanyang Jin
Kolby Nottingham
Hao Zhu
Maarten Sap
315
6
0
29 Jun 2025
From Black Boxes to Transparent Minds: Evaluating and Enhancing the Theory of Mind in Multimodal Large Language Models
From Black Boxes to Transparent Minds: Evaluating and Enhancing the Theory of Mind in Multimodal Large Language Models
Xinyang Li
Siqi Liu
Bochao Zou
Jiansheng Chen
Huimin Ma
296
2
0
17 Jun 2025
Overcoming Multi-step Complexity in Multimodal Theory-of-Mind Reasoning: A Scalable Bayesian Planner
Overcoming Multi-step Complexity in Multimodal Theory-of-Mind Reasoning: A Scalable Bayesian Planner
Chunhui Zhang
Z. Ouyang
Kwonjoon Lee
Nakul Agarwal
Sean Dae Houlihan
Soroush Vosoughi
Shao-Yuan Lo
LRM
239
4
0
02 Jun 2025
CHART-6: Human-Centered Evaluation of Data Visualization Understanding in Vision-Language Models
CHART-6: Human-Centered Evaluation of Data Visualization Understanding in Vision-Language Models
Arnav Verma
Kushin Mukherjee
Christopher Potts
Elisa Kreiss
Judith E. Fan
227
2
0
22 May 2025
Re-evaluating Theory of Mind evaluation in large language models
Re-evaluating Theory of Mind evaluation in large language modelsPhilosophical transactions of the Royal Society of London. Series B, Biological sciences (Philos Trans R Soc Lond B Biol Sci), 2025
Jennifer Hu
Felix Sosa
T. Ullman
392
14
0
28 Feb 2025
Few-Shot Task Learning through Inverse Generative Modeling
Few-Shot Task Learning through Inverse Generative ModelingNeural Information Processing Systems (NeurIPS), 2024
Aviv Netanyahu
Yilun Du
Antonia Bronars
Jyothish Pari
J. Tenenbaum
Tianmin Shu
Pulkit Agrawal
529
5
0
07 Nov 2024
EgoSocialArena: Benchmarking the Social Intelligence of Large Language Models from a First-person Perspective
EgoSocialArena: Benchmarking the Social Intelligence of Large Language Models from a First-person Perspective
Guiyang Hou
Wenqi Zhang
Yongliang Shen
Zeqi Tan
Sihao Shen
Weiming Lu
377
0
0
08 Oct 2024
MARPLE: A Benchmark for Long-Horizon Inference
MARPLE: A Benchmark for Long-Horizon InferenceNeural Information Processing Systems (NeurIPS), 2024
Emily Jin
Zhuoyi Huang
Jan-Philipp Fränken
Weiyu Liu
Hannah Cha
Erik Brockbank
Sarah Wu
Ruohan Zhang
Jiajun Wu
Tobias Gerstenberg
319
5
0
02 Oct 2024
Vision Language Models See What You Want but not What You See
Vision Language Models See What You Want but not What You See
Qingying Gao
Yijiang Li
Haiyun Lyu
Haoran Sun
Dezhi Luo
Hokin Deng
LRMVLM
598
11
0
01 Oct 2024
Pragmatic Embodied Spoken Instruction Following in Human-Robot Collaboration with Theory of Mind
Pragmatic Embodied Spoken Instruction Following in Human-Robot Collaboration with Theory of Mind
Lance Ying
Xinyi Li
Shivam Aarya
Yizirui Fang
Stefanie Tellex
J. Tenenbaum
Tianmin Shu
Joshua B. Tenenbaum
Tianmin Shu
LM&Ro
372
3
0
17 Sep 2024
MuMA-ToM: Multi-modal Multi-Agent Theory of Mind
MuMA-ToM: Multi-modal Multi-Agent Theory of MindAAAI Conference on Artificial Intelligence (AAAI), 2024
Haojun Shi
Suyu Ye
Xinyu Fang
Chuanyang Jin
Leyla Isik
Yen-Ling Kuo
Tianmin Shu
LLMAG
513
42
0
22 Aug 2024
Explicit Modelling of Theory of Mind for Belief Prediction in Nonverbal
  Social Interactions
Explicit Modelling of Theory of Mind for Belief Prediction in Nonverbal Social Interactions
Matteo Bortoletto
Constantin Ruhdorfer
Lei Shi
Andreas Bulling
392
7
0
09 Jul 2024
TimeToM: Temporal Space is the Key to Unlocking the Door of Large
  Language Models' Theory-of-Mind
TimeToM: Temporal Space is the Key to Unlocking the Door of Large Language Models' Theory-of-Mind
Guiyang Hou
Wenqi Zhang
Yongliang Shen
Linjuan Wu
Weiming Lu
LRMAI4CE
245
19
0
01 Jul 2024
GOMA: Proactive Embodied Cooperative Communication via Goal-Oriented Mental Alignment
GOMA: Proactive Embodied Cooperative Communication via Goal-Oriented Mental Alignment
Lance Ying
Kunal Jha
Shivam Aarya
Joshua B. Tenenbaum
Antonio Torralba
Tianmin Shu
356
19
0
17 Mar 2024
Language Models Represent Beliefs of Self and Others
Language Models Represent Beliefs of Self and Others
Wentao Zhu
Zhining Zhang
Yizhou Wang
MILMLRM
420
20
0
28 Feb 2024
Towards Unified Alignment Between Agents, Humans, and Environment
Towards Unified Alignment Between Agents, Humans, and Environment
Zonghan Yang
An Liu
Zijun Liu
Wenbing Huang
Fangzhou Xiong
...
Zhenhe Zhang
Ziyue Wang
Zhicheng Guo
Peng Li
Yang Liu
373
5
0
12 Feb 2024
BDIQA: A New Dataset for Video Question Answering to Explore Cognitive
  Reasoning through Theory of Mind
BDIQA: A New Dataset for Video Question Answering to Explore Cognitive Reasoning through Theory of MindAAAI Conference on Artificial Intelligence (AAAI), 2024
Yuanyuan Mao
Xin Lin
Qin Ni
Liang He
292
6
0
12 Feb 2024
MMToM-QA: Multimodal Theory of Mind Question Answering
MMToM-QA: Multimodal Theory of Mind Question AnsweringAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Chuanyang Jin
Yutong Wu
Jing Cao
Jiannan Xiang
Yen-Ling Kuo
Zhiting Hu
T. Ullman
Antonio Torralba
Joshua B. Tenenbaum
Tianmin Shu
429
76
0
16 Jan 2024
Neural Reasoning About Agents' Goals, Preferences, and Actions
Neural Reasoning About Agents' Goals, Preferences, and ActionsAAAI Conference on Artificial Intelligence (AAAI), 2023
Matteo Bortoletto
Lei Shi
Andreas Bulling
295
8
0
12 Dec 2023
Robot Learning in the Era of Foundation Models: A Survey
Robot Learning in the Era of Foundation Models: A Survey
Xuan Xiao
Jiahang Liu
Zhipeng Wang
Yanmin Zhou
Yong Qi
Qian Cheng
Bin He
Shuo Jiang
AI4CELM&Ro
462
51
0
24 Nov 2023
A Brain-inspired Theory of Collective Mind Model for Efficient Social
  Cooperation
A Brain-inspired Theory of Collective Mind Model for Efficient Social CooperationIEEE Transactions on Artificial Intelligence (IEEE TAI), 2023
Zhuoya Zhao
Feifei Zhao
Shiwen Wang
Yinqian Sun
Yi Zeng
298
5
0
06 Nov 2023
Towards A Holistic Landscape of Situated Theory of Mind in Large
  Language Models
Towards A Holistic Landscape of Situated Theory of Mind in Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ziqiao Ma
Jacob Sansom
Run Peng
Joyce Chai
313
32
0
30 Oct 2023
Uncovering the Unseen: Discover Hidden Intentions by Micro-Behavior
  Graph Reasoning
Uncovering the Unseen: Discover Hidden Intentions by Micro-Behavior Graph ReasoningACM Multimedia (ACM MM), 2023
Zhuo Zhou
Wenxuan Liu
Danni Xu
Zheng Wang
Jian Zhao
280
9
0
29 Aug 2023
The SocialAI School: Insights from Developmental Psychology Towards
  Artificial Socio-Cultural Agents
The SocialAI School: Insights from Developmental Psychology Towards Artificial Socio-Cultural Agents
Grgur Kovač
Rémy Portelas
Peter Ford Dominey
Pierre-Yves Oudeyer
202
28
0
15 Jul 2023
The Neuro-Symbolic Inverse Planning Engine (NIPE): Modeling
  Probabilistic Social Inferences from Linguistic Inputs
The Neuro-Symbolic Inverse Planning Engine (NIPE): Modeling Probabilistic Social Inferences from Linguistic Inputs
Lance Ying
Katherine M. Collins
Megan Wei
Cedegao E. Zhang
Tan Zhi-Xuan
Adrian Weller
J. Tenenbaum
L. Wong
392
22
0
25 Jun 2023
Understanding Social Reasoning in Language Models with Language Models
Understanding Social Reasoning in Language Models with Language ModelsNeural Information Processing Systems (NeurIPS), 2023
Kanishk Gandhi
Jan-Philipp Fränken
Tobias Gerstenberg
Noah D. Goodman
LRM
450
195
0
21 Jun 2023
A Review on Machine Theory of Mind
A Review on Machine Theory of Mind
Yuanyuan Mao
Shuang Liu
Pengshuai Zhao
Qin Ni
Xin Lin
Liang He
190
12
0
21 Mar 2023
Large Language Models Fail on Trivial Alterations to Theory-of-Mind
  Tasks
Large Language Models Fail on Trivial Alterations to Theory-of-Mind Tasks
T. Ullman
LRM
504
334
0
16 Feb 2023
Benchmarks for Automated Commonsense Reasoning: A Survey
Benchmarks for Automated Commonsense Reasoning: A SurveyACM Computing Surveys (ACM Comput. Surv.), 2023
E. Davis
ELMLRM
443
82
0
09 Feb 2023
Memory-Augmented Theory of Mind Network
Memory-Augmented Theory of Mind NetworkAAAI Conference on Artificial Intelligence (AAAI), 2023
D. Nguyen
Phuoc Nguyen
Hung Le
Kien Do
Svetha Venkatesh
T. Tran
240
6
0
17 Jan 2023
NOPA: Neurally-guided Online Probabilistic Assistance for Building
  Socially Intelligent Home Assistants
NOPA: Neurally-guided Online Probabilistic Assistance for Building Socially Intelligent Home AssistantsIEEE International Conference on Robotics and Automation (ICRA), 2023
Xavier Puig
Tianmin Shu
J. Tenenbaum
Antonio Torralba
172
27
0
12 Jan 2023
Solving the Baby Intuitions Benchmark with a Hierarchically Bayesian
  Theory of Mind
Solving the Baby Intuitions Benchmark with a Hierarchically Bayesian Theory of Mind
Tan Zhi-Xuan
Nishad Gothoskar
Falk Pollok
Dan Gutfreund
J. Tenenbaum
Vikash K. Mansinghka
232
13
0
04 Aug 2022
Learning Latent Traits for Simulated Cooperative Driving Tasks
Learning Latent Traits for Simulated Cooperative Driving Tasks
Jonathan A. DeCastro
Deepak Gopinath
Guy Rosman
Emily S. Sumner
Shabnam Hakimi
Simon Stent
223
0
0
20 Jul 2022
Brain-inspired Graph Spiking Neural Networks for Commonsense Knowledge
  Representation and Reasoning
Brain-inspired Graph Spiking Neural Networks for Commonsense Knowledge Representation and Reasoning
H. Fang
Yi Zeng
Jianbo Tang
Yuwei Wang
Yao Liang
Xin Liu
220
3
0
11 Jul 2022
Learning Theory of Mind via Dynamic Traits Attribution
Learning Theory of Mind via Dynamic Traits AttributionAdaptive Agents and Multi-Agent Systems (AAMAS), 2022
D. Nguyen
Phuoc Nguyen
Hung Le
Kien Do
Svetha Venkatesh
T. Tran
163
6
0
17 Apr 2022
A Benchmark for Modeling Violation-of-Expectation in Physical Reasoning
  Across Event Categories
A Benchmark for Modeling Violation-of-Expectation in Physical Reasoning Across Event Categories
Arijit Dasgupta
Jiafei Duan
M. Ang
Yi Lin
Su-hua Wang
R. Baillargeon
Cheston Tan
214
10
0
16 Nov 2021
ToM2C: Target-oriented Multi-agent Communication and Cooperation with
  Theory of Mind
ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind
Yuan-Fang Wang
Fangwei Zhong
Jing Xu
Yizhou Wang
LLMAG
285
99
0
15 Oct 2021
AVoE: A Synthetic 3D Dataset on Understanding Violation of Expectation
  for Artificial Cognition
AVoE: A Synthetic 3D Dataset on Understanding Violation of Expectation for Artificial Cognition
Arijit Dasgupta
Jiafei Duan
M. Ang
Cheston Tan
324
5
0
12 Oct 2021
Towards A Measure Of General Machine Intelligence
Towards A Measure Of General Machine Intelligence
Gautham Venkatasubramanian
Sibesh Kar
Abhimanyu Singh
Shubham Mishra
Dushyant Yadav
Shreyansh Chandak
ALMELM
414
2
0
24 Sep 2021
SPACE: A Simulator for Physical Interactions and Causal Learning in 3D
  Environments
SPACE: A Simulator for Physical Interactions and Causal Learning in 3D Environments
Jiafei Duan
Samson Yu
Cheston Tan
210
16
0
13 Aug 2021
Baby Intuitions Benchmark (BIB): Discerning the goals, preferences, and
  actions of others
Baby Intuitions Benchmark (BIB): Discerning the goals, preferences, and actions of othersNeural Information Processing Systems (NeurIPS), 2021
Kanishk Gandhi
Gala Stojnic
Brenden M. Lake
M. Dillon
434
55
0
23 Feb 2021
1
Page 1 of 1