AGENT: A Benchmark for Core Psychological Reasoning

International Conference on Machine Learning (ICML), 2021

24 February 2021

Chuang Gan

Papers citing "AGENT: A Benchmark for Core Psychological Reasoning"

SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions

315

29 Jun 2025

From Black Boxes to Transparent Minds: Evaluating and Enhancing the Theory of Mind in Multimodal Large Language Models

296

17 Jun 2025

Overcoming Multi-step Complexity in Multimodal Theory-of-Mind Reasoning: A Scalable Bayesian Planner

239

02 Jun 2025

CHART-6: Human-Centered Evaluation of Data Visualization Understanding in Vision-Language Models

227

22 May 2025

Re-evaluating Theory of Mind evaluation in large language models

Philosophical transactions of the Royal Society of London. Series B, Biological sciences (Philos Trans R Soc Lond B Biol Sci), 2025

Jennifer Hu

Felix Sosa

T. Ullman

392

28 Feb 2025

Few-Shot Task Learning through Inverse Generative Modeling

Neural Information Processing Systems (NeurIPS), 2024

529

07 Nov 2024

EgoSocialArena: Benchmarking the Social Intelligence of Large Language Models from a First-person Perspective

377

08 Oct 2024

MARPLE: A Benchmark for Long-Horizon Inference

Neural Information Processing Systems (NeurIPS), 2024

Ruohan Zhang

Jiajun Wu

Tobias Gerstenberg

319

02 Oct 2024

Vision Language Models See What You Want but not What You See

Qingying Gao

Yijiang Li

Haiyun Lyu

Haoran Sun

Dezhi Luo

Hokin Deng

LRM VLM

598

01 Oct 2024

Pragmatic Embodied Spoken Instruction Following in Human-Robot Collaboration with Theory of Mind

372

17 Sep 2024

MuMA-ToM: Multi-modal Multi-Agent Theory of Mind

AAAI Conference on Artificial Intelligence (AAAI), 2024

Leyla Isik

Yen-Ling Kuo

Tianmin Shu

LLMAG

513

22 Aug 2024

Explicit Modelling of Theory of Mind for Belief Prediction in Nonverbal Social Interactions

392

09 Jul 2024

TimeToM: Temporal Space is the Key to Unlocking the Door of Large Language Models' Theory-of-Mind

Weiming Lu

245

01 Jul 2024

GOMA: Proactive Embodied Cooperative Communication via Goal-Oriented Mental Alignment

Lance Ying

Kunal Jha

Shivam Aarya

Joshua B. Tenenbaum

Antonio Torralba

Tianmin Shu

356

17 Mar 2024

Language Models Represent Beliefs of Self and Others

420

28 Feb 2024

Towards Unified Alignment Between Agents, Humans, and Environment

...

Peng Li

Yang Liu

373

12 Feb 2024

BDIQA: A New Dataset for Video Question Answering to Explore Cognitive Reasoning through Theory of Mind

AAAI Conference on Artificial Intelligence (AAAI), 2024

292

12 Feb 2024

MMToM-QA: Multimodal Theory of Mind Question Answering

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

Joshua B. Tenenbaum

429

16 Jan 2024

Neural Reasoning About Agents' Goals, Preferences, and Actions

AAAI Conference on Artificial Intelligence (AAAI), 2023

Matteo Bortoletto

Lei Shi

Andreas Bulling

295

12 Dec 2023

Robot Learning in the Era of Foundation Models: A Survey

462

24 Nov 2023

A Brain-inspired Theory of Collective Mind Model for Efficient Social Cooperation

IEEE Transactions on Artificial Intelligence (IEEE TAI), 2023

Yi Zeng

298

06 Nov 2023

Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

313

30 Oct 2023

Uncovering the Unseen: Discover Hidden Intentions by Micro-Behavior Graph Reasoning

ACM Multimedia (ACM MM), 2023

Zheng Wang

280

29 Aug 2023

The SocialAI School: Insights from Developmental Psychology Towards Artificial Socio-Cultural Agents

202

15 Jul 2023

The Neuro-Symbolic Inverse Planning Engine (NIPE): Modeling Probabilistic Social Inferences from Linguistic Inputs

392

25 Jun 2023

Understanding Social Reasoning in Language Models with Language Models

Neural Information Processing Systems (NeurIPS), 2023

450

195

21 Jun 2023

A Review on Machine Theory of Mind

190

21 Mar 2023

Large Language Models Fail on Trivial Alterations to Theory-of-Mind Tasks

T. Ullman

LRM

504

334

16 Feb 2023

Benchmarks for Automated Commonsense Reasoning: A Survey

ACM Computing Surveys (ACM Comput. Surv.), 2023

E. Davis

ELM LRM

443

09 Feb 2023

Memory-Augmented Theory of Mind Network

AAAI Conference on Artificial Intelligence (AAAI), 2023

240

17 Jan 2023

NOPA: Neurally-guided Online Probabilistic Assistance for Building Socially Intelligent Home Assistants

IEEE International Conference on Robotics and Automation (ICRA), 2023

Xavier Puig

Tianmin Shu

J. Tenenbaum

Antonio Torralba

172

12 Jan 2023

Solving the Baby Intuitions Benchmark with a Hierarchically Bayesian Theory of Mind

232

04 Aug 2022

Learning Latent Traits for Simulated Cooperative Driving Tasks

223

20 Jul 2022

Brain-inspired Graph Spiking Neural Networks for Commonsense Knowledge Representation and Reasoning

Yi Zeng

220

11 Jul 2022

Learning Theory of Mind via Dynamic Traits Attribution

Adaptive Agents and Multi-Agent Systems (AAMAS), 2022

163

17 Apr 2022

A Benchmark for Modeling Violation-of-Expectation in Physical Reasoning Across Event Categories

Cheston Tan

214

16 Nov 2021

ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind

285

15 Oct 2021

AVoE: A Synthetic 3D Dataset on Understanding Violation of Expectation for Artificial Cognition

Arijit Dasgupta

Jiafei Duan

M. Ang

Cheston Tan

324

12 Oct 2021

Towards A Measure Of General Machine Intelligence

Gautham Venkatasubramanian

414

24 Sep 2021

SPACE: A Simulator for Physical Interactions and Causal Learning in 3D Environments

Jiafei Duan

Samson Yu

Cheston Tan

210

13 Aug 2021

Baby Intuitions Benchmark (BIB): Discerning the goals, preferences, and actions of others

Neural Information Processing Systems (NeurIPS), 2021

434

23 Feb 2021