Evaluating Large Language Models in Theory of Mind Tasks

Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2023

4 February 2023

Michal Kosinski

LLMAG

LRM

Papers citing "Evaluating Large Language Models in Theory of Mind Tasks"

50 / 109 papers shown

Tacit Bidder-Side Collusion: Artificial Intelligence in Dynamic Auctions

Sriram Tolety

26 Nov 2025

Mind the Motions: Benchmarking Theory-of-Mind in Everyday Body Language

105

19 Nov 2025

From Passive to Persuasive: Steering Emotional Nuance in Human-AI Negotiation

Niranjan Chebrolu

Gerard Christopher Yeo

Kokil Jaidka

LLMSV

209

16 Nov 2025

Pruning as Regularization: Sensitivity-Aware One-Shot Pruning in ASR

Julian Irigoyen

Arthur Söhler

Andreas Søeborg Kirkedal

129

11 Nov 2025

Social Simulations with Large Language Model Risk Utopian Illusion

24 Oct 2025

Are Large Language Models Sensitive to the Motives Behind Communication?

164

22 Oct 2025

DPRF: A Generalizable Dynamic Persona Refinement Framework for Optimizing Behavior Alignment Between Personalized LLM Role-Playing Agents and Humans

329

16 Oct 2025

Doing Things with Words: Rethinking Theory of Mind Simulation in Large Language Models

A. Lombardi

Alessandro Lenci

LLMAG

133

15 Oct 2025

Do You Get the Hint? Benchmarking LLMs on the Board Game Concept

I. Gevers

Walter Daelemans

LRM

166

15 Oct 2025

142

29 Sep 2025

Infusing Theory of Mind into Socially Intelligent LLM Agents

1.6K

26 Sep 2025

LVLMs are Bad at Overhearing Human Referential Communication

135

15 Sep 2025

Preservation of Language Understanding Capabilities in Speech-aware Large Language Models

Marek Kubis

Paweł Skórzewski

Iwona Christop

Mateusz Czyżnikiewicz

188

15 Sep 2025

One Model, Two Minds: A Context-Gated Graph Learner that Recreates Human Biases

Shalima Binta Manir

Tim Oates

10 Sep 2025

Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth

250

04 Sep 2025

The Personality Illusion: Revealing Dissociation Between Self-Reports & Behavior in LLMs

331

03 Sep 2025

LLMs and their Limited Theory of Mind: Evaluating Mental State Annotations in Situated Dialogue

Katharine Kowalyshyn

Matthias Scheutz

100

02 Sep 2025

Bridging Minds and Machines: Toward an Integration of AI and Cognitive Science

108

28 Aug 2025

Who Sees What? Structured Thought-Action Sequences for Epistemic Reasoning in LLMs

120

20 Aug 2025

Large Language Models Do Not Simulate Human Psychology

161

09 Aug 2025

UAV-ON: A Benchmark for Open-World Object Goal Navigation with Aerial Agents

278

01 Aug 2025

Do Large Language Models Have a Planning Theory of Mind? Evidence from MindGames: a Multi-Step Persuasion Task

179

22 Jul 2025

Investigating VLM Hallucination from a Cognitive Psychology Perspective: A First Step Toward Interpretation with Intriguing Observations

154

03 Jul 2025

Visual Structures Helps Visual Reasoning: Addressing the Binding Problem in VLMs

Amirmohammad Izadi

Mohammad Ali Banayeeanzade

Fatemeh Askari

Ali Rahimiakbar

Mohammad Mahdi Vahedi

Hosein Hasani

M. Baghshah

LRM

250

27 Jun 2025

Language-Informed Synthesis of Rational Agent Models for Grounded Theory-of-Mind Reasoning On-The-Fly

150

20 Jun 2025

From Prompts to Constructs: A Dual-Validity Framework for LLM Research in Psychology

Zhicheng Lin

200

20 Jun 2025

PRISON: Unmasking the Criminal Potential of Large Language Models

247

19 Jun 2025

Can structural correspondences ground real world representational content in Large Language Models?

Iwan Williams

154

19 Jun 2025

Large Language Models are Near-Optimal Decision-Makers with a Non-Human Learning Behavior

179

19 Jun 2025

From Black Boxes to Transparent Minds: Evaluating and Enhancing the Theory of Mind in Multimodal Large Language Models

215

17 Jun 2025

Behavioral Generative Agents for Energy Operations

14 Jun 2025

UniToMBench: Integrating Perspective-Taking to Improve Theory of Mind in LLMs

Prameshwar Thiyagarajan

186

11 Jun 2025

LLM-D12: A Dual-Dimensional Scale of Instrumental and Relational Dependencies on Large Language Models

ACM Transactions on the Web (TWEB), 2025

215

07 Jun 2025

Can Vision Language Models Infer Human Gaze Direction? A Controlled Study

227

04 Jun 2025

Representations of Fact, Fiction and Forecast in Large Language Models: Epistemics and Attitudes

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

Meng Li

Michael Vrazitulis

David Schlangen

229

02 Jun 2025

Effects of Theory of Mind and Prosocial Beliefs on Steering Human-Aligned Behaviors of LLMs in Ultimatum Games

Neemesh Yadav

Palakorn Achananuparp

Jing Jiang

Ee-Peng Lim

LRM

112

30 May 2025

ValueSim: Generating Backstories to Model Individual Value Systems

196

28 May 2025

Large Language Models Miss the Multi-Agent Mark

312

27 May 2025

The Pragmatic Mind of Machines: Tracing the Emergence of Pragmatic Competence in Large Language Models

285

24 May 2025

Multi-Party Conversational Agents: A Survey

276

24 May 2025

DEL-ToM: Inference-Time Scaling for Theory-of-Mind Reasoning via Dynamic Epistemic Logic

218

22 May 2025

Language Models use Lookbacks to Track Beliefs

325

20 May 2025

Adversarial Testing in LLMs: Insights into Decision-Making Vulnerabilities

354

19 May 2025

PsyMem: Fine-grained psychological alignment and Explicit Memory Control for Advanced Role-Playing LLMs

373

19 May 2025

BeliefNest: A Joint Action Simulator for Embodied Agents with Theory of Mind

419

18 May 2025

AI-enhanced semantic feature norms for 786 concepts

195

15 May 2025

A large-scale evaluation of commonsense knowledge in humans and large language models

391

15 May 2025

Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models

279

03 May 2025

The Convergent Ethics of AI? Analyzing Moral Foundation Priorities in Large Language Models with a Multi-Framework Approach

220

27 Apr 2025

461

25 Apr 2025