Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.02083
Cited By
Evaluating Large Language Models in Theory of Mind Tasks
4 February 2023
Michal Kosinskihttps://www.semanticscholar.org/me/account
LLMAG
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Evaluating Large Language Models in Theory of Mind Tasks"
50 / 59 papers shown
Title
Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models
Gracjan Góral
Alicja Ziarko
Piotr Miłoś
Michał Nauman
Maciej Wołczyk
Michał Kosiński
LRM
22
0
0
03 May 2025
The Convergent Ethics of AI? Analyzing Moral Foundation Priorities in Large Language Models with a Multi-Framework Approach
Chad Coleman
W. Russell Neuman
Ali Dasdan
Safinah Ali
Manan Shah
ELM
LRM
38
0
0
27 Apr 2025
AI Awareness
X. Li
Haoyuan Shi
Rongwu Xu
Wei Xu
54
0
0
25 Apr 2025
Sensitivity Meets Sparsity: The Impact of Extremely Sparse Parameter Patterns on Theory-of-Mind of Large Language Models
Yuheng Wu
Wentao Guo
Zirui Liu
Heng Ji
Zhaozhuo Xu
Denghui Zhang
33
0
0
05 Apr 2025
LLM Social Simulations Are a Promising Research Method
Jacy Reese Anthis
Ryan Liu
Sean M. Richardson
Austin C. Kozlowski
Bernard Koch
James A. Evans
Erik Brynjolfsson
Michael S. Bernstein
ALM
51
4
0
03 Apr 2025
Trapped by Expectations: Functional Fixedness in LLM-Enabled Chat Search
Jiqun Liu
Jamshed Karimnazarov
Ryen W. White
36
0
0
02 Apr 2025
ToM-RL: Reinforcement Learning Unlocks Theory of Mind in Small LLMs
Yi-Long Lu
Chunhui Zhang
Jiajun Song
Lifeng Fan
Wei Wang
OffRL
46
0
0
02 Apr 2025
The Mind in the Machine: A Survey of Incorporating Psychological Theories in LLMs
Zizhou Liu
Ziwei Gong
Lin Ai
Zheng Hui
Run Chen
Colin Wayne Leach
Michelle R. Greene
Julia Hirschberg
LLMAG
90
0
0
28 Mar 2025
Gricean Norms as a Basis for Effective Collaboration
Fardin Saad
Pradeep K. Murukannaiah
Munindar P. Singh
106
0
0
18 Mar 2025
MetaScale: Test-Time Scaling with Evolving Meta-Thoughts
Qin Liu
Wenxuan Zhou
Nan Xu
James Y. Huang
Fei-Yue Wang
Sheng Zhang
Hoifung Poon
M. Chen
LLMAG
ReLM
AI4Cl
LRM
90
1
0
17 Mar 2025
Re-evaluating Theory of Mind evaluation in large language models
Jennifer Hu
Felix Sosa
T. Ullman
40
0
0
28 Feb 2025
Towards properly implementing Theory of Mind in AI systems: An account of four misconceptions
Ramira van der Meulen
Rineke Verbrugge
Max van Duijn
41
0
0
28 Feb 2025
On Benchmarking Human-Like Intelligence in Machines
Lance Ying
K. M. Collins
L. Wong
Ilia Sucholutsky
Ryan Liu
Adrian Weller
Tianmin Shu
Thomas L. Griffiths
Joshua B. Tenenbaum
ALM
ELM
93
2
0
27 Feb 2025
AgentSociety: Large-Scale Simulation of LLM-Driven Generative Agents Advances Understanding of Human Behaviors and Society
J. Piao
Yuwei Yan
Jun Zhang
Nian Li
Junbo Yan
...
Fengli Xu
Fang Zhang
Ke Rong
Jun Su
Y. Li
AI4CE
73
8
0
12 Feb 2025
Mind Your Theory: Theory of Mind Goes Deeper Than Reasoning
Eitan Wagner
Nitay Alon
J. Barnby
Omri Abend
LRM
85
2
0
18 Dec 2024
Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning
Song Jiang
Da JU
Andrew Cohen
Sasha Mitts
Aaron Foss
Justine T Kao
Xian Li
Yuandong Tian
62
2
0
21 Nov 2024
Advancements and limitations of LLMs in replicating human color-word associations
Makoto Fukushima
Shusuke Eshita
Hiroshige Fukuhara
44
0
0
04 Nov 2024
RSA-Control: A Pragmatics-Grounded Lightweight Controllable Text Generation Framework
Yifan Wang
Vera Demberg
24
0
0
24 Oct 2024
Chatting with Bots: AI, Speech Acts, and the Edge of Assertion
Iwan Williams
Tim Bayne
34
1
0
22 Oct 2024
SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs
Yuling Gu
Oyvind Tafjord
Hyunwoo Kim
Jared Moore
Ronan Le Bras
Peter Clark
Yejin Choi
28
8
0
17 Oct 2024
Large Model Strategic Thinking, Small Model Efficiency: Transferring Theory of Mind in Large Language Models
Nunzio Lorè
Alireza Ilami
Babak Heydari
LRM
37
0
0
05 Aug 2024
Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models
Logan Cross
Violet Xiang
Agam Bhatia
Daniel L. K. Yamins
Nick Haber
LM&Ro
LRM
LLMAG
46
4
0
09 Jul 2024
LTLBench: Towards Benchmarks for Evaluating Temporal Logic Reasoning in Large Language Models
Weizhi Tang
Vaishak Belle
LRM
42
1
0
07 Jul 2024
Over the Edge of Chaos? Excess Complexity as a Roadblock to Artificial General Intelligence
Teo Susnjak
Timothy R. McIntosh
A. Barczak
N. Reyes
Tong Liu
Paul Watters
Malka N. Halgamuge
30
3
0
04 Jul 2024
Cactus: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory
Suyeon Lee
Sunghwan Kim
Minju Kim
Dongjin Kang
Dongil Yang
...
Seungbeen Lee
Kyoung-Mee Chung
Youngjae Yu
Dongha Lee
Jinyoung Yeo
32
5
0
03 Jul 2024
Self-Cognition in Large Language Models: An Exploratory Study
Dongping Chen
Jiawen Shi
Yao Wan
Pan Zhou
Neil Zhenqiang Gong
Lichao Sun
LRM
LLMAG
22
3
0
01 Jul 2024
Towards a Science Exocortex
Kevin G. Yager
74
0
0
24 Jun 2024
Large Language Models Assume People are More Rational than We Really are
Ryan Liu
Jiayi Geng
Joshua C. Peterson
Ilia Sucholutsky
Thomas L. Griffiths
63
16
0
24 Jun 2024
Through the Theory of Mind's Eye: Reading Minds with Multimodal Video Large Language Models
Zhawnen Chen
Tianchun Wang
Yizhou Wang
Michal Kosinski
Xiang Zhang
Yun Fu
Sheng R. Li
LRM
24
2
0
19 Jun 2024
Is persona enough for personality? Using ChatGPT to reconstruct an agent's latent personality from simple descriptions
Yongyi Ji
Zhisheng Tang
M. Kejriwal
29
4
0
18 Jun 2024
Tracking the perspectives of interacting language models
Hayden Helm
Brandon Duderstadt
Youngser Park
Carey E. Priebe
49
6
0
17 Jun 2024
Grammaticality Representation in ChatGPT as Compared to Linguists and Laypeople
Zhuang Qiu
Xufeng Duan
Zhenguang G. Cai
29
2
0
17 Jun 2024
The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models
Bolei Ma
Xinpeng Wang
Tiancheng Hu
Anna Haensch
Michael A. Hedderich
Barbara Plank
Frauke Kreuter
ALM
28
2
0
16 Jun 2024
A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners
Bowen Jiang
Yangxinyu Xie
Zhuoqun Hao
Xiaomeng Wang
Tanwi Mallick
Weijie J. Su
Camillo J. Taylor
Dan Roth
LRM
37
40
0
16 Jun 2024
Ollabench: Evaluating LLMs' Reasoning for Human-centric Interdependent Cybersecurity
Tam n. Nguyen
ELM
34
2
0
11 Jun 2024
Zero, Finite, and Infinite Belief History of Theory of Mind Reasoning in Large Language Models
Weizhi Tang
Vaishak Belle
LLMAG
LRM
16
1
0
07 Jun 2024
LLM-Generated Black-box Explanations Can Be Adversarially Helpful
R. Ajwani
Shashidhar Reddy Javaji
Frank Rudzicz
Zining Zhu
AAML
32
6
0
10 May 2024
ToM-LM: Delegating Theory of Mind Reasoning to External Symbolic Executors in Large Language Models
Weizhi Tang
Vaishak Belle
LRM
LLMAG
26
1
0
23 Apr 2024
Language Models as Critical Thinking Tools: A Case Study of Philosophers
Andre Ye
Jared Moore
Rose Novick
Amy X. Zhang
KELM
ELM
LRM
LLMAG
23
7
0
06 Apr 2024
Distributed agency in second language learning and teaching through generative AI
Robert Godwin-Jones
26
14
0
29 Mar 2024
How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments
Jen-tse Huang
E. Li
Man Ho Lam
Tian Liang
Wenxuan Wang
Youliang Yuan
Wenxiang Jiao
Xing Wang
Zhaopeng Tu
Michael R. Lyu
ELM
LLMAG
77
32
0
18 Mar 2024
Shall We Team Up: Exploring Spontaneous Cooperation of Competing LLM Agents
Zengqing Wu
Run Peng
Shuyuan Zheng
Qianying Liu
Xu Han
Brian Inhyuk Kwon
Makoto Onizuka
Shaojie Tang
Chuan Xiao
28
10
0
19 Feb 2024
Can Generative Agents Predict Emotion?
Ciaran Regan
Nanami Iwahashi
Shogo Tanaka
Mizuki Oka
10
0
0
06 Feb 2024
What should I say? -- Interacting with AI and Natural Language Interfaces
Mark Adkins
20
0
0
12 Jan 2024
Exploring the Frontiers of LLMs in Psychological Applications: A Comprehensive Review
Luoma Ke
Song Tong
Peng Cheng
Kaiping Peng
OffRL
LM&MA
51
18
0
03 Jan 2024
The Tyranny of Possibilities in the Design of Task-Oriented LLM Systems: A Scoping Survey
Dhruv Dhamani
Mary Lou Maher
27
1
0
29 Dec 2023
Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models
Tian Liang
Zhiwei He
Jen-tse Huang
Wenxuan Wang
Wenxiang Jiao
Rui Wang
Yujiu Yang
Zhaopeng Tu
Shuming Shi
Xing Wang
LLMAG
50
5
0
31 Oct 2023
Generative Language Models Exhibit Social Identity Biases
Tiancheng Hu
Yara Kyrychenko
Steve Rathje
Nigel Collier
S. V. D. Linden
Jon Roozenbeek
22
34
0
24 Oct 2023
The Cultural Psychology of Large Language Models: Is ChatGPT a Holistic or Analytic Thinker?
Chuanyang Jin
Songyang Zhang
Tianmin Shu
Zhihan Cui
LLMAG
AI4MH
17
4
0
28 Aug 2023
Playing repeated games with Large Language Models
Elif Akata
Lion Schulz
Julian Coda-Forno
Seong Joon Oh
Matthias Bethge
Eric Schulz
402
117
0
26 May 2023
1
2
Next