2310.01386
Cited By
Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench
2 October 2023
Jen-tse Huang
Wenxuan Wang
E. Li
Man Ho Lam
Shujie Ren
Youliang Yuan
Wenxiang Jiao
Zhaopeng Tu
Michael R. Lyu
LM&MA
AI4MH
ALM
Papers citing
"Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench"
25 / 25 papers shown
Title
Humanizing LLMs: A Survey of Psychological Measurements with Tools, Datasets, and Human-Agent Applications
Wenhan Dong
Yuemeng Zhao
Zhen Sun
Yule Liu
Zifan Peng
...
Jun Wu
Ruiming Wang
Shengmin Xu
Xinyi Huang
Xinlei He
LLMAG
55
0
0
30 Apr 2025
Evaluating Implicit Bias in Large Language Models by Attacking From a Psychometric Perspective
Yuchen Wen
Keping Bi
Wei Chen
J. Guo
Xueqi Cheng
75
1
0
20 Feb 2025
Can LLMs make trade-offs involving stipulated pain and pleasure states?
Geoff Keeling
Winnie Street
Martyna Stachaczyk
Daria Zakharova
Iulia M. Comsa
Anastasiya Sakovych
Isabella Logothetis
Zejia Zhang
Blaise Agüera y Arcas
Jonathan Birch
33
0
0
01 Nov 2024
Jigsaw Puzzles: Splitting Harmful Questions to Jailbreak Large Language Models
Hao Yang
Lizhen Qu
Ehsan Shareghi
Gholamreza Haffari
AAML
34
1
0
15 Oct 2024
Assessment and manipulation of latent constructs in pre-trained language models using psychometric scales
Maor Reuben
Ortal Slobodin
Aviad Elyashar
Idan-Chaim Cohen
Orna Braun-Lewensohn
Odeya Cohen
Rami Puzis
38
0
0
29 Sep 2024
LLMs' ways of seeing User Personas
Swaroop Panda
19
1
0
23 Sep 2024
Enhancing AI-Driven Psychological Consultation: Layered Prompts with Large Language Models
Rafael Souza
Jia-Hao Lim
Alexander Davis
LM&MA
AI4MH
18
0
0
29 Aug 2024
The Better Angels of Machine Personality: How Personality Relates to LLM Safety
Jie M. Zhang
Dongrui Liu
Chao Qian
Ziyue Gan
Yong-jin Liu
Yu Qiao
Jing Shao
LLMAG
PILM
40
12
0
17 Jul 2024
Self-assessment, Exhibition, and Recognition: a Review of Personality in Large Language Models
Zhiyuan Wen
Yu Yang
Jiannong Cao
Haoming Sun
Ruosong Yang
Shuaiqi Liu
35
5
0
25 Jun 2024
Limited Ability of LLMs to Simulate Human Psychological Behaviours: a Psychometric Analysis
Nikolay B Petrov
Gregory Serapio-García
Jason Rentfrow
17
14
0
12 May 2024
SeSaMe: A Framework to Simulate Self-Reported Ground Truth for Mental Health Sensing Studies
Akshat Choube
V. D. Swain
Varun Mishra
45
1
0
25 Mar 2024
AgentGroupChat: An Interactive Group Chat Simulacra For Better Eliciting Emergent Behavior
Zhouhong Gu
Xiaoxuan Zhu
Haoran Guo
Lin Zhang
Yin Cai
...
Yifei Dai
Yan Gao
Yao Hu
Hongwei Feng
Yanghua Xiao
AI4CE
32
1
0
20 Mar 2024
Consistency Matters: Explore LLMs Consistency From a Black-Box Perspective
Fufangchen Zhao
Guoqiang Jin
Jiaheng Huang
Rui Zhao
Fei Tan
20
1
0
27 Feb 2024
An Empirical Study on Large Language Models in Accuracy and Robustness under Chinese Industrial Scenarios
Zongjie Li
Wenying Qiu
Pingchuan Ma
Yichen Li
You Li
Sijia He
Baozheng Jiang
Shuai Wang
Weixi Gu
13
2
0
27 Jan 2024
PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety
Zaibin Zhang
Yongting Zhang
Lijun Li
Hongzhi Gao
Lijun Wang
Huchuan Lu
Feng Zhao
Yu Qiao
Jing Shao
LLMAG
12
29
0
22 Jan 2024
Open Models, Closed Minds? On Agents Capabilities in Mimicking Human Personalities through Open Large Language Models
Lucio La Cava
Andrea Tagarelli
LLMAG
AI4CE
55
12
0
13 Jan 2024
CogGPT: Unleashing the Power of Cognitive Dynamics on Large Language Models
Yaojia Lv
Haojie Pan
Ruiji Fu
Ming Liu
Zhongyuan Wang
Bing Qin
17
5
0
06 Jan 2024
Challenging the Validity of Personality Tests for Large Language Models
Tom Sühr
Florian E. Dorner
Samira Samadi
Augustin Kelava
6
9
0
09 Nov 2023
InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews
Xintao Wang
Yunze Xiao
Jen-tse Huang
Siyu Yuan
Rui Xu
...
Ziang Leng
Wei Wang
Jiangjie Chen
Cheng Li
Yanghua Xiao
13
84
0
27 Oct 2023
Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench
Jen-tse Huang
Man Ho Adrian Lam
E. Li
Shujie Ren
Wenxuan Wang
Wenxiang Jiao
Zhaopeng Tu
Michael R. Lyu
43
40
0
07 Aug 2023
Position: AI Evaluation Should Learn from How We Test Humans
Yan Zhuang
Q. Liu
Yuting Ning
Wei Huang
Rui Lv
Zhenya Huang
Guanhao Zhao
Zheng-Wei Zhang
ELM
ALM
62
21
0
18 Jun 2023
Revisiting the Reliability of Psychological Scales on Large Language Models
Jen-tse Huang
Wenxuan Wang
Man Ho Lam
E. Li
Wenxiang Jiao
Michael R. Lyu
24
21
0
31 May 2023
Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4
Kent K. Chang
Mackenzie Cramer
Sandeep Soni
David Bamman
RALM
138
109
0
28 Apr 2023
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sébastien Bubeck
Varun Chandrasekaran
Ronen Eldan
J. Gehrke
Eric Horvitz
...
Scott M. Lundberg
Harsha Nori
Hamid Palangi
Marco Tulio Ribeiro
Yi Zhang
ELM
AI4MH
AI4CE
ALM
206
2,232
0
22 Mar 2023
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
291
4,048
0
24 May 2022