2301.01768
The political ideology of conversational AI: Converging evidence on ChatGPT's pro-environmental, left-libertarian orientation
5 January 2023
Jochen Hartmann
Jasper Schwenzow
Maximilian Witte
Papers citing "The political ideology of conversational AI: Converging evidence on ChatGPT's pro-environmental, left-libertarian orientation"
47 / 97 papers shown
Whose Emotions and Moral Sentiments Do Language Models Reflect?
Zihao He
Siyi Guo
Ashwin Rao
Kristina Lerman
39
12
0
16 Feb 2024
LLMAuditor: A Framework for Auditing Large Language Models Using Human-in-the-Loop
Maryam Amirizaniani
Jihan Yao
Adrian Lavergne
Elizabeth Snell Okada
Aman Chadha
Tanya Roosta
Chirag Shah
HILM
28
2
0
14 Feb 2024
A Roadmap to Pluralistic Alignment
Taylor Sorensen
Jared Moore
Jillian R. Fisher
Mitchell L. Gordon
Niloofar Mireshghallah
...
Liwei Jiang
Ximing Lu
Nouha Dziri
Tim Althoff
Yejin Choi
65
80
0
07 Feb 2024
Behind the Screen: Investigating ChatGPT's Dark Personality Traits and Conspiracy Beliefs
Erik Weber
Jérôme Rutinowski
Markus Pauly
31
2
0
06 Feb 2024
The Political Preferences of LLMs
David Rozado
38
36
0
02 Feb 2024
WARM: On the Benefits of Weight Averaged Reward Models
Alexandre Ramé
Nino Vieillard
Léonard Hussenot
Robert Dadashi
Geoffrey Cideron
Olivier Bachem
Johan Ferret
106
93
0
22 Jan 2024
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems
Tianyu Cui
Yanling Wang
Chuanpu Fu
Yong Xiao
Sijia Li
...
Junwu Xiong
Xinyu Kong
Zujie Wen
Ke Xu
Qi Li
57
56
0
11 Jan 2024
From Bytes to Biases: Investigating the Cultural Self-Perception of Large Language Models
Wolfgang Messner
Tatum Greene
Josephine Matalone
21
4
0
21 Dec 2023
Generative AI in Higher Education: Seeing ChatGPT Through Universities' Policies, Resources, and Guidelines
Hui Wang
Anh Dang
Zihao Wu
Son Mac
19
28
0
08 Dec 2023
Exploring the Jungle of Bias: Political Bias Attribution in Language Models via Dependency Analysis
David F. Jenny
Yann Billeter
Mrinmaya Sachan
Bernhard Schölkopf
Zhijing Jin
20
2
0
15 Nov 2023
InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews
Xintao Wang
Yunze Xiao
Jen-tse Huang
Siyu Yuan
Rui Xu
...
Ziang Leng
Wei Wang
Jiangjie Chen
Cheng Li
Yanghua Xiao
23
84
0
27 Oct 2023
Multilingual Coarse Political Stance Classification of Media. The Editorial Line of a ChatGPT and Bard Newspaper
Cristina España-Bonet
6
8
0
25 Oct 2023
What Makes it Ok to Set a Fire? Iterative Self-distillation of Contexts and Rationales for Disambiguating Defeasible Social and Moral Situations
Kavel Rao
Liwei Jiang
Valentina Pyatkin
Yuling Gu
Niket Tandon
Nouha Dziri
Faeze Brahman
Yejin Choi
26
15
0
24 Oct 2023
Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models
Wenxuan Wang
Wenxiang Jiao
Jingyuan Huang
Ruyi Dai
Jen-tse Huang
Zhaopeng Tu
Michael R. Lyu
51
27
0
19 Oct 2023
Compositional preference models for aligning LMs
Dongyoung Go
Tomasz Korbak
Germán Kruszewski
Jos Rozen
Marc Dymetman
21
15
0
17 Oct 2023
Large Language Model Soft Ideologization via AI-Self-Consciousness
F. Balabdaoui
Qian Wang
Alexander Henzi
Haixu Tang
Xiaozhong Liu
14
3
0
28 Sep 2023
Human-AI Interactions and Societal Pitfalls
Francisco Castro
Jian Gao
Sébastien Martin
20
2
0
19 Sep 2023
Generative AI
Stefan Feuerriegel
Jochen Hartmann
Christian Janiesch
Patrick Zschech
39
546
0
13 Sep 2023
Generative Social Choice
Sara Fish
Paul Gölz
David C. Parkes
Ariel D. Procaccia
Gili Rusak
Itai Shapira
Manuel Wüthrich
25
26
0
03 Sep 2023
What has ChatGPT read? The origins of archaeological citations used by a generative artificial intelligence application
D. Spennemann
16
2
0
07 Aug 2023
On the Trustworthiness Landscape of State-of-the-art Generative Models: A Survey and Outlook
Mingyuan Fan
Chengyu Wang
Cen Chen
Yang Liu
Jun Huang
HILM
31
3
0
31 Jul 2023
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Stephen Casper
Xander Davies
Claudia Shi
T. Gilbert
Jérémy Scheurer
...
Erdem Biyik
Anca Dragan
David M. Krueger
Dorsa Sadigh
Dylan Hadfield-Menell
ALM
OffRL
47
472
0
27 Jul 2023
Evaluating the Moral Beliefs Encoded in LLMs
Nino Scherrer
Claudia Shi
Amir Feder
David M. Blei
27
116
0
26 Jul 2023
How User Language Affects Conflict Fatality Estimates in ChatGPT
Daniel Kazenwadel
C. Steinert
17
1
0
26 Jul 2023
A Survey on Evaluation of Large Language Models
Yu-Chu Chang
Xu Wang
Jindong Wang
Yuanyi Wu
Linyi Yang
...
Yue Zhang
Yi-Ju Chang
Philip S. Yu
Qian Yang
Xingxu Xie
ELM
LM&MA
ALM
63
1,510
0
06 Jul 2023
Towards Measuring the Representation of Subjective Global Opinions in Language Models
Esin Durmus
Karina Nguyen
Thomas I. Liao
Nicholas Schiefer
Amanda Askell
...
Alex Tamkin
Janel Thamkul
Jared Kaplan
Jack Clark
Deep Ganguli
33
205
0
28 Jun 2023
Apolitical Intelligence? Auditing Delphi's responses on controversial political issues in the US
J. H. Rystrøm
11
0
0
22 Jun 2023
Opportunities and Risks of LLMs for Scalable Deliberation with Polis
Christopher T. Small
Ivan Vendrov
Esin Durmus
Hadjar Homaei
Elizabeth Barry
Julien Cornebise
Ted Suzman
Deep Ganguli
Colin Megill
24
26
0
20 Jun 2023
Questioning the Survey Responses of Large Language Models
Ricardo Dominguez-Olmedo
Moritz Hardt
Celestine Mendler-Dünner
26
30
0
13 Jun 2023
Fairness-Sensitive Policy-Gradient Reinforcement Learning for Reducing Bias in Robotic Assistance
Jie Zhu
Mengsha Hu
Xueyao Liang
Amy Zhang
Ruoming Jin
Rui Liu
16
1
0
07 Jun 2023
I'm Afraid I Can't Do That: Predicting Prompt Refusal in Black-Box Generative Language Models
Max Reuter
William B. Schulze
26
4
0
06 Jun 2023
ChatGPT is a Remarkable Tool -- For Experts
A. Azaria
Rina Azoulay-Schwartz
S. Reches
22
58
0
02 Jun 2023
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets
Md Tahmid Rahman Laskar
M Saiful Bari
Mizanur Rahman
Md Amran Hossen Bhuiyan
Shafiq R. Joty
J. Huang
LM&MA
ELM
ALM
41
179
0
29 May 2023
Having Beer after Prayer? Measuring Cultural Bias in Large Language Models
Tarek Naous
Michael Joseph Ryan
Alan Ritter
Wei-ping Xu
28
85
0
23 May 2023
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation
Patrick Fernandes
Aman Madaan
Emmy Liu
António Farinhas
Pedro Henrique Martins
...
José G. C. de Souza
Shuyan Zhou
Tongshuang Wu
Graham Neubig
André F. T. Martins
ALM
117
56
0
01 May 2023
In ChatGPT We Trust? Measuring and Characterizing the Reliability of ChatGPT
Xinyue Shen
Z. Chen
Michael Backes
Yang Zhang
19
55
0
18 Apr 2023
The Self-Perception and Political Biases of ChatGPT
Jérôme Rutinowski
Sven Franke
Jan Endendyk
Ina Dormuth
Markus Pauly
33
97
0
14 Apr 2023
Summary of ChatGPT-Related Research and Perspective Towards the Future of Large Language Models
Yi-Hsien Liu
Tianle Han
Siyuan Ma
Jia-Yu Zhang
Yuanyu Yang
...
Xiang Li
Ning Qiang
Dingang Shen
Tianming Liu
Bao Ge
ALM
ELM
AI4CE
LM&MA
LLMAG
38
461
0
04 Apr 2023
One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era
Chaoning Zhang
Chenshuang Zhang
Chenghao Li
Yu Qiao
Sheng Zheng
...
Sung-Ho Bae
Lik-Hang Lee
Pan Hui
In So Kweon
Choong Seon Hong
LM&MA
AI4MH
LRM
ELM
31
130
0
04 Apr 2023
Whose Opinions Do Language Models Reflect?
Shibani Santurkar
Esin Durmus
Faisal Ladhak
Cinoo Lee
Percy Liang
Tatsunori Hashimoto
19
383
0
30 Mar 2023
Talking Abortion (Mis)information with ChatGPT on TikTok
Filipo Sharevski
J. Loop
Peter Jachim
Amy Devine
Emma Pieroni
29
5
0
23 Feb 2023
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection
Kai Greshake
Sahar Abdelnabi
Shailesh Mishra
C. Endres
Thorsten Holz
Mario Fritz
SILM
47
433
0
23 Feb 2023
BiasTestGPT: Using ChatGPT for Social Bias Testing of Language Models
Rafal Kocielnik
Shrimai Prabhumoye
Vivian Zhang
Roy Jiang
R. Alvarez
Anima Anandkumar
36
6
0
14 Feb 2023
Diminished Diversity-of-Thought in a Standard Large Language Model
Peter S. Park
P. Schoenegger
Chongyang Zhu
ELM
AI4CE
ALM
25
37
0
13 Feb 2023
A Categorical Archive of ChatGPT Failures
Ali Borji
ELM
25
379
0
06 Feb 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
313
11,915
0
04 Mar 2022
The Woman Worked as a Babysitter: On Biases in Language Generation
Emily Sheng
Kai-Wei Chang
Premkumar Natarajan
Nanyun Peng
211
616
0
03 Sep 2019