Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2305.18189
Cited By
Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
29 May 2023
Myra Cheng
Esin Durmus
Dan Jurafsky
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models"
50 / 145 papers shown
AI Will Always Love You: Studying Implicit Biases in Romantic AI Companions
Clare Grogan
Jackie Kay
Maria Perez-Ortiz
343
2
0
27 Feb 2025
Unsupervised Concept Vector Extraction for Bias Control in LLMs
Hannah Cyberey
Yangfeng Ji
David Evans
LLMSV
490
1
0
27 Feb 2025
FSPO: Few-Shot Preference Optimization of Synthetic Preference Data in LLMs Elicits Effective Personalization to Real Users
Anikait Singh
Sheryl Hsu
Kyle Hsu
E. Mitchell
Stefano Ermon
Tatsunori Hashimoto
Archit Sharma
Chelsea Finn
SyDa
OffRL
302
16
0
26 Feb 2025
Language Model Fine-Tuning on Scaled Survey Data for Predicting Distributions of Public Opinions
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Joseph Suh
Erfan Jahanparast
Suhong Moon
Minwoo Kang
Serina Chang
ALM
LM&MA
406
26
0
24 Feb 2025
Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models
Conference on Fairness, Accountability and Transparency (FAccT), 2025
Yuxuan Li
Hirokazu Shirado
Sauvik Das
199
12
0
29 Jan 2025
An Empirically-grounded tool for Automatic Prompt Linting and Repair: A Case Study on Bias, Vulnerability, and Optimization in Developer Prompts
Dhia Elhaq Rzig
Dhruba Jyoti Paul
Kaiser Pister
Jordan Henkel
Foyzul Hassan
310
0
0
21 Jan 2025
Explicit vs. Implicit: Investigating Social Bias in Large Language Models through Self-Reflection
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yachao Zhao
Bo Wang
Yan Wang
Dongming Zhao
Ruifang He
Yuexian Hou
475
14
0
04 Jan 2025
From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents
Xinyi Mou
Xuanwen Ding
Qi He
Liang Wang
Jingcong Liang
...
Lin Sun
Jiayu Lin
Jie Zhou
Xuanjing Huang
Zhongyu Wei
LLMAG
LM&Ro
AI4CE
434
50
0
04 Dec 2024
How far can bias go? -- Tracing bias from pretraining data to alignment
Marion Thaler
Abdullatif Köksal
Alina Leidinger
Anna Korhonen
Hinrich Schutze
402
4
0
28 Nov 2024
Evaluating the Prompt Steerability of Large Language Models
Erik Miehling
Michael Desmond
Karthikeyan N. Ramamurthy
Elizabeth M. Daly
Pierre Dognin
Jesus Rios
Djallel Bouneffouf
Miao Liu
LLMSV
434
14
0
19 Nov 2024
Multi3Hate: Multimodal, Multilingual, and Multicultural Hate Speech Detection with Vision-Language Models
Volume 1 (V1), 2024
Minh Duc Bui
Katharina von der Wense
Anne Lauscher
VLM
267
5
0
06 Nov 2024
Does ChatGPT Have a Poetic Style?
Workshop on Computational Humanities Research (CHR), 2024
Melanie Walsh
Anna Preus
Elizabeth Gronski
246
8
0
20 Oct 2024
Speciesism in Natural Language Processing Research
AI and Ethics (AI & Ethics), 2024
Masashi Takeshita
Rafal Rzepka
214
7
0
18 Oct 2024
LLMs are Biased Teachers: Evaluating LLM Bias in Personalized Education
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Iain Xie Weissburg
Sathvika Anand
Sharon Levy
Haewon Jeong
539
25
0
17 Oct 2024
Which Demographics do LLMs Default to During Annotation?
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Johannes Schäfer
Aidan Combs
Christopher Bagdon
Jiahui Li
Nadine Probol
...
Yarik Menchaca Resendiz
Aswathy Velutharambath
Amelie Wuhrl
Sabine Weber
Roman Klinger
337
7
0
11 Oct 2024
Why am I seeing this: Democratizing End User Auditing for Online Content Recommendations
ACM Symposium on User Interface Software and Technology (UIST), 2024
Chaoran Chen
Leyang Li
Luke Cao
Yanfang Ye
Tianshi Li
Yaxing Yao
Toby Jia-jun Li
MLAU
249
6
0
07 Oct 2024
SoK: Towards Security and Safety of Edge AI
Tatjana Wingarz
Anne Lauscher
Janick Edinger
Dominik Kaaser
Stefan Schulte
Mathias Fischer
269
0
0
07 Oct 2024
On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Abhilasha Sancheti
Haozhe An
Rachel Rudinger
237
0
0
05 Oct 2024
Evaluating and Enhancing Large Language Models for Novelty Assessment in Scholarly Publications
Ethan Lin
Zhiyuan Peng
Yi Fang
654
13
0
25 Sep 2024
ChainBuddy: An AI Agent System for Generating LLM Pipelines
International Conference on Human Factors in Computing Systems (CHI), 2024
Jingyue Zhang
Ian Arawjo
LLMAG
259
0
0
20 Sep 2024
Multimodal Fusion with LLMs for Engagement Prediction in Natural Conversation
Cheng Charles Ma
Kevin Hyekang Joo
Alexandria K. Vail
Sunreeta Bhattacharya
Álvaro Fernández García
Kailana Baker-Matsuoka
Sheryl Mathew
Lori L. Holt
Fernando De la Torre
215
9
0
13 Sep 2024
Agentic Society: Merging skeleton from real world and texture from Large Language Model
Yuqi Bai
Kun Sun
Huishi Yin
234
1
0
02 Sep 2024
LLMs generate structurally realistic social networks but overestimate political homophily
International Conference on Web and Social Media (ICWSM), 2024
Serina Chang
Alicja Chaszczewicz
Emma Wang
Maya Josifovska
Emma Pierson
J. Leskovec
363
23
0
29 Aug 2024
Self-Alignment: Improving Alignment of Cultural Values in LLMs via In-Context Learning
Rochelle Choenni
Ekaterina Shutova
376
15
0
29 Aug 2024
Can Unconfident LLM Annotations Be Used for Confident Conclusions?
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Kristina Gligorić
Tijana Zrnic
Cinoo Lee
Emmanuel J. Candès
Dan Jurafsky
384
25
0
27 Aug 2024
Does Liking Yellow Imply Driving a School Bus? Semantic Leakage in Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Hila Gonen
Terra Blevins
Alisa Liu
Luke Zettlemoyer
Noah A. Smith
518
11
0
12 Aug 2024
Do LLMs have Consistent Values?
Naama Rozen
G. Elidan
Amir Globerson
Ella Daniel
328
5
0
16 Jul 2024
Probability of Differentiation Reveals Brittleness of Homogeneity Bias in Large Language Models
Messi H.J. Lee
Calvin K. Lai
126
0
0
10 Jul 2024
Who is better at math, Jenny or Jingzhen? Uncovering Stereotypes in Large Language Models
Zara Siddique
Liam D. Turner
Luis Espinosa-Anke
217
2
0
09 Jul 2024
Divine LLaMAs: Bias, Stereotypes, Stigmatization, and Emotion Representation of Religion in Large Language Models
Flor Miriam Plaza del Arco
Amanda Cercas Curry
Susanna Paoli
Alba Curry
Dirk Hovy
226
9
0
09 Jul 2024
Helpful assistant or fruitful facilitator? Investigating how personas affect language model behavior
Pedro Henrique Luz de Araujo
Benjamin Roth
303
17
0
02 Jul 2024
Native Design Bias: Studying the Impact of English Nativeness on Language Model Performance
Manon Reusens
Philipp Borchert
Jochen De Weerdt
Bart Baesens
412
4
0
25 Jun 2024
Shortcomings of LLMs for Low-Resource Translation: Retrieval and Understanding are Both the Problem
Sara Court
Micha Elsner
202
9
0
21 Jun 2024
Exploring Changes in Nation Perception with Nationality-Assigned Personas in LLMs
M. Kamruzzaman
Gene Louis Kim
196
10
0
20 Jun 2024
Who's asking? User personas and the mechanics of latent misalignment
Asma Ghandeharioun
Ann Yuan
Marius Guerard
Emily Reif
Michael A. Lepori
Lucas Dixon
LLMSV
352
17
0
17 Jun 2024
Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting
Sagnik Mukherjee
Muhammad Farid Adilazuarda
Sunayana Sitaram
Kalika Bali
Alham Fikri Aji
Monojit Choudhury
254
21
0
17 Jun 2024
Cultural Value Differences of LLMs: Prompt, Language, and Model Size
Qishuai Zhong
Yike Yun
Aixin Sun
189
9
0
17 Jun 2024
Exploring Safety-Utility Trade-Offs in Personalized Language Models
Anvesh Rao Vijjini
Somnath Basu Roy Chowdhury
Snigdha Chaturvedi
532
16
0
17 Jun 2024
The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models
Bolei Ma
Xinpeng Wang
Tiancheng Hu
Anna Haensch
Michael A. Hedderich
Barbara Plank
Frauke Kreuter
ALM
295
16
0
16 Jun 2024
Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender?
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Haozhe An
Christabel Acquaye
Colin Wang
Zongxia Li
Rachel Rudinger
273
22
0
15 Jun 2024
PAL: Pluralistic Alignment Framework for Learning from Heterogeneous Preferences
Daiwei Chen
Yi Chen
Aniket Rege
Ramya Korlakai Vinayak
279
36
0
12 Jun 2024
MBBQ: A Dataset for Cross-Lingual Comparison of Stereotypes in Generative LLMs
Vera Neplenbroek
Arianna Bisazza
Raquel Fernández
367
22
0
11 Jun 2024
A Taxonomy of Challenges to Curating Fair Datasets
Dora Zhao
M. Scheuerman
Pooja Chitre
Jerone T. A. Andrews
Georgia Panagiotidou
Shawn Walker
Kathleen H. Pine
Alice Xiang
277
4
0
10 Jun 2024
Ask LLMs Directly, "What shapes your bias?": Measuring Social Bias in Large Language Models
Jisu Shin
Hoyun Song
Huije Lee
Soyeong Jeong
Jong C. Park
344
14
0
06 Jun 2024
Aligning Language Models with Demonstrated Feedback
Omar Shaikh
Michelle S. Lam
Joey Hejna
Yijia Shao
Michael S. Bernstein
Michael S. Bernstein
Diyi Yang
ALM
364
11
0
02 Jun 2024
More Distinctively Black and Feminine Faces Lead to Increased Stereotyping in Vision-Language Models
Messi H.J. Lee
Jacob M. Montgomery
Calvin K. Lai
VLM
192
0
0
22 May 2024
FairMonitor: A Dual-framework for Detecting Stereotypes and Biases in Large Language Models
Yanhong Bai
Jiabao Zhao
Jinxin Shi
Zhentao Xie
Xingjiao Wu
Xiaoling Wang
168
5
0
06 May 2024
PICLe: Eliciting Diverse Behaviors from Large Language Models with Persona In-Context Learning
International Conference on Machine Learning (ICML), 2024
Hyeong Kyu Choi
Yixuan Li
298
27
0
03 May 2024
From Persona to Personalization: A Survey on Role-Playing Language Agents
Jiangjie Chen
Xintao Wang
Rui Xu
Siyu Yuan
Yikai Zhang
...
Caiyu Hu
Siye Wu
Scott Ren
Ziquan Fu
Yanghua Xiao
367
174
0
28 Apr 2024
GeniL: A Multilingual Dataset on Generalizing Language
Aida Mostafazadeh Davani
S. Gubbi
Sunipa Dev
Shachi Dave
Vinodkumar Prabhakaran
222
2
0
08 Apr 2024
Previous
1
2
3
Next