Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

29 May 2023

Myra Cheng

Esin Durmus

Dan Jurafsky

ArXiv (abs)PDF HTML

Papers citing "Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models"

50 / 145 papers shown

AI Will Always Love You: Studying Implicit Biases in Romantic AI Companions

Clare Grogan

Jackie Kay

Maria Perez-Ortiz

343

27 Feb 2025

Unsupervised Concept Vector Extraction for Bias Control in LLMs

490

27 Feb 2025

FSPO: Few-Shot Preference Optimization of Synthetic Preference Data in LLMs Elicits Effective Personalization to Real Users

302

26 Feb 2025

Language Model Fine-Tuning on Scaled Survey Data for Predicting Distributions of Public OpinionsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

406

24 Feb 2025

Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language ModelsConference on Fairness, Accountability and Transparency (FAccT), 2025

Yuxuan Li

Hirokazu Shirado

Sauvik Das

199

29 Jan 2025

An Empirically-grounded tool for Automatic Prompt Linting and Repair: A Case Study on Bias, Vulnerability, and Optimization in Developer Prompts

310

21 Jan 2025

Explicit vs. Implicit: Investigating Social Bias in Large Language Models through Self-ReflectionAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

475

04 Jan 2025

From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents

...

434

04 Dec 2024

How far can bias go? -- Tracing bias from pretraining data to alignment

402

28 Nov 2024

Evaluating the Prompt Steerability of Large Language Models

Erik Miehling

Michael Desmond

Karthikeyan N. Ramamurthy

434

19 Nov 2024

Multi3Hate: Multimodal, Multilingual, and Multicultural Hate Speech Detection with Vision-Language ModelsVolume 1 (V1), 2024

Minh Duc Bui

Katharina von der Wense

Anne Lauscher

VLM

267

06 Nov 2024

Does ChatGPT Have a Poetic Style?Workshop on Computational Humanities Research (CHR), 2024

Melanie Walsh

Anna Preus

Elizabeth Gronski

246

20 Oct 2024

Speciesism in Natural Language Processing ResearchAI and Ethics (AI & Ethics), 2024

Masashi Takeshita

Rafal Rzepka

214

18 Oct 2024

LLMs are Biased Teachers: Evaluating LLM Bias in Personalized EducationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

539

17 Oct 2024

Which Demographics do LLMs Default to During Annotation?Annual Meeting of the Association for Computational Linguistics (ACL), 2024

...

Yarik Menchaca Resendiz

Aswathy Velutharambath

Amelie Wuhrl

Sabine Weber

Roman Klinger

337

11 Oct 2024

Why am I seeing this: Democratizing End User Auditing for Online Content RecommendationsACM Symposium on User Interface Software and Technology (UIST), 2024

Luke Cao

Toby Jia-jun Li

249

07 Oct 2024

SoK: Towards Security and Safety of Edge AI

269

07 Oct 2024

On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Abhilasha Sancheti

Haozhe An

Rachel Rudinger

237

05 Oct 2024

Evaluating and Enhancing Large Language Models for Novelty Assessment in Scholarly Publications

Ethan Lin

Zhiyuan Peng

Yi Fang

654

25 Sep 2024

ChainBuddy: An AI Agent System for Generating LLM PipelinesInternational Conference on Human Factors in Computing Systems (CHI), 2024

Jingyue Zhang

Ian Arawjo

LLMAG

259

20 Sep 2024

Multimodal Fusion with LLMs for Engagement Prediction in Natural Conversation

Cheng Charles Ma

Kevin Hyekang Joo

Alexandria K. Vail

Sunreeta Bhattacharya

Álvaro Fernández García

Kailana Baker-Matsuoka

Sheryl Mathew

Lori L. Holt

Fernando De la Torre

215

13 Sep 2024

Agentic Society: Merging skeleton from real world and texture from Large Language Model

Yuqi Bai

Kun Sun

Huishi Yin

234

02 Sep 2024

LLMs generate structurally realistic social networks but overestimate political homophilyInternational Conference on Web and Social Media (ICWSM), 2024

363

29 Aug 2024

Self-Alignment: Improving Alignment of Cultural Values in LLMs via In-Context Learning

Rochelle Choenni

Ekaterina Shutova

376

29 Aug 2024

Can Unconfident LLM Annotations Be Used for Confident Conclusions?North American Chapter of the Association for Computational Linguistics (NAACL), 2024

384

27 Aug 2024

Does Liking Yellow Imply Driving a School Bus? Semantic Leakage in Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

518

12 Aug 2024

Do LLMs have Consistent Values?

328

16 Jul 2024

Probability of Differentiation Reveals Brittleness of Homogeneity Bias in Large Language Models

Messi H.J. Lee

Calvin K. Lai

126

10 Jul 2024

Who is better at math, Jenny or Jingzhen? Uncovering Stereotypes in Large Language Models

Zara Siddique

Liam D. Turner

Luis Espinosa-Anke

217

09 Jul 2024

Divine LLaMAs: Bias, Stereotypes, Stigmatization, and Emotion Representation of Religion in Large Language Models

Flor Miriam Plaza del Arco

Amanda Cercas Curry

Susanna Paoli

Alba Curry

Dirk Hovy

226

09 Jul 2024

Helpful assistant or fruitful facilitator? Investigating how personas affect language model behavior

Pedro Henrique Luz de Araujo

Benjamin Roth

303

02 Jul 2024

Native Design Bias: Studying the Impact of English Nativeness on Language Model Performance

412

25 Jun 2024

Shortcomings of LLMs for Low-Resource Translation: Retrieval and Understanding are Both the Problem

Sara Court

Micha Elsner

202

21 Jun 2024

Exploring Changes in Nation Perception with Nationality-Assigned Personas in LLMs

M. Kamruzzaman

Gene Louis Kim

196

20 Jun 2024

Who's asking? User personas and the mechanics of latent misalignment

Emily Reif

352

17 Jun 2024

Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting

Sagnik Mukherjee

Muhammad Farid Adilazuarda

254

17 Jun 2024

Cultural Value Differences of LLMs: Prompt, Language, and Model Size

Qishuai Zhong

Yike Yun

Aixin Sun

189

17 Jun 2024

Exploring Safety-Utility Trade-Offs in Personalized Language Models

Anvesh Rao Vijjini

Somnath Basu Roy Chowdhury

Snigdha Chaturvedi

532

17 Jun 2024

The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models

Barbara Plank

295

16 Jun 2024

Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender?Annual Meeting of the Association for Computational Linguistics (ACL), 2024

273

15 Jun 2024

PAL: Pluralistic Alignment Framework for Learning from Heterogeneous Preferences

Daiwei Chen

Yi Chen

Aniket Rege

Ramya Korlakai Vinayak

279

12 Jun 2024

MBBQ: A Dataset for Cross-Lingual Comparison of Stereotypes in Generative LLMs

Vera Neplenbroek

Arianna Bisazza

Raquel Fernández

367

11 Jun 2024

A Taxonomy of Challenges to Curating Fair Datasets

Dora Zhao

Alice Xiang

277

10 Jun 2024

Ask LLMs Directly, "What shapes your bias?": Measuring Social Bias in Large Language Models

344

06 Jun 2024

Aligning Language Models with Demonstrated Feedback

Diyi Yang

364

02 Jun 2024

More Distinctively Black and Feminine Faces Lead to Increased Stereotyping in Vision-Language Models

192

22 May 2024

FairMonitor: A Dual-framework for Detecting Stereotypes and Biases in Large Language Models

Xingjiao Wu

168

06 May 2024

PICLe: Eliciting Diverse Behaviors from Large Language Models with Persona In-Context LearningInternational Conference on Machine Learning (ICML), 2024

Hyeong Kyu Choi

Yixuan Li

298

03 May 2024

From Persona to Personalization: A Survey on Role-Playing Language Agents

...

Ziquan Fu

Yanghua Xiao

367

174

28 Apr 2024

GeniL: A Multilingual Dataset on Generalizing Language

Aida Mostafazadeh Davani

S. Gubbi

Sunipa Dev

Shachi Dave

Vinodkumar Prabhakaran

222

08 Apr 2024