Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

29 May 2023

Myra Cheng

Esin Durmus

Dan Jurafsky

Papers citing "Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models"

45 / 145 papers shown

Social Skill Training with Large Language Models

Diyi Yang

179

05 Apr 2024

Do Large Language Models Rank Fairly? An Empirical Study on the Fairness of LLMs as Rankers

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

286

04 Apr 2024

Template-Based Probes Are Imperfect Lenses for Counterfactual Bias Evaluation in LLMs

Farnaz Kohankhaki

David B. Emerson

Laleh Seyyed-Kalantari

Faiza Khan Khattak

392

04 Apr 2024

Fairness in Large Language Models: A Taxonomic Survey

Zhibo Chu

Sribala Vidyadhari Chinta

Wenbin Zhang

AILaw

261

31 Mar 2024

Argument Quality Assessment in the Age of Instruction-Following Large Language Models

230

24 Mar 2024

Can AI Outperform Human Experts in Creating Social Media Creatives?

Eunkyung Park

Raymond K. Wong

Junbum Kwon

210

19 Mar 2024

HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive Speech Detection via Large Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

H. Nghiem

Hal Daumé

367

18 Mar 2024

On the Essence and Prospect: An Investigation of Alignment Approaches for Big Models

Maosong Sun

Xing Xie

OffRL

390

07 Mar 2024

Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution

Flor Miriam Plaza del Arco

477

05 Mar 2024

Counterspeakers' Perspectives: Unveiling Barriers and AI Needs in the Fight against Online Hate

232

29 Feb 2024

Random Silicon Sampling: Simulating Human Sub-Population Opinion Using a Large Language Model Based on Group-Level Demographic Information

279

28 Feb 2024

Shallow Synthesis of Knowledge in GPT-Generated Texts: A Case Study in Automatic Related Work Composition

177

19 Feb 2024

Examining Gender and Racial Bias in Large Vision-Language Models Using a Novel Dataset of Parallel Images

Kathleen C. Fraser

S. Kiritchenko

262

08 Feb 2024

Measuring Implicit Bias in Explicitly Unbiased Large Language Models

333

06 Feb 2024

AnthroScore: A Computational Linguistic Measure of Anthropomorphism

Myra Cheng

Kristina Gligorić

Tiziano Piccardi

Dan Jurafsky

180

03 Feb 2024

Beyond Behaviorist Representational Harms: A Plan for Measurement and Mitigation

Conference on Fairness, Accountability and Transparency (FAccT), 2024

Jennifer Chien

David Danks

298

25 Jan 2024

Canvil: Designerly Adaptation for LLM-Powered User Experiences

International Conference on Human Factors in Computing Systems (CHI), 2024

K. J. Kevin Feng

Q. V. Liao

Ziang Xiao

Jennifer Wortman Vaughan

Amy X. Zhang

David W. McDonald

197

17 Jan 2024

Large Language Models Portray Socially Subordinate Groups as More Homogeneous, Consistent with a Bias Observed in Humans

Conference on Fairness, Accountability and Transparency (FAccT), 2024

Messi H.J. Lee

Jacob M. Montgomery

Calvin K. Lai

215

16 Jan 2024

"What's important here?": Opportunities and Challenges of Using LLMs in Retrieving Information from Web Interfaces

Faria Huq

Jeffrey P. Bigham

Nikolas Martelaro

236

11 Dec 2023

Fair Text Classification with Wasserstein Independence

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

251

21 Nov 2023

P^3SUM: Preserving Author's Perspective in News Summarization with Diffusion Language Models

Yuhan Liu

Shangbin Feng

Xiaochuang Han

Vidhisha Balachandran

258

16 Nov 2023

You don't need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric Instruments

282

16 Nov 2023

Simulating Opinion Dynamics with Networks of LLM-based Agents

487

127

16 Nov 2023

A Material Lens on Coloniality in NLP

William B. Held

Camille Harris

Michael Best

Diyi Yang

322

14 Nov 2023

Intentional Biases in LLM Responses

Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON), 2023

Nicklaus Badyal

Derek Jacoby

Yvonne Coady

123

11 Nov 2023

ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human Preferences

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

465

10 Nov 2023

Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs

Shashank Gupta

Vaishnavi Shrivastava

465

170

08 Nov 2023

Personas as a Way to Model Truthfulness in Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

394

27 Oct 2023

SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents

International Conference on Learning Representations (ICLR), 2023

Xuhui Zhou

...

Louis-Philippe Morency

Yonatan Bisk

Daniel Fried

Graham Neubig

Maarten Sap

LLMAG

381

222

18 Oct 2023

CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

329

106

17 Oct 2023

Rehearsal: Simulating Conflict to Teach Conflict Resolution

International Conference on Human Factors in Computing Systems (CHI), 2023

Diyi Yang

199

21 Sep 2023

Sensitivity, Performance, Robustness: Deconstructing the Effect of Sociodemographic Prompting

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

321

13 Sep 2023

FairMonitor: A Four-Stage Automatic Framework for Detecting Stereotypes and Biases in Large Language Models

Xingjiao Wu

132

21 Aug 2023

A Survey on Fairness in Large Language Models

Ying Wang

383

20 Aug 2023

Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench

Michael R. Lyu

362

07 Aug 2023

How User Language Affects Conflict Fatality Estimates in ChatGPT

Daniel Kazenwadel

C. Steinert

26 Jul 2023

Unveiling Gender Bias in Terms of Profession Across LLMs: Analyzing and Addressing Sociological Implications

Vishesh Thakur

227

18 Jul 2023

Evaluating Biased Attitude Associations of Language Models in an Intersectional Context

AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023

Shiva Omrani Sabbaghi

Robert Wolfe

Aylin Caliskan

201

07 Jul 2023

Towards Measuring the Representation of Subjective Global Opinions in Language Models

Esin Durmus

...

Deep Ganguli

353

335

28 Jun 2023

Opportunities and Risks of LLMs for Scalable Deliberation with Polis

Esin Durmus

Deep Ganguli

195

20 Jun 2023

This Land is {Your, My} Land: Evaluating Geopolitical Biases in Language Models

North American Chapter of the Association for Computational Linguistics (NAACL), 2023

Bryan Li

Samar Haider

Chris Callison-Burch

472

24 May 2023

ChatGPT Perpetuates Gender Bias in Machine Translation and Ignores Non-Gendered Pronouns: Findings across Bengali and Five other Low-Resource Languages

International Computing Education Research Workshop (ICER), 2023

Sourojit Ghosh

Aylin Caliskan

212

103

17 May 2023

Coarse race data conceals disparities in clinical risk score performance

Machine Learning in Health Care (MLHC), 2023

194

18 Apr 2023

BiasTestGPT: Using ChatGPT for Social Bias Testing of Language Models

306

14 Feb 2023

Generalized Word Shift Graphs: A Method for Visualizing and Explaining Pairwise Comparisons Between Texts

218

05 Aug 2020