v1v2v3v4v5 (latest)

Intrinsic Bias Metrics Do Not Correlate with Application Bias

Annual Meeting of the Association for Computational Linguistics (ACL), 2020

31 December 2020

Seraphina Goldfarb-Tarrant

Rebecca Marchant

Ricardo Muñoz Sánchez

Mugdha Pandya

Adam Lopez

ArXiv (abs)PDF HTML

Papers citing "Intrinsic Bias Metrics Do Not Correlate with Application Bias"

50 / 81 papers shown

A word association network methodology for evaluating implicit biases in LLMs compared to humans

Katherine Abramski

Giulio Rossetti

Massimo Stella

190

28 Oct 2025

Intrinsic Meets Extrinsic Fairness: Assessing the Downstream Impact of Bias Mitigation in Large Language Models

'Mina Arzaghi'

Álireza Dehghanpour Farashah'

'Florian Carichon'

' Golnoosh Farnadi'

237

19 Sep 2025

Bias after Prompting: Persistent Discrimination in Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2025

233

09 Sep 2025

GANDiff FR: Hybrid GAN Diffusion Synthesis for Causal Bias Attribution in Face Recognition

Md Asgor Hossain Reaj

249

15 Aug 2025

Do Biased Models Have Biased Thoughts?

218

08 Aug 2025

Toward Revealing Nuanced Biases in Medical LLMs

Farzana Islam Adiba

Rahmatollah Beheshti

180

26 Jul 2025

Uncertainty Quantification for Evaluating Machine Translation Bias

366

24 Jul 2025

Exploring Gender Bias in Large Language Models: An In-depth Dive into the German Language

267

22 Jul 2025

Are Bias Evaluation Methods Biased ?

260

20 Jun 2025

Understanding and Meeting Practitioner Needs When Measuring Representational Harms Caused by LLM-Based SystemsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Emma Harvey

Emily Sheng

Su Lin Blodgett

Alexandra Chouldechova

Jean Garcia-Gathright

Alexandra Olteanu

Hanna M. Wallach

236

04 Jun 2025

Deontological Keyword Bias: The Impact of Modal Expressions on Normative Judgments of Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Bumjin Park

Jinsil Lee

Jaesik Choi

252

01 Jun 2025

Developing A Framework to Support Human Evaluation of Bias in Generated Free Response Text

297

05 May 2025

Agree to Disagree? A Meta-Evaluation of LLM Misgendering

613

23 Apr 2025

Toward an Evaluation Science for Generative AI Systems

453

07 Mar 2025

Linear Representations of Political Perspective Emerge in Large Language ModelsInternational Conference on Learning Representations (ICLR), 2025

Junsol Kim

James Evans

Aaron Schein

458

03 Mar 2025

Evaluating the Effect of Retrieval Augmentation on Social Biases

Tianhui Zhang

Yi Zhou

Danushka Bollegala

387

24 Feb 2025

LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use CasesJournal of Open Source Software (JOSS), 2025

380

06 Jan 2025

Ethics Whitepaper: Whitepaper on Ethical Research into Large Language Models

372

17 Oct 2024

ML-EAT: A Multilevel Embedding Association Test for Interpretable and Transparent Social ScienceAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2024

Robert Wolfe

Alexis Hiniker

Bill Howe

322

04 Aug 2024

Fairness Definitions in Language Models Explained

Thang Viet Doan

Zhibo Chu

Sribala Vidyadhari Chinta

Wenbin Zhang

ALM

456

26 Jul 2024

Extrinsic Evaluation of Cultural Competence in Large Language Models

Shaily Bhatt

Fernando Diaz

ELM EGVM

396

17 Jun 2024

Why Don't Prompt-Based Fairness Metrics Correlate?Annual Meeting of the Association for Computational Linguistics (ACL), 2024

335

09 Jun 2024

The Impossibility of Fair LLMs

553

28 May 2024

Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution

Flor Miriam Plaza del Arco

513

05 Mar 2024

Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation

611

20 Feb 2024

Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting

320

28 Jan 2024

Co$^2$PT: Mitigating Bias in Pre-trained Language Models through
Counterfactual Contrastive Prompt Tuning

^2

PT: Mitigating Bias in Pre-trained Language Models through Counterfactual Contrastive Prompt TuningConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

364

19 Oct 2023

Survey of Social Bias in Vision-Language Models

443

24 Sep 2023

TIDE: Textual Identity Detection for Evaluating and Augmenting Classification and Language Models

Emmanuel Klu

Sameer Sethi

253

07 Sep 2023

Bias and Fairness in Large Language Models: A SurveyComputational Linguistics (CL), 2023

Isabel O. Gallegos

Ryan Rossi

Joe Barrow

Md Mehrab Tanjim

Sungchul Kim

482

1,011

02 Sep 2023

Thesis Distillation: Investigating The Impact of Bias in NLP Models on Hate Speech Detection

Fatma Elsafoury

381

31 Aug 2023

Gender mobility in the labor market with skills-based matching models

17 Jul 2023

Sociodemographic Bias in Language Models: A Survey and Forward Path

Vipul Gupta

Pranav Narayanan Venkit

Shomir Wilson

R. Passonneau

569

13 Jun 2023

Detecting and Mitigating Indirect Stereotypes in Word Embeddings

Erin E. George

Joyce A. Chew

Deanna Needell

200

23 May 2023

Having Beer after Prayer? Measuring Cultural Bias in Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

674

158

23 May 2023

This Prompt is Measuring <MASK>: Evaluating Bias Evaluation in Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Seraphina Goldfarb-Tarrant

Eddie L. Ungless

Esma Balkir

Su Lin Blodgett

290

22 May 2023

Bias Beyond English: Counterfactual Tests for Bias in Sentiment Analysis in Four LanguagesAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Seraphina Goldfarb-Tarrant

Adam Lopez

Roi Blanco

Diego Marcheggiani

233

19 May 2023

On the Independence of Association Bias and Empirical Fairness in Language ModelsConference on Fairness, Accountability and Transparency (FAccT), 2023

Laura Cabello

Anna Katrine van Zee

Anders Søgaard

265

20 Apr 2023

ACROCPoLis: A Descriptive Framework for Making Sense of FairnessConference on Fairness, Accountability and Transparency (FAccT), 2023

Andrea Aler Tubella

Dimitri Coelho Mollo

Adam Dahlgren Lindstrom

...

Julian Alfredo Mendez

J. Nieves

256

19 Apr 2023

Overwriting Pretrained Bias with Finetuning DataIEEE International Conference on Computer Vision (ICCV), 2023

Angelina Wang

Olga Russakovsky

364

10 Mar 2023

In-Depth Look at Word Filling Societal Bias MeasuresConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

Matúš Pikuliak

Ivana Benová

Viktor Bachratý

299

24 Feb 2023

Fairness in Language Models Beyond English: Gaps and ChallengesFindings (Findings), 2023

Krithika Ramesh

Sunayana Sitaram

Monojit Choudhury

400

24 Feb 2023

BiasTestGPT: Using ChatGPT for Social Bias Testing of Language Models

412

14 Feb 2023

Fair Enough: Standardizing Evaluation and Model Selection for Fairness Research in NLPConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023

Xudong Han

Timothy Baldwin

Trevor Cohn

282

11 Feb 2023

A Comprehensive Study of Gender Bias in Chemical Named Entity Recognition ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022

Xingmeng Zhao

A. Niazi

Anthony Rios

276

24 Dec 2022

Trustworthy Social Bias MeasurementAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2022

Rishi Bommasani

Abigail Z. Jacobs

278

20 Dec 2022

On Second Thought, Let's Not Think Step by Step! Bias and Toxicity in Zero-Shot ReasoningAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Diyi Yang

556

257

15 Dec 2022

Undesirable Biases in NLP: Addressing Challenges of Measurement

533

24 Nov 2022

ADEPT: A DEbiasing PrompT FrameworkAAAI Conference on Artificial Intelligence (AAAI), 2022

416

10 Nov 2022

HERB: Measuring Hierarchical Regional Bias in Pre-trained Language Models

Ge Zhang

250

05 Nov 2022