2012.15859
Intrinsic Bias Metrics Do Not Correlate with Application Bias
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
31 December 2020
Seraphina Goldfarb-Tarrant
Rebecca Marchant
Ricardo Muñoz Sánchez
Mugdha Pandya
Adam Lopez
Papers citing
"Intrinsic Bias Metrics Do Not Correlate with Application Bias"
50 / 81 papers shown
A word association network methodology for evaluating implicit biases in LLMs compared to humans
Katherine Abramski
Giulio Rossetti
Massimo Stella
190
1
0
28 Oct 2025
Intrinsic Meets Extrinsic Fairness: Assessing the Downstream Impact of Bias Mitigation in Large Language Models
Mina Arzaghi
Alireza Dehghanpour Farashah
Florian Carichon
Golnoosh Farnadi
237
2
0
19 Sep 2025
Bias after Prompting: Persistent Discrimination in Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
N. Sivakumar
Natalie Mackraz
Samira Khorshidi
Krishna Patel
B. Theobald
Luca Zappella
N. Apostoloff
AI4CE
233
1
0
09 Sep 2025
GANDiff FR: Hybrid GAN Diffusion Synthesis for Causal Bias Attribution in Face Recognition
Md Asgor Hossain Reaj
Rajan Das Gupta
Md. Yeasin Rahat
Nafiz Fahad
Md Jawadul Hasan
Tze Hui Liew
249
0
0
15 Aug 2025
Do Biased Models Have Biased Thoughts?
Swati Rajwal
Shivank Garg
Reem Abdel-Salam
Abdelrahman Zayed
LRM
218
0
0
08 Aug 2025
Toward Revealing Nuanced Biases in Medical LLMs
Farzana Islam Adiba
Rahmatollah Beheshti
180
0
0
26 Jul 2025
Uncertainty Quantification for Evaluating Machine Translation Bias
Ieva Staliunaite
Julius Cheng
Andreas Vlachos
UQLM
366
1
0
24 Jul 2025
Exploring Gender Bias in Large Language Models: An In-depth Dive into the German Language
Kristin Gnadt
David Thulke
Simone Kopeinik
Ralf Schluter
267
2
0
22 Jul 2025
Are Bias Evaluation Methods Biased?
Lina Berrayana
Sean Rooney
Luis Garces-Erice
Ioana Giurgiu
ELM
260
4
0
20 Jun 2025
Understanding and Meeting Practitioner Needs When Measuring Representational Harms Caused by LLM-Based Systems
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Emma Harvey
Emily Sheng
Su Lin Blodgett
Alexandra Chouldechova
Jean Garcia-Gathright
Alexandra Olteanu
Hanna M. Wallach
236
4
0
04 Jun 2025
Deontological Keyword Bias: The Impact of Modal Expressions on Normative Judgments of Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Bumjin Park
Jinsil Lee
Jaesik Choi
252
0
0
01 Jun 2025
Developing A Framework to Support Human Evaluation of Bias in Generated Free Response Text
Jennifer Healey
Laurie Byrum
Md Nadeem Akhtar
Surabhi Bhargava
Moumita Sinha
297
0
0
05 May 2025
Agree to Disagree? A Meta-Evaluation of LLM Misgendering
Arjun Subramonian
Vagrant Gautam
Preethi Seshadri
Dietrich Klakow
Kai-Wei Chang
Zhaoxin Fan
613
3
0
23 Apr 2025
Toward an Evaluation Science for Generative AI Systems
Laura Weidinger
Deb Raji
Hanna M. Wallach
Margaret Mitchell
Angelina Wang
Olawale Salaudeen
Rishi Bommasani
Sayash Kapoor
Deep Ganguli
Sanmi Koyejo
EGVM
ELM
453
37
0
07 Mar 2025
Linear Representations of Political Perspective Emerge in Large Language Models
International Conference on Learning Representations (ICLR), 2025
Junsol Kim
James Evans
Aaron Schein
458
23
0
03 Mar 2025
Evaluating the Effect of Retrieval Augmentation on Social Biases
Tianhui Zhang
Yi Zhou
Danushka Bollegala
387
2
0
24 Feb 2025
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases
Journal of Open Source Software (JOSS), 2025
Dylan Bouchard
Mohit Singh Chauhan
David Skarbrevik
Viren Bajaj
Zeya Ahmad
380
6
0
06 Jan 2025
Ethics Whitepaper: Whitepaper on Ethical Research into Large Language Models
Eddie L. Ungless
Nikolas Vitsakis
Zeerak Talat
James Garforth
Bjorn Ross
Arno Onken
Atoosa Kasirzadeh
Alexandra Birch
372
3
0
17 Oct 2024
ML-EAT: A Multilevel Embedding Association Test for Interpretable and Transparent Social Science
AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2024
Robert Wolfe
Alexis Hiniker
Bill Howe
322
2
0
04 Aug 2024
Fairness Definitions in Language Models Explained
Thang Viet Doan
Zhibo Chu
Sribala Vidyadhari Chinta
Wenbin Zhang
ALM
456
21
0
26 Jul 2024
Extrinsic Evaluation of Cultural Competence in Large Language Models
Shaily Bhatt
Fernando Diaz
ELM
EGVM
396
20
0
17 Jun 2024
Why Don't Prompt-Based Fairness Metrics Correlate?
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
A. Zayed
Gonçalo Mordido
Ioana Baldini
Sarath Chandar
ALM
335
10
0
09 Jun 2024
The Impossibility of Fair LLMs
Jacy Reese Anthis
Kristian Lum
Michael Ekstrand
Avi Feller
Alexander D’Amour
FaML
553
29
0
28 May 2024
Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution
Flor Miriam Plaza del Arco
Amanda Cercas Curry
Alba Curry
Gavin Abercrombie
Dirk Hovy
513
45
0
05 Mar 2024
Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation
Kristian Lum
Jacy Reese Anthis
Chirag Nagpal
Alexander D’Amour
611
38
0
20 Feb 2024
Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting
Masahiro Kaneko
Danushka Bollegala
Naoaki Okazaki
Timothy Baldwin
LRM
320
58
0
28 Jan 2024
Co$^2$PT: Mitigating Bias in Pre-trained Language Models through Counterfactual Contrastive Prompt Tuning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xiangjue Dong
Ziwei Zhu
Zhuoer Wang
Maria Teleki
James Caverlee
364
20
0
19 Oct 2023
Survey of Social Bias in Vision-Language Models
Nayeon Lee
Yejin Bang
Holy Lovenia
Samuel Cahyawijaya
Wenliang Dai
Pascale Fung
VLM
443
34
0
24 Sep 2023
TIDE: Textual Identity Detection for Evaluating and Augmenting Classification and Language Models
Emmanuel Klu
Sameer Sethi
253
0
0
07 Sep 2023
Bias and Fairness in Large Language Models: A Survey
Computational Linguistics (CL), 2023
Isabel O. Gallegos
Ryan Rossi
Joe Barrow
Md Mehrab Tanjim
Sungchul Kim
Franck Dernoncourt
Tong Yu
Ruiyi Zhang
Nesreen Ahmed
AILaw
482
1,011
0
02 Sep 2023
Thesis Distillation: Investigating The Impact of Bias in NLP Models on Hate Speech Detection
Fatma Elsafoury
381
5
0
31 Aug 2023
Gender mobility in the labor market with skills-based matching models
Ajaya Adhikari
S. Vethman
Daan Vos
Marc V. Lenz
Ioana Cocu
I. Tolios
C. Veenman
89
2
0
17 Jul 2023
Sociodemographic Bias in Language Models: A Survey and Forward Path
Vipul Gupta
Pranav Narayanan Venkit
Shomir Wilson
R. Passonneau
569
37
0
13 Jun 2023
Detecting and Mitigating Indirect Stereotypes in Word Embeddings
Erin E. George
Joyce A. Chew
Deanna Needell
200
0
0
23 May 2023
Having Beer after Prayer? Measuring Cultural Bias in Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Tarek Naous
Michael Joseph Ryan
Alan Ritter
Wei Xu
674
158
0
23 May 2023
This Prompt is Measuring <MASK>: Evaluating Bias Evaluation in Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Seraphina Goldfarb-Tarrant
Eddie L. Ungless
Esma Balkir
Su Lin Blodgett
290
15
0
22 May 2023
Bias Beyond English: Counterfactual Tests for Bias in Sentiment Analysis in Four Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Seraphina Goldfarb-Tarrant
Adam Lopez
Roi Blanco
Diego Marcheggiani
233
17
0
19 May 2023
On the Independence of Association Bias and Empirical Fairness in Language Models
Conference on Fairness, Accountability and Transparency (FAccT), 2023
Laura Cabello
Anna Katrine van Zee
Anders Søgaard
265
38
0
20 Apr 2023
ACROCPoLis: A Descriptive Framework for Making Sense of Fairness
Conference on Fairness, Accountability and Transparency (FAccT), 2023
Andrea Aler Tubella
Dimitri Coelho Mollo
Adam Dahlgren Lindstrom
Hannah Devinney
Virginia Dignum
...
Anna Jonsson
T. Kampik
Tom Lenaerts
Julian Alfredo Mendez
J. Nieves
256
14
0
19 Apr 2023
Overwriting Pretrained Bias with Finetuning Data
IEEE International Conference on Computer Vision (ICCV), 2023
Angelina Wang
Olga Russakovsky
364
49
0
10 Mar 2023
In-Depth Look at Word Filling Societal Bias Measures
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Matúš Pikuliak
Ivana Benová
Viktor Bachratý
299
12
0
24 Feb 2023
Fairness in Language Models Beyond English: Gaps and Challenges
Findings (Findings), 2023
Krithika Ramesh
Sunayana Sitaram
Monojit Choudhury
400
30
0
24 Feb 2023
BiasTestGPT: Using ChatGPT for Social Bias Testing of Language Models
Rafal Kocielnik
Shrimai Prabhumoye
Vivian Zhang
Roy Jiang
R. Alvarez
Anima Anandkumar
412
11
0
14 Feb 2023
Fair Enough: Standardizing Evaluation and Model Selection for Fairness Research in NLP
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Xudong Han
Timothy Baldwin
Trevor Cohn
282
15
0
11 Feb 2023
A Comprehensive Study of Gender Bias in Chemical Named Entity Recognition Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Xingmeng Zhao
A. Niazi
Anthony Rios
276
3
0
24 Dec 2022
Trustworthy Social Bias Measurement
AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2022
Rishi Bommasani
Abigail Z. Jacobs
278
14
0
20 Dec 2022
On Second Thought, Let's Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Omar Shaikh
Hongxin Zhang
William B. Held
Michael S. Bernstein
Diyi Yang
ReLM
LRM
556
257
0
15 Dec 2022
Undesirable Biases in NLP: Addressing Challenges of Measurement
Oskar van der Wal
Dominik Bachmann
Alina Leidinger
L. Maanen
Willem H. Zuidema
K. Schulz
533
8
0
24 Nov 2022
ADEPT: A DEbiasing PrompT Framework
AAAI Conference on Artificial Intelligence (AAAI), 2022
Ke Yang
Charles Yu
Yi R. Fung
Pengfei Yu
Heng Ji
416
35
0
10 Nov 2022
HERB: Measuring Hierarchical Regional Bias in Pre-trained Language Models
Yi Zhou
Ge Zhang
Bohao Yang
Chenghua Lin
Shi Wang
Anton Ragni
Jie Fu
250
10
0
05 Nov 2022