Intrinsic Bias Metrics Do Not Correlate with Application Bias

Annual Meeting of the Association for Computational Linguistics (ACL), 2020
31 December 2020
Seraphina Goldfarb-Tarrant
Rebecca Marchant
Ricardo Muñoz Sánchez
Mugdha Pandya
Adam Lopez

Papers citing "Intrinsic Bias Metrics Do Not Correlate with Application Bias"

50 / 81 papers shown
A word association network methodology for evaluating implicit biases in LLMs compared to humans
Katherine Abramski
Giulio Rossetti
Massimo Stella
190
1
0
28 Oct 2025
Intrinsic Meets Extrinsic Fairness: Assessing the Downstream Impact of Bias Mitigation in Large Language Models
Mina Arzaghi
Alireza Dehghanpour Farashah
Florian Carichon
Golnoosh Farnadi
237
2
0
19 Sep 2025
Bias after Prompting: Persistent Discrimination in Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
N. Sivakumar
Natalie Mackraz
Samira Khorshidi
Krishna Patel
B. Theobald
Luca Zappella
N. Apostoloff
AI4CE
233
1
0
09 Sep 2025
GANDiff FR: Hybrid GAN Diffusion Synthesis for Causal Bias Attribution in Face Recognition
Md Asgor Hossain Reaj
Rajan Das Gupta
Md. Yeasin Rahat
Nafiz Fahad
Md Jawadul Hasan
Tze Hui Liew
249
0
0
15 Aug 2025
Do Biased Models Have Biased Thoughts?
Swati Rajwal
Shivank Garg
Reem Abdel-Salam
Abdelrahman Zayed
LRM
218
0
0
08 Aug 2025
Toward Revealing Nuanced Biases in Medical LLMs
Farzana Islam Adiba
Rahmatollah Beheshti
180
0
0
26 Jul 2025
Uncertainty Quantification for Evaluating Machine Translation Bias
Ieva Staliunaite
Julius Cheng
Andreas Vlachos
UQLM
366
1
0
24 Jul 2025
Exploring Gender Bias in Large Language Models: An In-depth Dive into the German Language
Kristin Gnadt
David Thulke
Simone Kopeinik
Ralf Schluter
267
2
0
22 Jul 2025
Are Bias Evaluation Methods Biased?
Lina Berrayana
Sean Rooney
Luis Garces-Erice
Ioana Giurgiu
ELM
260
4
0
20 Jun 2025
Understanding and Meeting Practitioner Needs When Measuring Representational Harms Caused by LLM-Based Systems
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Emma Harvey
Emily Sheng
Su Lin Blodgett
Alexandra Chouldechova
Jean Garcia-Gathright
Alexandra Olteanu
Hanna M. Wallach
236
4
0
04 Jun 2025
Deontological Keyword Bias: The Impact of Modal Expressions on Normative Judgments of Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Bumjin Park
Jinsil Lee
Jaesik Choi
252
0
0
01 Jun 2025
Developing A Framework to Support Human Evaluation of Bias in Generated Free Response Text
Jennifer Healey
Laurie Byrum
Md Nadeem Akhtar
Surabhi Bhargava
Moumita Sinha
297
0
0
05 May 2025
Agree to Disagree? A Meta-Evaluation of LLM Misgendering
Arjun Subramonian
Vagrant Gautam
Preethi Seshadri
Dietrich Klakow
Kai-Wei Chang
Zhaoxin Fan
613
3
0
23 Apr 2025
Toward an Evaluation Science for Generative AI Systems
Laura Weidinger
Deb Raji
Hanna M. Wallach
Margaret Mitchell
Angelina Wang
Olawale Salaudeen
Rishi Bommasani
Sayash Kapoor
Deep Ganguli
Sanmi Koyejo
EGVM, ELM
453
37
0
07 Mar 2025
Linear Representations of Political Perspective Emerge in Large Language Models
International Conference on Learning Representations (ICLR), 2025
Junsol Kim
James Evans
Aaron Schein
458
23
0
03 Mar 2025
Evaluating the Effect of Retrieval Augmentation on Social Biases
Tianhui Zhang
Yi Zhou
Danushka Bollegala
387
2
0
24 Feb 2025
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases
Journal of Open Source Software (JOSS), 2025
Dylan Bouchard
Mohit Singh Chauhan
David Skarbrevik
Viren Bajaj
Zeya Ahmad
380
6
0
06 Jan 2025
Ethics Whitepaper: Whitepaper on Ethical Research into Large Language Models
Eddie L. Ungless
Nikolas Vitsakis
Zeerak Talat
James Garforth
Bjorn Ross
Arno Onken
Atoosa Kasirzadeh
Alexandra Birch
372
3
0
17 Oct 2024
ML-EAT: A Multilevel Embedding Association Test for Interpretable and Transparent Social Science
AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2024
Robert Wolfe
Alexis Hiniker
Bill Howe
322
2
0
04 Aug 2024
Fairness Definitions in Language Models Explained
Thang Viet Doan
Zhibo Chu
Sribala Vidyadhari Chinta
Wenbin Zhang
ALM
456
21
0
26 Jul 2024
Extrinsic Evaluation of Cultural Competence in Large Language Models
Shaily Bhatt
Fernando Diaz
ELM, EGVM
396
20
0
17 Jun 2024
Why Don't Prompt-Based Fairness Metrics Correlate?
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
A. Zayed
Gonçalo Mordido
Ioana Baldini
Sarath Chandar
ALM
335
10
0
09 Jun 2024
The Impossibility of Fair LLMs
Jacy Reese Anthis
Kristian Lum
Michael Ekstrand
Avi Feller
Alexander D’Amour
FaML
553
29
0
28 May 2024
Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution
Flor Miriam Plaza del Arco
Amanda Cercas Curry
Alba Curry
Gavin Abercrombie
Dirk Hovy
513
45
0
05 Mar 2024
Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation
Kristian Lum
Jacy Reese Anthis
Chirag Nagpal
Alexander D’Amour
611
38
0
20 Feb 2024
Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting
Masahiro Kaneko
Danushka Bollegala
Naoaki Okazaki
Timothy Baldwin
LRM
320
58
0
28 Jan 2024
Co$^2$PT: Mitigating Bias in Pre-trained Language Models through Counterfactual Contrastive Prompt Tuning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xiangjue Dong
Ziwei Zhu
Zhuoer Wang
Maria Teleki
James Caverlee
364
20
0
19 Oct 2023
Survey of Social Bias in Vision-Language Models
Nayeon Lee
Yejin Bang
Holy Lovenia
Samuel Cahyawijaya
Wenliang Dai
Pascale Fung
VLM
443
34
0
24 Sep 2023
TIDE: Textual Identity Detection for Evaluating and Augmenting Classification and Language Models
Emmanuel Klu
Sameer Sethi
253
0
0
07 Sep 2023
Bias and Fairness in Large Language Models: A Survey
Computational Linguistics (CL), 2023
Isabel O. Gallegos
Ryan Rossi
Joe Barrow
Md Mehrab Tanjim
Sungchul Kim
Franck Dernoncourt
Tong Yu
Ruiyi Zhang
Nesreen Ahmed
AILaw
482
1,011
0
02 Sep 2023
Thesis Distillation: Investigating The Impact of Bias in NLP Models on Hate Speech Detection
Fatma Elsafoury
381
5
0
31 Aug 2023
Gender mobility in the labor market with skills-based matching models
Ajaya Adhikari
S. Vethman
Daan Vos
Marc V. Lenz
Ioana Cocu
I. Tolios
C. Veenman
89
2
0
17 Jul 2023
Sociodemographic Bias in Language Models: A Survey and Forward Path
Vipul Gupta
Pranav Narayanan Venkit
Shomir Wilson
R. Passonneau
569
37
0
13 Jun 2023
Detecting and Mitigating Indirect Stereotypes in Word Embeddings
Erin E. George
Joyce A. Chew
Deanna Needell
200
0
0
23 May 2023
Having Beer after Prayer? Measuring Cultural Bias in Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Tarek Naous
Michael Joseph Ryan
Alan Ritter
Wei Xu
674
158
0
23 May 2023
This Prompt is Measuring <MASK>: Evaluating Bias Evaluation in Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Seraphina Goldfarb-Tarrant
Eddie L. Ungless
Esma Balkir
Su Lin Blodgett
290
15
0
22 May 2023
Bias Beyond English: Counterfactual Tests for Bias in Sentiment Analysis in Four Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Seraphina Goldfarb-Tarrant
Adam Lopez
Roi Blanco
Diego Marcheggiani
233
17
0
19 May 2023
On the Independence of Association Bias and Empirical Fairness in Language Models
Conference on Fairness, Accountability and Transparency (FAccT), 2023
Laura Cabello
Anna Katrine van Zee
Anders Søgaard
265
38
0
20 Apr 2023
ACROCPoLis: A Descriptive Framework for Making Sense of Fairness
Conference on Fairness, Accountability and Transparency (FAccT), 2023
Andrea Aler Tubella
Dimitri Coelho Mollo
Adam Dahlgren Lindstrom
Hannah Devinney
Virginia Dignum
...
Anna Jonsson
T. Kampik
Tom Lenaerts
Julian Alfredo Mendez
J. Nieves
256
14
0
19 Apr 2023
Overwriting Pretrained Bias with Finetuning Data
IEEE International Conference on Computer Vision (ICCV), 2023
Angelina Wang
Olga Russakovsky
364
49
0
10 Mar 2023
In-Depth Look at Word Filling Societal Bias Measures
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Matúš Pikuliak
Ivana Benová
Viktor Bachratý
299
12
0
24 Feb 2023
Fairness in Language Models Beyond English: Gaps and Challenges
Findings (Findings), 2023
Krithika Ramesh
Sunayana Sitaram
Monojit Choudhury
400
30
0
24 Feb 2023
BiasTestGPT: Using ChatGPT for Social Bias Testing of Language Models
Rafal Kocielnik
Shrimai Prabhumoye
Vivian Zhang
Roy Jiang
R. Alvarez
Anima Anandkumar
412
11
0
14 Feb 2023
Fair Enough: Standardizing Evaluation and Model Selection for Fairness Research in NLP
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Xudong Han
Timothy Baldwin
Trevor Cohn
282
15
0
11 Feb 2023
A Comprehensive Study of Gender Bias in Chemical Named Entity Recognition Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Xingmeng Zhao
A. Niazi
Anthony Rios
276
3
0
24 Dec 2022
Trustworthy Social Bias Measurement
AAAI/ACM Conference on AI, Ethics, and Society (AIES), 2022
Rishi Bommasani
Abigail Z. Jacobs
278
14
0
20 Dec 2022
On Second Thought, Let's Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Omar Shaikh
Hongxin Zhang
William B. Held
Michael S. Bernstein
Diyi Yang
ReLM, LRM
556
257
0
15 Dec 2022
Undesirable Biases in NLP: Addressing Challenges of Measurement
Oskar van der Wal
Dominik Bachmann
Alina Leidinger
L. Maanen
Willem H. Zuidema
K. Schulz
533
8
0
24 Nov 2022
ADEPT: A DEbiasing PrompT Framework
AAAI Conference on Artificial Intelligence (AAAI), 2022
Ke Yang
Charles Yu
Yi R. Fung
Pengfei Yu
Heng Ji
416
35
0
10 Nov 2022
HERB: Measuring Hierarchical Regional Bias in Pre-trained Language Models
Yi Zhou
Ge Zhang
Bohao Yang
Chenghua Lin
Shi Wang
Anton Ragni
Jie Fu
250
10
0
05 Nov 2022