1909.00871
It's All in the Name: Mitigating Gender Bias with Name-Based Counterfactual Data Substitution
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
2 September 2019
Rowan Hall Maudslay
Hila Gonen
Robert Bamler
Simone Teufel
Papers citing
"It's All in the Name: Mitigating Gender Bias with Name-Based Counterfactual Data Substitution"
50 / 106 papers shown
Title
AesBiasBench: Evaluating Bias and Alignment in Multimodal Language Models for Personalized Image Aesthetic Assessment
Kun-Jhih Li
L. Po
Hongzheng Yang
Xuyuan Xu
Kangcheng Liu
Yuzhi Zhao
68
0
0
15 Sep 2025
A Survey on Data Security in Large Language Models
Kang Chen
Xiuze Zhou
Y. Lin
Jinhe Su
Yuanhui Yu
Li Shen
F. Lin
PILM
ELM
170
0
1
04 Aug 2025
What Is the Point of Equality in Machine Learning Fairness? Beyond Equality of Opportunity
ACM Journal on Responsible Computing (JRC), 2025
Youjin Kong
FaML
177
0
0
20 Jun 2025
FairSteer: Inference Time Debiasing for LLMs with Dynamic Activation Steering
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yongbin Li
Zhiting Fan
Ruizhe Chen
Xiaotang Gai
Luqi Gong
Yan Zhang
Zuozhu Liu
LLMSV
241
16
0
20 Apr 2025
Name of Thrones: Evaluating How LLMs Rank Student Names, Race, and Gender in Status Hierarchies
Annabella Sakunkoo
Jonathan Sakunkoo
46
0
0
15 Apr 2025
On the Mutual Influence of Gender and Occupation in LLM Representations
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Haozhe An
Connor Baumler
Abhilasha Sancheti
Rachel Rudinger
AI4CE
226
2
0
09 Mar 2025
Bias in Large Language Models: Origin, Evaluation, and Mitigation
Yufei Guo
Muzhe Guo
Juntao Su
Zhou Yang
Mengqiu Zhu
Hongfei Li
Mengyang Qiu
Shuo Shuo Liu
AILaw
260
69
0
16 Nov 2024
Collapsed Language Models Promote Fairness
International Conference on Learning Representations (ICLR), 2024
Jingxuan Xu
Wuyang Chen
Linyi Li
Yao Zhao
Yunchao Wei
359
1
0
06 Oct 2024
On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Abhilasha Sancheti
Haozhe An
Rachel Rudinger
213
0
0
05 Oct 2024
Does Liking Yellow Imply Driving a School Bus? Semantic Leakage in Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Hila Gonen
Terra Blevins
Alisa Liu
Luke Zettlemoyer
Noah A. Smith
412
10
0
12 Aug 2024
FairFlow: An Automated Approach to Model-based Counterfactual Data Augmentation For NLP
E. Tokpo
T. Calders
124
3
0
23 Jul 2024
Who is better at math, Jenny or Jingzhen? Uncovering Stereotypes in Large Language Models
Zara Siddique
Liam D. Turner
Luis Espinosa-Anke
153
2
0
09 Jul 2024
Uplifting Lower-Income Data: Strategies for Socioeconomic Perspective Shifts in Vision-Language Models
Joan Nwatu
Oana Ignat
Amélie Reymond
200
0
0
02 Jul 2024
Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender?
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Haozhe An
Christabel Acquaye
Colin Wang
Zongxia Li
Rachel Rudinger
220
18
0
15 Jun 2024
Unique Security and Privacy Threats of Large Language Models: A Comprehensive Survey
Shang Wang
Tianqing Zhu
B. Liu
Ming Ding
Dayong Ye
Wanlei Zhou
PILM
257
22
0
12 Jun 2024
Deconstructing The Ethics of Large Language Models from Long-standing Issues to New-emerging Dilemmas
Chengyuan Deng
Yiqun Duan
Xin Jin
Heng Chang
Yijun Tian
...
Kuofeng Gao
Sihong He
Jun Zhuang
Lu Cheng
Haohan Wang
AILaw
198
28
0
08 Jun 2024
Stop! In the Name of Flaws: Disentangling Personal Names and Sociodemographic Attributes in NLP
Vagrant Gautam
Arjun Subramonian
Anne Lauscher
O. Keyes
200
15
0
27 May 2024
Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes
International Conference on Computational Linguistics (COLING), 2024
Damin Zhang
Yi Zhang
Geetanjali Bihani
Julia Taylor Rayz
340
4
0
06 May 2024
Utilizing Adversarial Examples for Bias Mitigation and Accuracy Enhancement
Pushkar Shukla
Dhruv Srikanth
Lee Cohen
Matthew Turk
AAML
193
0
0
18 Apr 2024
Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs
Kanishka Misra
Kyle Mahowald
402
36
0
28 Mar 2024
Measuring Bias in a Ranked List using Term-based Representations
European Conference on Information Retrieval (ECIR), 2024
Amin Abolghasemi
Leif Azzopardi
Arian Askari
Maarten de Rijke
Suzan Verberne
142
9
0
09 Mar 2024
AXOLOTL: Fairness through Assisted Self-Debiasing of Large Language Model Outputs
Sana Ebrahimi
Kaiwen Chen
Abolfazl Asudeh
Gautam Das
Nick Koudas
158
10
0
01 Mar 2024
Securing Reliability: A Brief Overview on Enhancing In-Context Learning for Foundation Models
Yunpeng Huang
Yaonan Gu
Jingwei Xu
Zhihong Zhu
Zhaorun Chen
Xiaoxing Ma
159
4
0
27 Feb 2024
Multilingual Text-to-Image Generation Magnifies Gender Stereotypes and Prompt Engineering May Not Help You
Felix Friedrich
Katharina Hämmerl
P. Schramowski
Manuel Brack
Jindrich Libovický
Kristian Kersting
Kangyang Luo
EGVM
466
17
0
29 Jan 2024
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems
Tianyu Cui
Yanling Wang
Chuanpu Fu
Yong Xiao
Sijia Li
...
Junwu Xiong
Xinyu Kong
ZuJie Wen
Ke Xu
Qi Li
248
87
0
11 Jan 2024
Tackling Bias in Pre-trained Language Models: Current Trends and Under-represented Societies
Vithya Yogarajan
Gillian Dobbie
Te Taka Keegan
R. Neuwirth
ALM
291
17
0
03 Dec 2023
The Ethics of Automating Legal Actors
Transactions of the Association for Computational Linguistics (TACL), 2023
Josef Valvoda
Alec Thompson
Robert Bamler
Simone Teufel
AILaw
ELM
189
2
0
01 Dec 2023
BERT Goes Off-Topic: Investigating the Domain Transfer Challenge using Genre Classification
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
D. Roussinov
Serge Sharoff
109
3
0
27 Nov 2023
Leveraging Diffusion Perturbations for Measuring Fairness in Computer Vision
AAAI Conference on Artificial Intelligence (AAAI), 2023
Nicholas Lui
Bryan Chia
William Berrios
Candace Ross
Douwe Kiela
214
2
0
25 Nov 2023
Model-based Counterfactual Generator for Gender Bias Mitigation
E. Tokpo
T. Calders
90
0
0
06 Nov 2023
On the Interplay between Fairness and Explainability
Stephanie Brandl
Emanuele Bugliarello
Ilias Chalkidis
FaML
206
7
0
25 Oct 2023
Learning from Red Teaming: Gender Bias Provocation and Mitigation in Large Language Models
Hsuan Su
Cheng-Chu Cheng
Hua Farn
Shachi H. Kumar
Saurav Sahay
Shang-Tse Chen
Hung-yi Lee
134
6
0
17 Oct 2023
Will the Prince Get True Love's Kiss? On the Model Sensitivity to Gender Perturbation over Fairytale Texts
Christina Chance
Da Yin
Dakuo Wang
Kai-Wei Chang
228
0
0
16 Oct 2023
Survey of Social Bias in Vision-Language Models
Nayeon Lee
Yejin Bang
Holy Lovenia
Samuel Cahyawijaya
Wenliang Dai
Pascale Fung
VLM
319
28
0
24 Sep 2023
In-Contextual Gender Bias Suppression for Large Language Models
Findings (Findings), 2023
Daisuke Oba
Masahiro Kaneko
Danushka Bollegala
202
12
0
13 Sep 2023
Bias and Fairness in Large Language Models: A Survey
Computational Linguistics (CL), 2023
Isabel O. Gallegos
Ryan Rossi
Joe Barrow
Md Mehrab Tanjim
Sungchul Kim
Franck Dernoncourt
Tong Yu
Ruiyi Zhang
Nesreen Ahmed
AILaw
314
860
0
02 Sep 2023
Towards Robust Aspect-based Sentiment Analysis through Non-counterfactual Augmentations
Xinyu Liu
Yanl Ding
Kaikai An
Chunyang Xiao
Pranava Madhyastha
Tong Xiao
Jingbo Zhu
125
2
0
24 Jun 2023
Sociodemographic Bias in Language Models: A Survey and Forward Path
Vipul Gupta
Pranav Narayanan Venkit
Shomir Wilson
R. Passonneau
319
34
0
13 Jun 2023
Gender-Inclusive Grammatical Error Correction through Augmentation
Workshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2023
Gunnar Lund
Kostiantyn Omelianchuk
Igor Samokhin
173
8
0
12 Jun 2023
Language Models Get a Gender Makeover: Mitigating Gender Bias with Few-Shot Data Interventions
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Himanshu Thakur
Atishay Jain
Praneetha Vaddamanu
Paul Pu Liang
Louis-Philippe Morency
223
43
0
07 Jun 2023
Gender, names and other mysteries: Towards the ambiguous for gender-inclusive translation
Danielle Saunders
Katrina Olsen
78
7
0
07 Jun 2023
Nichelle and Nancy: The Influence of Demographic Attributes and Tokenization Length on First Name Biases
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Haozhe An
Rachel Rudinger
133
11
0
26 May 2023
Detecting and Mitigating Indirect Stereotypes in Word Embeddings
Erin E. George
Joyce A. Chew
Deanna Needell
125
0
0
23 May 2023
Out-of-Distribution Generalization in Text Classification: Past, Present, and Future
Linyi Yang
Yangqiu Song
Xuan Ren
Chenyang Lyu
Yidong Wang
Lingqiao Liu
Yongfeng Zhang
Jennifer Foster
Yue Zhang
OOD
246
3
0
23 May 2023
Improving Classifier Robustness through Active Generation of Pairwise Counterfactuals
Ananth Balashankar
Xuezhi Wang
Yao Qin
Ben Packer
Nithum Thain
Jilin Chen
Ed H. Chi
Alex Beutel
130
1
0
22 May 2023
Should We Attend More or Less? Modulating Attention for Fairness
A. Zayed
Gonçalo Mordido
Samira Shabanian
Sarath Chandar
199
15
0
22 May 2023
In the Name of Fairness: Assessing the Bias in Clinical Record De-identification
Conference on Fairness, Accountability and Transparency (FAccT), 2023
Yuxin Xiao
S. Lim
Tom Pollard
Marzyeh Ghassemi
167
18
0
18 May 2023
Surfacing Biases in Large Language Models using Contrastive Input Decoding
G. Yona
Or Honovich
Itay Laish
Roee Aharoni
162
14
0
12 May 2023
Counter-GAP: Counterfactual Bias Evaluation through Gendered Ambiguous Pronouns
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Zhongbin Xie
Vid Kocijan
Thomas Lukasiewicz
Oana-Maria Camburu
124
5
0
11 Feb 2023
How Far Can It Go?: On Intrinsic Gender Bias Mitigation for Text Classification
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
E. Tokpo
Pieter Delobelle
Bettina Berendt
T. Calders
147
11
0
30 Jan 2023