Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings

3 April 2019

Papers citing "Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings"

50 / 78 papers shown

Title
A Comprehensive Analysis of Large Language Model Outputs: Similarity, Diversity, and Bias Brandon Smith Mohamed Reda Bouadjenek Tahsin Alamgir Kheya Phillip Dawson S. Aryal ALM ELM 39 0 0 14 May 2025
A Comparative Analysis of Ethical and Safety Gaps in LLMs using Relative Danger Coefficient Yehor Tereshchenko Mika Hämäläinen ELM 51 1 0 06 May 2025
Bridging the Fairness Gap: Enhancing Pre-trained Models with LLM-Generated Sentences Liu Yu Ludie Guo Ping Kuang Fan Zhou 44 0 0 12 Jan 2025
Collapsed Language Models Promote Fairness Jingxuan Xu Wuyang Chen Linyi Li Yao Zhao Yunchao Wei 46 0 0 06 Oct 2024
A Comprehensive Analysis of Static Word Embeddings for Turkish Karahan Sarıtaş Cahid Arda Öz Tunga Güngör 23 3 0 13 May 2024
Polarity Calibration for Opinion Summarization Yuanyuan Lei Kaiqiang Song Sangwoo Cho Xiaoyang Wang Ruihong Huang Dong Yu 38 0 0 02 Apr 2024
Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality Rahul Zalkikar Kanchan Chandra 37 1 0 21 Feb 2024
OccuQuest: Mitigating Occupational Bias for Inclusive Large Language Models Mingfeng Xue Dayiheng Liu Kexin Yang Guanting Dong Wenqiang Lei Zheng Yuan Chang Zhou Jingren Zhou LLMAG 27 2 0 25 Oct 2023
NBIAS: A Natural Language Processing Framework for Bias Identification in Text Shaina Razaa Muskan Garg Deepak John Reji Syed Raza Bashir Chen Ding 42 45 0 03 Aug 2023
Gender-tuning: Empowering Fine-tuning for Debiasing Pre-trained Language Models Somayeh Ghanbarzadeh Yan-ping Huang Hamid Palangi R. C. Moreno Hamed Khanpour 42 12 0 20 Jul 2023
A Portrait of Emotion: Empowering Self-Expression through AI-Generated Art Y. Lee Yongha Park S. Hahn 19 3 0 26 Apr 2023
Transcending the "Male Code": Implicit Masculine Biases in NLP Contexts Katie Seaborn Shruti Chandra Thibault Fabre 23 11 0 22 Apr 2023
Beyond Accuracy: A Critical Review of Fairness in Machine Learning for Mobile and Wearable Computing Sofia Yfantidou Marios Constantinides Dimitris Spathis Athena Vakali Daniele Quercia F. Kawsar HAI FaML 28 18 0 27 Mar 2023
Logic Against Bias: Textual Entailment Mitigates Stereotypical Sentence Reasoning Hongyin Luo James R. Glass NAI 29 7 0 10 Mar 2023
Counter-GAP: Counterfactual Bias Evaluation through Gendered Ambiguous Pronouns Zhongbin Xie Vid Kocijan Thomas Lukasiewicz Oana-Maria Camburu 10 2 0 11 Feb 2023
FineDeb: A Debiasing Framework for Language Models Akash Saravanan Dhruv Mullick Habibur Rahman Nidhi Hegde FedML AI4CE 26 4 0 05 Feb 2023
Debiasing Vision-Language Models via Biased Prompts Ching-Yao Chuang Varun Jampani Yuanzhen Li Antonio Torralba Stefanie Jegelka VLM 34 97 0 31 Jan 2023
Trustworthy Social Bias Measurement Rishi Bommasani Percy Liang 34 10 0 20 Dec 2022
Mind Your Bias: A Critical Review of Bias Detection Methods for Contextual Language Models Silke Husse Andreas Spitz 28 6 0 15 Nov 2022
No Word Embedding Model Is Perfect: Evaluating the Representation Accuracy for Social Bias in the Media Maximilian Spliethover Maximilian Keiff Henning Wachsmuth 26 4 0 07 Nov 2022
The Shared Task on Gender Rewriting Bashar Alhafni Nizar Habash Houda Bouamor Ossama Obeid Sultan Alrowili ... Mohamed Gabr Abderrahmane Issam Abdelrahim Qaddoumi K. Vijay-Shanker Mahmoud Zyate 34 1 0 22 Oct 2022
Detecting Unintended Social Bias in Toxic Language Datasets Nihar Ranjan Sahoo Himanshu Gupta P. Bhattacharyya 21 18 0 21 Oct 2022
The User-Aware Arabic Gender Rewriter Bashar Alhafni Ossama Obeid Nizar Habash 29 2 0 14 Oct 2022
Social-Group-Agnostic Word Embedding Debiasing via the Stereotype Content Model Ali Omrani Brendan Kennedy M. Atari Morteza Dehghani 29 1 0 11 Oct 2022
Re-contextualizing Fairness in NLP: The Case of India Shaily Bhatt Sunipa Dev Partha P. Talukdar Shachi Dave Vinodkumar Prabhakaran 32 54 0 25 Sep 2022
Debiasing Word Embeddings with Nonlinear Geometry Lu Cheng Nayoung Kim Huan Liu 24 5 0 29 Aug 2022
Toward Understanding Bias Correlations for Mitigation in NLP Lu Cheng Suyu Ge Huan Liu 39 8 0 24 May 2022
Detoxifying Language Models with a Toxic Corpus Yoon A Park Frank Rudzicz 27 6 0 30 Apr 2022
Fair and Argumentative Language Modeling for Computational Argumentation Carolin Holtermann Anne Lauscher Simone Paolo Ponzetto 24 21 0 08 Apr 2022
Mapping the Multilingual Margins: Intersectional Biases of Sentiment Analysis Systems in English, Spanish, and Arabic Antonio Camara Nina Taneja Tamjeed Azad Emily Allaway R. Zemel 24 21 0 07 Apr 2022
Combining Static and Contextualised Multilingual Embeddings Katharina Hämmerl Jindrich Libovický Alexander Fraser 27 10 0 17 Mar 2022
Sense Embeddings are also Biased--Evaluating Social Biases in Static and Contextualised Sense Embeddings Yi Zhou Masahiro Kaneko Danushka Bollegala 36 23 0 14 Mar 2022
Speciesist Language and Nonhuman Animal Bias in English Masked Language Models Masashi Takeshita Rafal Rzepka K. Araki 34 6 0 10 Mar 2022
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model Shaden Smith M. Patwary Brandon Norick P. LeGresley Samyam Rajbhandari ... M. Shoeybi Yuxiong He Michael Houston Saurabh Tiwary Bryan Catanzaro MoE 93 733 0 28 Jan 2022
Causal effect of racial bias in data and machine learning algorithms on user persuasiveness & discriminatory decision making: An Empirical Study Kinshuk Sengupta Praveen Ranjan Srivastava 36 6 0 22 Jan 2022
A Survey on Gender Bias in Natural Language Processing Karolina Stañczak Isabelle Augenstein 30 110 0 28 Dec 2021
Simple Text Detoxification by Identifying a Linear Toxic Subspace in Language Model Embeddings Andrew Wang Mohit Sudhakar Yangfeng Ji 17 2 0 15 Dec 2021
Survey of Generative Methods for Social Media Analysis Stan Matwin Aristides Milios P. Prałat Amílcar Soares Franccois Théberge 27 3 0 13 Dec 2021
PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English Michael Kranzlein Emma Manning Siyao Peng Shira Wein Aryaman Arora Bradford Salen Nathan Schneider 19 8 0 23 Oct 2021
An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models Nicholas Meade Elinor Poole-Dayan Siva Reddy 22 123 0 16 Oct 2021
Unpacking the Interdependent Systems of Discrimination: Ableist Bias in NLP Systems through an Intersectional Lens Saad Hassan Matt Huenerfauth Cecilia Ovesdotter Alm 51 38 0 01 Oct 2021
Evaluating Debiasing Techniques for Intersectional Biases Shivashankar Subramanian Xudong Han Timothy Baldwin Trevor Cohn Lea Frermann 110 48 0 21 Sep 2021
Mitigating Language-Dependent Ethnic Bias in BERT Jaimeen Ahn Alice Oh 142 92 0 13 Sep 2021
Are Gender-Neutral Queries Really Gender-Neutral? Mitigating Gender Bias in Image Search Jialu Wang Yang Liu Junfeng Fang FaML 157 95 0 12 Sep 2021
Latent Hatred: A Benchmark for Understanding Implicit Hate Speech Mai Elsherief Caleb Ziems D. Muchlinski Vaishnavi Anupindi Jordyn Seybolt M. D. Choudhury Diyi Yang 106 239 0 11 Sep 2021
Assessing the Reliability of Word Embedding Gender Bias Measures Yupei Du Qixiang Fang D. Nguyen 49 21 0 10 Sep 2021
Left, Right, and Gender: Exploring Interaction Traces to Mitigate Human Biases Emily Wall Arpit Narechania Adam Joseph Coscia Jamal R Paden Alex Endert 23 31 0 07 Aug 2021
A Survey of Race, Racism, and Anti-Racism in NLP Anjalie Field Su Lin Blodgett Zeerak Talat Yulia Tsvetkov 42 122 0 21 Jun 2021
Understanding and Countering Stereotypes: A Computational Approach to the Stereotype Content Model Kathleen C. Fraser I. Nejadgholi S. Kiritchenko 19 37 0 04 Jun 2021
An Interpretability Illusion for BERT Tolga Bolukbasi Adam Pearce Ann Yuan Andy Coenen Emily Reif Fernanda Viégas Martin Wattenberg MILM FAtt 40 68 0 14 Apr 2021