1904.04047
Cited By
Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings
3 April 2019
Thomas Manzini
Y. Lim
Yulia Tsvetkov
A. Black
FaML
Papers citing
"Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings"
50 / 78 papers shown
Title
A Comprehensive Analysis of Large Language Model Outputs: Similarity, Diversity, and Bias
Brandon Smith
Mohamed Reda Bouadjenek
Tahsin Alamgir Kheya
Phillip Dawson
S. Aryal
ALM
ELM
39
0
0
14 May 2025
A Comparative Analysis of Ethical and Safety Gaps in LLMs using Relative Danger Coefficient
Yehor Tereshchenko
Mika Hämäläinen
ELM
51
1
0
06 May 2025
Bridging the Fairness Gap: Enhancing Pre-trained Models with LLM-Generated Sentences
Liu Yu
Ludie Guo
Ping Kuang
Fan Zhou
44
0
0
12 Jan 2025
Collapsed Language Models Promote Fairness
Jingxuan Xu
Wuyang Chen
Linyi Li
Yao Zhao
Yunchao Wei
46
0
0
06 Oct 2024
A Comprehensive Analysis of Static Word Embeddings for Turkish
Karahan Sarıtaş
Cahid Arda Öz
Tunga Güngör
23
3
0
13 May 2024
Polarity Calibration for Opinion Summarization
Yuanyuan Lei
Kaiqiang Song
Sangwoo Cho
Xiaoyang Wang
Ruihong Huang
Dong Yu
38
0
0
02 Apr 2024
Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality
Rahul Zalkikar
Kanchan Chandra
37
1
0
21 Feb 2024
OccuQuest: Mitigating Occupational Bias for Inclusive Large Language Models
Mingfeng Xue
Dayiheng Liu
Kexin Yang
Guanting Dong
Wenqiang Lei
Zheng Yuan
Chang Zhou
Jingren Zhou
LLMAG
27
2
0
25 Oct 2023
NBIAS: A Natural Language Processing Framework for Bias Identification in Text
Shaina Raza
Muskan Garg
Deepak John Reji
Syed Raza Bashir
Chen Ding
42
45
0
03 Aug 2023
Gender-tuning: Empowering Fine-tuning for Debiasing Pre-trained Language Models
Somayeh Ghanbarzadeh
Yan-ping Huang
Hamid Palangi
R. C. Moreno
Hamed Khanpour
42
12
0
20 Jul 2023
A Portrait of Emotion: Empowering Self-Expression through AI-Generated Art
Y. Lee
Yongha Park
S. Hahn
19
3
0
26 Apr 2023
Transcending the "Male Code": Implicit Masculine Biases in NLP Contexts
Katie Seaborn
Shruti Chandra
Thibault Fabre
23
11
0
22 Apr 2023
Beyond Accuracy: A Critical Review of Fairness in Machine Learning for Mobile and Wearable Computing
Sofia Yfantidou
Marios Constantinides
Dimitris Spathis
Athena Vakali
Daniele Quercia
F. Kawsar
HAI
FaML
28
18
0
27 Mar 2023
Logic Against Bias: Textual Entailment Mitigates Stereotypical Sentence Reasoning
Hongyin Luo
James R. Glass
NAI
29
7
0
10 Mar 2023
Counter-GAP: Counterfactual Bias Evaluation through Gendered Ambiguous Pronouns
Zhongbin Xie
Vid Kocijan
Thomas Lukasiewicz
Oana-Maria Camburu
10
2
0
11 Feb 2023
FineDeb: A Debiasing Framework for Language Models
Akash Saravanan
Dhruv Mullick
Habibur Rahman
Nidhi Hegde
FedML
AI4CE
26
4
0
05 Feb 2023
Debiasing Vision-Language Models via Biased Prompts
Ching-Yao Chuang
Varun Jampani
Yuanzhen Li
Antonio Torralba
Stefanie Jegelka
VLM
34
97
0
31 Jan 2023
Trustworthy Social Bias Measurement
Rishi Bommasani
Percy Liang
34
10
0
20 Dec 2022
Mind Your Bias: A Critical Review of Bias Detection Methods for Contextual Language Models
Silke Husse
Andreas Spitz
28
6
0
15 Nov 2022
No Word Embedding Model Is Perfect: Evaluating the Representation Accuracy for Social Bias in the Media
Maximilian Spliethöver
Maximilian Keiff
Henning Wachsmuth
26
4
0
07 Nov 2022
The Shared Task on Gender Rewriting
Bashar Alhafni
Nizar Habash
Houda Bouamor
Ossama Obeid
Sultan Alrowili
...
Mohamed Gabr
Abderrahmane Issam
Abdelrahim Qaddoumi
K. Vijay-Shanker
Mahmoud Zyate
34
1
0
22 Oct 2022
Detecting Unintended Social Bias in Toxic Language Datasets
Nihar Ranjan Sahoo
Himanshu Gupta
P. Bhattacharyya
21
18
0
21 Oct 2022
The User-Aware Arabic Gender Rewriter
Bashar Alhafni
Ossama Obeid
Nizar Habash
29
2
0
14 Oct 2022
Social-Group-Agnostic Word Embedding Debiasing via the Stereotype Content Model
Ali Omrani
Brendan Kennedy
M. Atari
Morteza Dehghani
29
1
0
11 Oct 2022
Re-contextualizing Fairness in NLP: The Case of India
Shaily Bhatt
Sunipa Dev
Partha P. Talukdar
Shachi Dave
Vinodkumar Prabhakaran
32
54
0
25 Sep 2022
Debiasing Word Embeddings with Nonlinear Geometry
Lu Cheng
Nayoung Kim
Huan Liu
24
5
0
29 Aug 2022
Toward Understanding Bias Correlations for Mitigation in NLP
Lu Cheng
Suyu Ge
Huan Liu
39
8
0
24 May 2022
Detoxifying Language Models with a Toxic Corpus
Yoon A Park
Frank Rudzicz
27
6
0
30 Apr 2022
Fair and Argumentative Language Modeling for Computational Argumentation
Carolin Holtermann
Anne Lauscher
Simone Paolo Ponzetto
24
21
0
08 Apr 2022
Mapping the Multilingual Margins: Intersectional Biases of Sentiment Analysis Systems in English, Spanish, and Arabic
Antonio Camara
Nina Taneja
Tamjeed Azad
Emily Allaway
R. Zemel
24
21
0
07 Apr 2022
Combining Static and Contextualised Multilingual Embeddings
Katharina Hämmerl
Jindřich Libovický
Alexander Fraser
27
10
0
17 Mar 2022
Sense Embeddings are also Biased – Evaluating Social Biases in Static and Contextualised Sense Embeddings
Yi Zhou
Masahiro Kaneko
Danushka Bollegala
36
23
0
14 Mar 2022
Speciesist Language and Nonhuman Animal Bias in English Masked Language Models
Masashi Takeshita
Rafal Rzepka
K. Araki
34
6
0
10 Mar 2022
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model
Shaden Smith
M. Patwary
Brandon Norick
P. LeGresley
Samyam Rajbhandari
...
M. Shoeybi
Yuxiong He
Michael Houston
Saurabh Tiwary
Bryan Catanzaro
MoE
93
733
0
28 Jan 2022
Causal effect of racial bias in data and machine learning algorithms on user persuasiveness & discriminatory decision making: An Empirical Study
Kinshuk Sengupta
Praveen Ranjan Srivastava
36
6
0
22 Jan 2022
A Survey on Gender Bias in Natural Language Processing
Karolina Stańczak
Isabelle Augenstein
30
110
0
28 Dec 2021
Simple Text Detoxification by Identifying a Linear Toxic Subspace in Language Model Embeddings
Andrew Wang
Mohit Sudhakar
Yangfeng Ji
17
2
0
15 Dec 2021
Survey of Generative Methods for Social Media Analysis
Stan Matwin
Aristides Milios
P. Prałat
Amílcar Soares
François Théberge
27
3
0
13 Dec 2021
PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English
Michael Kranzlein
Emma Manning
Siyao Peng
Shira Wein
Aryaman Arora
Bradford Salen
Nathan Schneider
19
8
0
23 Oct 2021
An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models
Nicholas Meade
Elinor Poole-Dayan
Siva Reddy
22
123
0
16 Oct 2021
Unpacking the Interdependent Systems of Discrimination: Ableist Bias in NLP Systems through an Intersectional Lens
Saad Hassan
Matt Huenerfauth
Cecilia Ovesdotter Alm
51
38
0
01 Oct 2021
Evaluating Debiasing Techniques for Intersectional Biases
Shivashankar Subramanian
Xudong Han
Timothy Baldwin
Trevor Cohn
Lea Frermann
110
48
0
21 Sep 2021
Mitigating Language-Dependent Ethnic Bias in BERT
Jaimeen Ahn
Alice Oh
142
92
0
13 Sep 2021
Are Gender-Neutral Queries Really Gender-Neutral? Mitigating Gender Bias in Image Search
Jialu Wang
Yang Liu
Junfeng Fang
FaML
157
95
0
12 Sep 2021
Latent Hatred: A Benchmark for Understanding Implicit Hate Speech
Mai Elsherief
Caleb Ziems
D. Muchlinski
Vaishnavi Anupindi
Jordyn Seybolt
M. D. Choudhury
Diyi Yang
106
239
0
11 Sep 2021
Assessing the Reliability of Word Embedding Gender Bias Measures
Yupei Du
Qixiang Fang
D. Nguyen
49
21
0
10 Sep 2021
Left, Right, and Gender: Exploring Interaction Traces to Mitigate Human Biases
Emily Wall
Arpit Narechania
Adam Joseph Coscia
Jamal R Paden
Alex Endert
23
31
0
07 Aug 2021
A Survey of Race, Racism, and Anti-Racism in NLP
Anjalie Field
Su Lin Blodgett
Zeerak Talat
Yulia Tsvetkov
42
122
0
21 Jun 2021
Understanding and Countering Stereotypes: A Computational Approach to the Stereotype Content Model
Kathleen C. Fraser
I. Nejadgholi
S. Kiritchenko
19
37
0
04 Jun 2021
An Interpretability Illusion for BERT
Tolga Bolukbasi
Adam Pearce
Ann Yuan
Andy Coenen
Emily Reif
Fernanda Viégas
Martin Wattenberg
MILM
FAtt
40
68
0
14 Apr 2021