Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1607.06520
Cited By
Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings
21 July 2016
Tolga Bolukbasi
Kai-Wei Chang
James Zou
Venkatesh Saligrama
Adam Kalai
CVBM
FaML
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings"
50 / 778 papers shown
Title
An Auditing Test To Detect Behavioral Shift in Language Models
Leo Richter
Xuanli He
Pasquale Minervini
Matt J. Kusner
97
0
0
25 Oct 2024
Natural Language Processing for Human Resources: A Survey
Naoki Otani
Nikita Bhutani
Estevam R. Hruschka
VLM
122
0
0
21 Oct 2024
LLMs are Biased Teachers: Evaluating LLM Bias in Personalized Education
Iain Xie Weissburg
Sathvika Anand
Sharon Levy
Haewon Jeong
220
8
0
17 Oct 2024
Improving Instruction-Following in Language Models through Activation Steering
Alessandro Stolfo
Vidhisha Balachandran
Safoora Yousefi
Eric Horvitz
Besmira Nushi
LLMSV
156
28
0
15 Oct 2024
Organizing Unstructured Image Collections using Natural Language
Mingxuan Liu
Zhun Zhong
Jun Li
Gianni Franchi
Subhankar Roy
Elisa Ricci
VLM
145
5
0
07 Oct 2024
Collapsed Language Models Promote Fairness
Jingxuan Xu
Wuyang Chen
Linyi Li
Yao Zhao
Yunchao Wei
122
0
0
06 Oct 2024
Attention layers provably solve single-location regression
Pierre Marion
Raphael Berthier
Gérard Biau
Claire Boyer
476
7
0
02 Oct 2024
Mitigating Propensity Bias of Large Language Models for Recommender Systems
Guixian Zhang
Guan Yuan
Debo Cheng
Lin Liu
Jiuyong Li
Shichao Zhang
111
5
0
30 Sep 2024
A Comprehensive Survey of Bias in LLMs: Current Landscape and Future Directions
Rajesh Ranjan
Shailja Gupta
Surya Narayan Singh
69
11
0
24 Sep 2024
Bias Begets Bias: The Impact of Biased Embeddings on Diffusion Models
Sahil Kuchlous
Marvin Li
Jeffrey G. Wang
78
0
0
15 Sep 2024
Identity-related Speech Suppression in Generative AI Content Moderation
Oghenefejiro Isaacs Anigboro
Charlie M. Crawford
Danaë Metaxa
Sorelle A. Friedler
Sorelle A. Friedler
145
0
0
09 Sep 2024
Counterfactual Fairness by Combining Factual and Counterfactual Predictions
Zeyu Zhou
Tianci Liu
Ruqi Bai
Jing Gao
Murat Kocaoglu
David I. Inouye
130
2
0
03 Sep 2024
Multi-Output Distributional Fairness via Post-Processing
Gang Li
Qihang Lin
Ayush Ghosh
Tianbao Yang
171
0
0
31 Aug 2024
GenderCARE: A Comprehensive Framework for Assessing and Reducing Gender Bias in Large Language Models
Kunsheng Tang
Wenbo Zhou
Jie Zhang
Aishan Liu
Gelei Deng
Shuai Li
Peigui Qi
Weiming Zhang
Tianwei Zhang
Nenghai Yu
135
4
0
22 Aug 2024
Does Liking Yellow Imply Driving a School Bus? Semantic Leakage in Language Models
Hila Gonen
Terra Blevins
Alisa Liu
Luke Zettlemoyer
Noah A. Smith
142
5
0
12 Aug 2024
ML-EAT: A Multilevel Embedding Association Test for Interpretable and Transparent Social Science
Robert Wolfe
Alexis Hiniker
Bill Howe
72
0
0
04 Aug 2024
She Works, He Works: A Curious Exploration of Gender Bias in AI-Generated Imagery
Amalia Foka
25
1
0
26 Jul 2024
Social Bias in Large Language Models For Bangla: An Empirical Study on Gender and Religious Bias
Jayanta Sadhu
Maneesha Rani Saha
Rifat Shahriyar
83
4
0
03 Jul 2024
Images Speak Louder than Words: Understanding and Mitigating Bias in Vision-Language Model from a Causal Mediation Perspective
Zhaotian Weng
Zijun Gao
Jerone Andrews
Jieyu Zhao
82
1
0
03 Jul 2024
CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models
Song Wang
Peng Wang
Tong Zhou
Yushun Dong
Zhen Tan
Jundong Li
CoGe
163
9
0
02 Jul 2024
Exploring Safety-Utility Trade-Offs in Personalized Language Models
Anvesh Rao Vijjini
Somnath Basu Roy Chowdhury
Snigdha Chaturvedi
184
9
0
17 Jun 2024
Data Quality in Edge Machine Learning: A State-of-the-Art Survey
M. D. Belgoumri
Mohamed Reda Bouadjenek
Sunil Aryal
Hakim Hacid
106
1
0
01 Jun 2024
The Impossibility of Fair LLMs
Jacy Reese Anthis
Kristian Lum
Michael Ekstrand
Avi Feller
Alexander D’Amour
FaML
130
14
0
28 May 2024
Quite Good, but Not Enough: Nationality Bias in Large Language Models -- A Case Study of ChatGPT
Shucheng Zhu
Weikang Wang
Ying Liu
70
6
0
11 May 2024
Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes
Damin Zhang
Yi Zhang
Geetanjali Bihani
Julia Taylor Rayz
162
3
0
06 May 2024
Large Language Models (LLMs) as Agents for Augmented Democracy
Jairo Gudiño-Rosero
Umberto Grandi
César A. Hidalgo
LLMAG
99
128
0
06 May 2024
Blind Spots and Biases: Exploring the Role of Annotator Cognitive Biases in NLP
Sanjana Gautam
Mukund Srinath
100
6
0
29 Apr 2024
MisgenderMender: A Community-Informed Approach to Interventions for Misgendering
Tamanna Hossain
Sunipa Dev
Sameer Singh
101
5
0
23 Apr 2024
Forcing Diffuse Distributions out of Language Models
Yiming Zhang
Avi Schwarzschild
Nicholas Carlini
Zico Kolter
Daphne Ippolito
ALM
DiffM
112
20
0
16 Apr 2024
Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward
Xuan Xie
Jiayang Song
Zhehua Zhou
Yuheng Huang
Da Song
Lei Ma
OffRL
128
6
0
12 Apr 2024
What is Your Favorite Gender, MLM? Gender Bias Evaluation in Multilingual Masked Language Models
Emily M. Bender
Solon Barocas
Robert Sim
Hanna Wallach. 2021
64
3
0
09 Apr 2024
SafetyPrompts: a Systematic Review of Open Datasets for Evaluating and Improving Large Language Model Safety
Paul Röttger
Fabio Pernisi
Bertie Vidgen
Dirk Hovy
ELM
KELM
167
39
0
08 Apr 2024
Counterfactual Fairness through Transforming Data Orthogonal to Bias
Shuyi Chen
Shixiang Zhu
FaML
182
2
0
26 Mar 2024
Best of Both Worlds: A Pliable and Generalizable Neuro-Symbolic Approach for Relation Classification
Robert Vacareanu
F. Alam
M. Islam
Haris Riaz
Mihai Surdeanu
NAI
81
2
0
05 Mar 2024
Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality
Rahul Zalkikar
Kanchan Chandra
139
1
0
21 Feb 2024
Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation
Kristian Lum
Jacy Reese Anthis
Chirag Nagpal
Alex DÁmour
Alexander D’Amour
122
17
0
20 Feb 2024
Primary and Secondary Factor Consistency as Domain Knowledge to Guide Happiness Computing in Online Assessment
Xiaohua Wu
Lin Li
Xiaohui Tao
Frank Xing
Jingling Yuan
48
0
0
17 Feb 2024
ConFit: Improving Resume-Job Matching using Data Augmentation and Contrastive Learning
Xiao Yu
Jinzhong Zhang
Zhou Yu
69
1
0
29 Jan 2024
Multilingual Text-to-Image Generation Magnifies Gender Stereotypes and Prompt Engineering May Not Help You
Felix Friedrich
Katharina Hämmerl
P. Schramowski
Manuel Brack
Jindrich Libovický
Kristian Kersting
Alexander Fraser
EGVM
159
14
0
29 Jan 2024
Black-Box Access is Insufficient for Rigorous AI Audits
Stephen Casper
Carson Ezell
Charlotte Siegmann
Noam Kolt
Taylor Lynn Curtis
...
Michael Gerovitch
David Bau
Max Tegmark
David M. Krueger
Dylan Hadfield-Menell
AAML
154
95
0
25 Jan 2024
A Comprehensive View of the Biases of Toxicity and Sentiment Analysis Methods Towards Utterances with African American English Expressions
Guilherme H. Resende
L. F. Nery
Fabrício Benevenuto
Savvas Zannettou
Flavio Figueiredo
64
7
0
23 Jan 2024
Digital Divides in Scene Recognition: Uncovering Socioeconomic Biases in Deep Learning Systems
Michelle R. Greene
Mariam Josyula
Wentao Si
Jennifer A. Hart
97
0
0
23 Jan 2024
Manipulating Feature Visualizations with Gradient Slingshots
Dilyara Bareeva
Marina M.-C. Höhne
Alexander Warnecke
Lukas Pirch
Klaus-Robert Müller
Konrad Rieck
Sebastian Lapuschkin
Kirill Bykov
AAML
76
6
0
11 Jan 2024
Large Language Models for Conducting Advanced Text Analytics Information Systems Research
Benjamin Ampel
Chi-Heng Yang
Junjie Hu
Hsinchun Chen
118
8
0
27 Dec 2023
PEFTDebias : Capturing debiasing information using PEFTs
Sumit Agarwal
Aditya Srikanth Veerubhotla
Srijan Bansal
80
3
0
01 Dec 2023
(Ir)rationality in AI: State of the Art, Research Challenges and Open Questions
Olivia Macmillan-Scott
Mirco Musolesi
96
1
0
28 Nov 2023
P^3SUM: Preserving Author's Perspective in News Summarization with Diffusion Language Models
Yuhan Liu
Shangbin Feng
Xiaochuang Han
Vidhisha Balachandran
Chan Young Park
Sachin Kumar
Yulia Tsvetkov
DiffM
86
4
0
16 Nov 2023
ChiSCor: A Corpus of Freely Told Fantasy Stories by Dutch Children for Computational Linguistics and Cognitive Science
Bram van Dijk
Max J. van Duijn
Suzan Verberne
M. Spruit
72
2
0
31 Oct 2023
StereoMap: Quantifying the Awareness of Human-like Stereotypes in Large Language Models
Sullam Jeoung
Yubin Ge
Jana Diesner
66
5
0
20 Oct 2023
Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model
Abhijith Chintam
Rahel Beloch
Willem H. Zuidema
Michael Hanna
Oskar van der Wal
82
18
0
19 Oct 2023
Previous
1
2
3
4
5
...
14
15
16
Next