Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings

21 July 2016

Adam Kalai

Papers citing "Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings"

50 / 778 papers shown

Title
An Auditing Test To Detect Behavioral Shift in Language Models Leo Richter Xuanli He Pasquale Minervini Matt J. Kusner 97 0 0 25 Oct 2024
Natural Language Processing for Human Resources: A Survey Naoki Otani Nikita Bhutani Estevam R. Hruschka VLM 122 0 0 21 Oct 2024
LLMs are Biased Teachers: Evaluating LLM Bias in Personalized Education Iain Xie Weissburg Sathvika Anand Sharon Levy Haewon Jeong 220 8 0 17 Oct 2024
Improving Instruction-Following in Language Models through Activation Steering Alessandro Stolfo Vidhisha Balachandran Safoora Yousefi Eric Horvitz Besmira Nushi LLMSV 156 28 0 15 Oct 2024
Organizing Unstructured Image Collections using Natural Language Mingxuan Liu Zhun Zhong Jun Li Gianni Franchi Subhankar Roy Elisa Ricci VLM 145 5 0 07 Oct 2024
Collapsed Language Models Promote Fairness Jingxuan Xu Wuyang Chen Linyi Li Yao Zhao Yunchao Wei 122 0 0 06 Oct 2024
Attention layers provably solve single-location regression Pierre Marion Raphael Berthier Gérard Biau Claire Boyer 476 7 0 02 Oct 2024
Mitigating Propensity Bias of Large Language Models for Recommender Systems Guixian Zhang Guan Yuan Debo Cheng Lin Liu Jiuyong Li Shichao Zhang 111 5 0 30 Sep 2024
A Comprehensive Survey of Bias in LLMs: Current Landscape and Future Directions Rajesh Ranjan Shailja Gupta Surya Narayan Singh 69 11 0 24 Sep 2024
Bias Begets Bias: The Impact of Biased Embeddings on Diffusion Models Sahil Kuchlous Marvin Li Jeffrey G. Wang 78 0 0 15 Sep 2024
Identity-related Speech Suppression in Generative AI Content Moderation Oghenefejiro Isaacs Anigboro Charlie M. Crawford Danaë Metaxa Sorelle A. Friedler Sorelle A. Friedler 145 0 0 09 Sep 2024
Counterfactual Fairness by Combining Factual and Counterfactual Predictions Zeyu Zhou Tianci Liu Ruqi Bai Jing Gao Murat Kocaoglu David I. Inouye 130 2 0 03 Sep 2024
Multi-Output Distributional Fairness via Post-Processing Gang Li Qihang Lin Ayush Ghosh Tianbao Yang 171 0 0 31 Aug 2024
GenderCARE: A Comprehensive Framework for Assessing and Reducing Gender Bias in Large Language Models Kunsheng Tang Wenbo Zhou Jie Zhang Aishan Liu Gelei Deng Shuai Li Peigui Qi Weiming Zhang Tianwei Zhang Nenghai Yu 135 4 0 22 Aug 2024
Does Liking Yellow Imply Driving a School Bus? Semantic Leakage in Language Models Hila Gonen Terra Blevins Alisa Liu Luke Zettlemoyer Noah A. Smith 142 5 0 12 Aug 2024
ML-EAT: A Multilevel Embedding Association Test for Interpretable and Transparent Social Science Robert Wolfe Alexis Hiniker Bill Howe 72 0 0 04 Aug 2024
She Works, He Works: A Curious Exploration of Gender Bias in AI-Generated Imagery Amalia Foka 25 1 0 26 Jul 2024
Social Bias in Large Language Models For Bangla: An Empirical Study on Gender and Religious Bias Jayanta Sadhu Maneesha Rani Saha Rifat Shahriyar 83 4 0 03 Jul 2024
Images Speak Louder than Words: Understanding and Mitigating Bias in Vision-Language Model from a Causal Mediation Perspective Zhaotian Weng Zijun Gao Jerone Andrews Jieyu Zhao 82 1 0 03 Jul 2024
CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models Song Wang Peng Wang Tong Zhou Yushun Dong Zhen Tan Jundong Li CoGe 163 9 0 02 Jul 2024
Exploring Safety-Utility Trade-Offs in Personalized Language Models Anvesh Rao Vijjini Somnath Basu Roy Chowdhury Snigdha Chaturvedi 184 9 0 17 Jun 2024
Data Quality in Edge Machine Learning: A State-of-the-Art Survey M. D. Belgoumri Mohamed Reda Bouadjenek Sunil Aryal Hakim Hacid 106 1 0 01 Jun 2024
The Impossibility of Fair LLMs Jacy Reese Anthis Kristian Lum Michael Ekstrand Avi Feller Alexander D’Amour FaML 130 14 0 28 May 2024
Quite Good, but Not Enough: Nationality Bias in Large Language Models -- A Case Study of ChatGPT Shucheng Zhu Weikang Wang Ying Liu 70 6 0 11 May 2024
Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes Damin Zhang Yi Zhang Geetanjali Bihani Julia Taylor Rayz 162 3 0 06 May 2024
Large Language Models (LLMs) as Agents for Augmented Democracy Jairo Gudiño-Rosero Umberto Grandi César A. Hidalgo LLMAG 99 128 0 06 May 2024
Blind Spots and Biases: Exploring the Role of Annotator Cognitive Biases in NLP Sanjana Gautam Mukund Srinath 100 6 0 29 Apr 2024
MisgenderMender: A Community-Informed Approach to Interventions for Misgendering Tamanna Hossain Sunipa Dev Sameer Singh 101 5 0 23 Apr 2024
Forcing Diffuse Distributions out of Language Models Yiming Zhang Avi Schwarzschild Nicholas Carlini Zico Kolter Daphne Ippolito ALM DiffM 112 20 0 16 Apr 2024
Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward Xuan Xie Jiayang Song Zhehua Zhou Yuheng Huang Da Song Lei Ma OffRL 128 6 0 12 Apr 2024
What is Your Favorite Gender, MLM? Gender Bias Evaluation in Multilingual Masked Language Models Emily M. Bender Solon Barocas Robert Sim Hanna Wallach. 2021 64 3 0 09 Apr 2024
SafetyPrompts: a Systematic Review of Open Datasets for Evaluating and Improving Large Language Model Safety Paul Röttger Fabio Pernisi Bertie Vidgen Dirk Hovy ELM KELM 167 39 0 08 Apr 2024
Counterfactual Fairness through Transforming Data Orthogonal to Bias Shuyi Chen Shixiang Zhu FaML 182 2 0 26 Mar 2024
Best of Both Worlds: A Pliable and Generalizable Neuro-Symbolic Approach for Relation Classification Robert Vacareanu F. Alam M. Islam Haris Riaz Mihai Surdeanu NAI 81 2 0 05 Mar 2024
Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality Rahul Zalkikar Kanchan Chandra 139 1 0 21 Feb 2024
Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation Kristian Lum Jacy Reese Anthis Chirag Nagpal Alex DÁmour Alexander D’Amour 122 17 0 20 Feb 2024
Primary and Secondary Factor Consistency as Domain Knowledge to Guide Happiness Computing in Online Assessment Xiaohua Wu Lin Li Xiaohui Tao Frank Xing Jingling Yuan 48 0 0 17 Feb 2024
ConFit: Improving Resume-Job Matching using Data Augmentation and Contrastive Learning Xiao Yu Jinzhong Zhang Zhou Yu 69 1 0 29 Jan 2024
Multilingual Text-to-Image Generation Magnifies Gender Stereotypes and Prompt Engineering May Not Help You Felix Friedrich Katharina Hämmerl P. Schramowski Manuel Brack Jindrich Libovický Kristian Kersting Alexander Fraser EGVM 159 14 0 29 Jan 2024
Black-Box Access is Insufficient for Rigorous AI Audits Stephen Casper Carson Ezell Charlotte Siegmann Noam Kolt Taylor Lynn Curtis ... Michael Gerovitch David Bau Max Tegmark David M. Krueger Dylan Hadfield-Menell AAML 154 95 0 25 Jan 2024
A Comprehensive View of the Biases of Toxicity and Sentiment Analysis Methods Towards Utterances with African American English Expressions Guilherme H. Resende L. F. Nery Fabrício Benevenuto Savvas Zannettou Flavio Figueiredo 64 7 0 23 Jan 2024
Digital Divides in Scene Recognition: Uncovering Socioeconomic Biases in Deep Learning Systems Michelle R. Greene Mariam Josyula Wentao Si Jennifer A. Hart 97 0 0 23 Jan 2024
Manipulating Feature Visualizations with Gradient Slingshots Dilyara Bareeva Marina M.-C. Höhne Alexander Warnecke Lukas Pirch Klaus-Robert Müller Konrad Rieck Sebastian Lapuschkin Kirill Bykov AAML 76 6 0 11 Jan 2024
Large Language Models for Conducting Advanced Text Analytics Information Systems Research Benjamin Ampel Chi-Heng Yang Junjie Hu Hsinchun Chen 118 8 0 27 Dec 2023
PEFTDebias : Capturing debiasing information using PEFTs Sumit Agarwal Aditya Srikanth Veerubhotla Srijan Bansal 80 3 0 01 Dec 2023
(Ir)rationality in AI: State of the Art, Research Challenges and Open Questions Olivia Macmillan-Scott Mirco Musolesi 96 1 0 28 Nov 2023
P^3SUM: Preserving Author's Perspective in News Summarization with Diffusion Language Models Yuhan Liu Shangbin Feng Xiaochuang Han Vidhisha Balachandran Chan Young Park Sachin Kumar Yulia Tsvetkov DiffM 86 4 0 16 Nov 2023
ChiSCor: A Corpus of Freely Told Fantasy Stories by Dutch Children for Computational Linguistics and Cognitive Science Bram van Dijk Max J. van Duijn Suzan Verberne M. Spruit 72 2 0 31 Oct 2023
StereoMap: Quantifying the Awareness of Human-like Stereotypes in Large Language Models Sullam Jeoung Yubin Ge Jana Diesner 66 5 0 20 Oct 2023
Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model Abhijith Chintam Rahel Beloch Willem H. Zuidema Michael Hanna Oskar van der Wal 82 18 0 19 Oct 2023