Identifying and Reducing Gender Bias in Word-Level Language Models

5 April 2019

Papers citing "Identifying and Reducing Gender Bias in Word-Level Language Models"

50 / 207 papers shown

Title
A Comprehensive Analysis of Large Language Model Outputs: Similarity, Diversity, and Bias Brandon Smith Mohamed Reda Bouadjenek Tahsin Alamgir Kheya Phillip Dawson S. Aryal ALM ELM 26 0 0 14 May 2025
A Comparative Analysis of Ethical and Safety Gaps in LLMs using Relative Danger Coefficient Yehor Tereshchenko Mika Hämäläinen ELM 43 1 0 06 May 2025
BiasGuard: A Reasoning-enhanced Bias Detection Tool For Large Language Models Zhiting Fan Ruizhe Chen Zuozhu Liu 44 0 0 30 Apr 2025
What's the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns Michael A. Hedderich Anyi Wang Raoyuan Zhao Florian Eichin Barbara Plank 30 0 0 22 Apr 2025
Bias Analysis and Mitigation through Protected Attribute Detection and Regard Classification Takuma Udagawa Yang Zhao H. Kanayama Bishwaranjan Bhattacharjee 31 0 0 19 Apr 2025
Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge Riccardo Cantini A. Orsino Massimo Ruggiero Domenico Talia AAML ELM 42 0 0 10 Apr 2025
Human Preferences for Constructive Interactions in Language Model Alignment Yara Kyrychenko Jon Roozenbeek Brandon Davidson S. V. D. Linden Ramit Debnath 46 0 0 05 Mar 2025
C3AI: Crafting and Evaluating Constitutions for Constitutional AI Yara Kyrychenko Ke Zhou Edyta Bogucka Daniele Quercia ELM 45 3 0 21 Feb 2025
Bias in Large Language Models: Origin, Evaluation, and Mitigation Yufei Guo Muzhe Guo Juntao Su Zhou Yang Mengqiu Zhu Hongfei Li Mengyang Qiu Shuo Shuo Liu AILaw 30 9 0 16 Nov 2024
Identifying Implicit Social Biases in Vision-Language Models Kimia Hamidieh Haoran Zhang Walter Gerych Thomas Hartvigsen Marzyeh Ghassemi VLM 30 11 0 01 Nov 2024
FairMT-Bench: Benchmarking Fairness for Multi-turn Dialogue in Conversational LLMs Zhiting Fan Ruizhe Chen Tianxiang Hu Zuozhu Liu 23 7 0 25 Oct 2024
'Simulacrum of Stories': Examining Large Language Models as Qualitative Research Participants Shivani Kapania William Agnew Motahhare Eslami Hoda Heidari Sarah E Fox 39 4 0 28 Sep 2024
Responsible AI in Open Ecosystems: Reconciling Innovation with Risk Assessment and Disclosure Mahasweta Chakraborti Bert Joseph Prestoza Nicholas Vincent Seth Frey 39 1 0 27 Sep 2024
Challenging Fairness: A Comprehensive Exploration of Bias in LLM-Based Recommendations Shahnewaz Karim Sakib Anindya Bijoy Das 31 2 0 17 Sep 2024
Fairness Definitions in Language Models Explained Thang Viet Doan Zhibo Chu Zichong Wang Wenbin Zhang ALM 55 10 0 26 Jul 2024
How Are LLMs Mitigating Stereotyping Harms? Learning from Search Engine Studies Alina Leidinger Richard Rogers 32 5 0 16 Jul 2024
BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs Zhiting Fan Ruizhe Chen Ruiling Xu Zuozhu Liu KELM 21 16 0 14 Jul 2024
The Sociolinguistic Foundations of Language Modeling Jack Grieve Sara Bartl Matteo Fuoli Jason Grafmiller Weihang Huang A. Jawerbaum Akira Murakami Marcus Perlman Dana Roemling Bodo Winter 41 7 0 12 Jul 2024
Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale Adversarial Pre-training Zijian Zhao AAML 40 1 0 11 Jul 2024
SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning Haiwen Diao Bo Wan Xu Jia Yunzhi Zhuge Ying Zhang Huchuan Lu Long Chen VLM 47 4 0 10 Jul 2024
Do Multilingual Large Language Models Mitigate Stereotype Bias? Shangrui Nie Michael Fromm Charles F Welch Rebekka Görge Akbar Karimi Joan Plepi Nazia Afsan Mowmita Nicolas Flores-Herr Mehdi Ali Lucie Flek 32 3 0 08 Jul 2024
Leveraging Large Language Models to Measure Gender Bias in Gendered Languages Erik Derner Sara Sansalvador de la Fuente Yoan Gutiérrez Paloma Moreda Nuria Oliver 32 1 0 19 Jun 2024
Unveiling Encoder-Free Vision-Language Models Haiwen Diao Yufeng Cui Xiaotong Li Yueze Wang Huchuan Lu Xinlong Wang VLM 56 28 0 17 Jun 2024
Investigating Annotator Bias in Large Language Models for Hate Speech Detection Amit Das Zheng Zhang Fatemeh Jamshidi Vinija Jain Aman Chadha Nilanjana Raychawdhary Mary J. Sandage Lauramarie Pope Gerry V. Dozier Cheryl Seals 34 2 0 17 Jun 2024
Why Don't Prompt-Based Fairness Metrics Correlate? A. Zayed Gonçalo Mordido Ioana Baldini Sarath Chandar ALM 47 4 0 09 Jun 2024
A Reality check of the benefits of LLM in business Ming Cheung 35 3 0 09 Jun 2024
Towards Understanding Task-agnostic Debiasing Through the Lenses of Intrinsic Bias and Forgetfulness Guangliang Liu Milad Afshari Xitong Zhang Zhiyu Xue Avrajit Ghosh Bidhan Bashyal Rongrong Wang K. Johnson 27 0 0 06 Jun 2024
AI Agents Under Threat: A Survey of Key Security Challenges and Future Pathways Zehang Deng Yongjian Guo Changzhou Han Wanlun Ma Junwu Xiong Sheng Wen Yang Xiang 44 23 0 04 Jun 2024
The Life Cycle of Large Language Models: A Review of Biases in Education Jinsook Lee Yann Hicke Renzhe Yu Christopher A. Brooks René F. Kizilcec AI4Ed 34 1 0 03 Jun 2024
Model Editing as a Robust and Denoised variant of DPO: A Case Study on Toxicity Rheeya Uppaal Apratim De Yiting He Yiquao Zhong Junjie Hu 37 7 0 22 May 2024
Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs Bilgehan Sel Priya Shanmugasundaram Mohammad Kachuee Kun Zhou Ruoxi Jia Ming Jin LRM 40 2 0 21 May 2024
Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes Damin Zhang Yi Zhang Geetanjali Bihani Julia Taylor Rayz 53 2 0 06 May 2024
More RLHF, More Trust? On The Impact of Human Preference Alignment On Language Model Trustworthiness Aaron Jiaxun Li Satyapriya Krishna Himabindu Lakkaraju 37 3 0 29 Apr 2024
REQUAL-LM: Reliability and Equity through Aggregation in Large Language Models Sana Ebrahimi N. Shahbazi Abolfazl Asudeh 34 1 0 17 Apr 2024
Unifying Bias and Unfairness in Information Retrieval: A Survey of Challenges and Opportunities with Large Language Models Sunhao Dai Chen Xu Shicheng Xu Liang Pang Zhenhua Dong Jun Xu 48 59 0 17 Apr 2024
Polarity Calibration for Opinion Summarization Yuanyuan Lei Kaiqiang Song Sangwoo Cho Xiaoyang Wang Ruihong Huang Dong Yu 30 0 0 02 Apr 2024
Fairness in Large Language Models: A Taxonomic Survey Zhibo Chu Zichong Wang Wenbin Zhang AILaw 43 32 0 31 Mar 2024
Investigating Markers and Drivers of Gender Bias in Machine Translations Peter J. Barclay Ashkan Sami 21 2 0 18 Mar 2024
Take Care of Your Prompt Bias! Investigating and Mitigating Prompt Bias in Factual Knowledge Extraction Ziyang Xu Keqin Peng Liang Ding Dacheng Tao Xiliang Lu 34 10 0 15 Mar 2024
Few-Shot Fairness: Unveiling LLM's Potential for Fairness-Aware Classification Garima Chhikara Anurag Sharma Kripabandhu Ghosh Abhijnan Chakraborty 39 14 0 28 Feb 2024
Prejudice and Volatility: A Statistical Framework for Measuring Social Discrimination in Large Language Models Yiran Liu Ke Yang Zehan Qi Xiao-Yang Liu Yang Yu U. I. Urbana-Champaign 39 1 0 23 Feb 2024
A Unified Framework and Dataset for Assessing Societal Bias in Vision-Language Models Ashutosh Sathe Prachi Jain Sunayana Sitaram 58 1 0 21 Feb 2024
From Prejudice to Parity: A New Approach to Debiasing Large Language Model Word Embeddings Aishik Rakshit Smriti Singh Shuvam Keshari Arijit Ghosh Chowdhury Vinija Jain Aman Chadha 37 0 0 18 Feb 2024
Network Formation and Dynamics Among Multi-LLMs Marios Papachristou Yuan Yuan 48 11 0 16 Feb 2024
MAFIA: Multi-Adapter Fused Inclusive LanguAge Models Prachi Jain Ashutosh Sathe Varun Gumma Kabir Ahuja Sunayana Sitaram 28 1 0 12 Feb 2024
Systematic Biases in LLM Simulations of Debates Amir Taubenfeld Yaniv Dover Roi Reichart Ariel Goldstein 30 49 0 06 Feb 2024
UnMASKed: Quantifying Gender Biases in Masked Language Models through Linguistically Informed Job Market Prompts Inigo Parra 13 1 0 28 Jan 2024
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems Tianyu Cui Yanling Wang Chuanpu Fu Yong Xiao Sijia Li ... Junwu Xiong Xinyu Kong Zujie Wen Ke Xu Qi Li 57 56 0 11 Jan 2024
New Job, New Gender? Measuring the Social Bias in Image Generation Models Wenxuan Wang Haonan Bai Jen-tse Huang Yuxuan Wan Youliang Yuan Haoyi Qiu Nanyun Peng Michael R. Lyu 47 20 0 01 Jan 2024
The Media Bias Taxonomy: A Systematic Literature Review on the Forms and Automated Detection of Media Bias Timo Spinde Smilla Hinterreiter Fabian Haak Terry Ruas Helge Giese Norman Meuschke Bela Gipp 19 12 0 26 Dec 2023