Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.03035
Cited By
Identifying and Reducing Gender Bias in Word-Level Language Models
5 April 2019
Shikha Bordia
Samuel R. Bowman
FaML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Identifying and Reducing Gender Bias in Word-Level Language Models"
50 / 207 papers shown
Title
A Comprehensive Analysis of Large Language Model Outputs: Similarity, Diversity, and Bias
Brandon Smith
Mohamed Reda Bouadjenek
Tahsin Alamgir Kheya
Phillip Dawson
S. Aryal
ALM
ELM
26
0
0
14 May 2025
A Comparative Analysis of Ethical and Safety Gaps in LLMs using Relative Danger Coefficient
Yehor Tereshchenko
Mika Hämäläinen
ELM
43
1
0
06 May 2025
BiasGuard: A Reasoning-enhanced Bias Detection Tool For Large Language Models
Zhiting Fan
Ruizhe Chen
Zuozhu Liu
44
0
0
30 Apr 2025
What's the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns
Michael A. Hedderich
Anyi Wang
Raoyuan Zhao
Florian Eichin
Barbara Plank
30
0
0
22 Apr 2025
Bias Analysis and Mitigation through Protected Attribute Detection and Regard Classification
Takuma Udagawa
Yang Zhao
H. Kanayama
Bishwaranjan Bhattacharjee
31
0
0
19 Apr 2025
Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge
Riccardo Cantini
A. Orsino
Massimo Ruggiero
Domenico Talia
AAML
ELM
42
0
0
10 Apr 2025
Human Preferences for Constructive Interactions in Language Model Alignment
Yara Kyrychenko
Jon Roozenbeek
Brandon Davidson
S. V. D. Linden
Ramit Debnath
46
0
0
05 Mar 2025
C3AI: Crafting and Evaluating Constitutions for Constitutional AI
Yara Kyrychenko
Ke Zhou
Edyta Bogucka
Daniele Quercia
ELM
45
3
0
21 Feb 2025
Bias in Large Language Models: Origin, Evaluation, and Mitigation
Yufei Guo
Muzhe Guo
Juntao Su
Zhou Yang
Mengqiu Zhu
Hongfei Li
Mengyang Qiu
Shuo Shuo Liu
AILaw
30
9
0
16 Nov 2024
Identifying Implicit Social Biases in Vision-Language Models
Kimia Hamidieh
Haoran Zhang
Walter Gerych
Thomas Hartvigsen
Marzyeh Ghassemi
VLM
30
11
0
01 Nov 2024
FairMT-Bench: Benchmarking Fairness for Multi-turn Dialogue in Conversational LLMs
Zhiting Fan
Ruizhe Chen
Tianxiang Hu
Zuozhu Liu
23
7
0
25 Oct 2024
'Simulacrum of Stories': Examining Large Language Models as Qualitative Research Participants
Shivani Kapania
William Agnew
Motahhare Eslami
Hoda Heidari
Sarah E Fox
39
4
0
28 Sep 2024
Responsible AI in Open Ecosystems: Reconciling Innovation with Risk Assessment and Disclosure
Mahasweta Chakraborti
Bert Joseph Prestoza
Nicholas Vincent
Seth Frey
39
1
0
27 Sep 2024
Challenging Fairness: A Comprehensive Exploration of Bias in LLM-Based Recommendations
Shahnewaz Karim Sakib
Anindya Bijoy Das
31
2
0
17 Sep 2024
Fairness Definitions in Language Models Explained
Thang Viet Doan
Zhibo Chu
Zichong Wang
Wenbin Zhang
ALM
55
10
0
26 Jul 2024
How Are LLMs Mitigating Stereotyping Harms? Learning from Search Engine Studies
Alina Leidinger
Richard Rogers
32
5
0
16 Jul 2024
BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs
Zhiting Fan
Ruizhe Chen
Ruiling Xu
Zuozhu Liu
KELM
21
16
0
14 Jul 2024
The Sociolinguistic Foundations of Language Modeling
Jack Grieve
Sara Bartl
Matteo Fuoli
Jason Grafmiller
Weihang Huang
A. Jawerbaum
Akira Murakami
Marcus Perlman
Dana Roemling
Bodo Winter
41
7
0
12 Jul 2024
Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale Adversarial Pre-training
Zijian Zhao
AAML
40
1
0
11 Jul 2024
SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning
Haiwen Diao
Bo Wan
Xu Jia
Yunzhi Zhuge
Ying Zhang
Huchuan Lu
Long Chen
VLM
47
4
0
10 Jul 2024
Do Multilingual Large Language Models Mitigate Stereotype Bias?
Shangrui Nie
Michael Fromm
Charles F Welch
Rebekka Görge
Akbar Karimi
Joan Plepi
Nazia Afsan Mowmita
Nicolas Flores-Herr
Mehdi Ali
Lucie Flek
32
3
0
08 Jul 2024
Leveraging Large Language Models to Measure Gender Bias in Gendered Languages
Erik Derner
Sara Sansalvador de la Fuente
Yoan Gutiérrez
Paloma Moreda
Nuria Oliver
32
1
0
19 Jun 2024
Unveiling Encoder-Free Vision-Language Models
Haiwen Diao
Yufeng Cui
Xiaotong Li
Yueze Wang
Huchuan Lu
Xinlong Wang
VLM
56
28
0
17 Jun 2024
Investigating Annotator Bias in Large Language Models for Hate Speech Detection
Amit Das
Zheng Zhang
Fatemeh Jamshidi
Vinija Jain
Aman Chadha
Nilanjana Raychawdhary
Mary J. Sandage
Lauramarie Pope
Gerry V. Dozier
Cheryl Seals
34
2
0
17 Jun 2024
Why Don't Prompt-Based Fairness Metrics Correlate?
A. Zayed
Gonçalo Mordido
Ioana Baldini
Sarath Chandar
ALM
47
4
0
09 Jun 2024
A Reality check of the benefits of LLM in business
Ming Cheung
35
3
0
09 Jun 2024
Towards Understanding Task-agnostic Debiasing Through the Lenses of Intrinsic Bias and Forgetfulness
Guangliang Liu
Milad Afshari
Xitong Zhang
Zhiyu Xue
Avrajit Ghosh
Bidhan Bashyal
Rongrong Wang
K. Johnson
27
0
0
06 Jun 2024
AI Agents Under Threat: A Survey of Key Security Challenges and Future Pathways
Zehang Deng
Yongjian Guo
Changzhou Han
Wanlun Ma
Junwu Xiong
Sheng Wen
Yang Xiang
44
23
0
04 Jun 2024
The Life Cycle of Large Language Models: A Review of Biases in Education
Jinsook Lee
Yann Hicke
Renzhe Yu
Christopher A. Brooks
René F. Kizilcec
AI4Ed
34
1
0
03 Jun 2024
Model Editing as a Robust and Denoised variant of DPO: A Case Study on Toxicity
Rheeya Uppaal
Apratim De
Yiting He
Yiquao Zhong
Junjie Hu
37
7
0
22 May 2024
Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs
Bilgehan Sel
Priya Shanmugasundaram
Mohammad Kachuee
Kun Zhou
Ruoxi Jia
Ming Jin
LRM
40
2
0
21 May 2024
Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes
Damin Zhang
Yi Zhang
Geetanjali Bihani
Julia Taylor Rayz
53
2
0
06 May 2024
More RLHF, More Trust? On The Impact of Human Preference Alignment On Language Model Trustworthiness
Aaron Jiaxun Li
Satyapriya Krishna
Himabindu Lakkaraju
37
3
0
29 Apr 2024
REQUAL-LM: Reliability and Equity through Aggregation in Large Language Models
Sana Ebrahimi
N. Shahbazi
Abolfazl Asudeh
34
1
0
17 Apr 2024
Unifying Bias and Unfairness in Information Retrieval: A Survey of Challenges and Opportunities with Large Language Models
Sunhao Dai
Chen Xu
Shicheng Xu
Liang Pang
Zhenhua Dong
Jun Xu
48
59
0
17 Apr 2024
Polarity Calibration for Opinion Summarization
Yuanyuan Lei
Kaiqiang Song
Sangwoo Cho
Xiaoyang Wang
Ruihong Huang
Dong Yu
30
0
0
02 Apr 2024
Fairness in Large Language Models: A Taxonomic Survey
Zhibo Chu
Zichong Wang
Wenbin Zhang
AILaw
43
32
0
31 Mar 2024
Investigating Markers and Drivers of Gender Bias in Machine Translations
Peter J. Barclay
Ashkan Sami
21
2
0
18 Mar 2024
Take Care of Your Prompt Bias! Investigating and Mitigating Prompt Bias in Factual Knowledge Extraction
Ziyang Xu
Keqin Peng
Liang Ding
Dacheng Tao
Xiliang Lu
34
10
0
15 Mar 2024
Few-Shot Fairness: Unveiling LLM's Potential for Fairness-Aware Classification
Garima Chhikara
Anurag Sharma
Kripabandhu Ghosh
Abhijnan Chakraborty
39
14
0
28 Feb 2024
Prejudice and Volatility: A Statistical Framework for Measuring Social Discrimination in Large Language Models
Yiran Liu
Ke Yang
Zehan Qi
Xiao-Yang Liu
Yang Yu
U. I. Urbana-Champaign
39
1
0
23 Feb 2024
A Unified Framework and Dataset for Assessing Societal Bias in Vision-Language Models
Ashutosh Sathe
Prachi Jain
Sunayana Sitaram
58
1
0
21 Feb 2024
From Prejudice to Parity: A New Approach to Debiasing Large Language Model Word Embeddings
Aishik Rakshit
Smriti Singh
Shuvam Keshari
Arijit Ghosh Chowdhury
Vinija Jain
Aman Chadha
37
0
0
18 Feb 2024
Network Formation and Dynamics Among Multi-LLMs
Marios Papachristou
Yuan Yuan
48
11
0
16 Feb 2024
MAFIA: Multi-Adapter Fused Inclusive LanguAge Models
Prachi Jain
Ashutosh Sathe
Varun Gumma
Kabir Ahuja
Sunayana Sitaram
28
1
0
12 Feb 2024
Systematic Biases in LLM Simulations of Debates
Amir Taubenfeld
Yaniv Dover
Roi Reichart
Ariel Goldstein
30
49
0
06 Feb 2024
UnMASKed: Quantifying Gender Biases in Masked Language Models through Linguistically Informed Job Market Prompts
Inigo Parra
13
1
0
28 Jan 2024
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems
Tianyu Cui
Yanling Wang
Chuanpu Fu
Yong Xiao
Sijia Li
...
Junwu Xiong
Xinyu Kong
Zujie Wen
Ke Xu
Qi Li
57
56
0
11 Jan 2024
New Job, New Gender? Measuring the Social Bias in Image Generation Models
Wenxuan Wang
Haonan Bai
Jen-tse Huang
Yuxuan Wan
Youliang Yuan
Haoyi Qiu
Nanyun Peng
Michael R. Lyu
47
20
0
01 Jan 2024
The Media Bias Taxonomy: A Systematic Literature Review on the Forms and Automated Detection of Media Bias
Timo Spinde
Smilla Hinterreiter
Fabian Haak
Terry Ruas
Helge Giese
Norman Meuschke
Bela Gipp
19
12
0
26 Dec 2023
1
2
3
4
5
Next