Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.15585
Cited By
Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting
28 January 2024
Masahiro Kaneko
Danushka Bollegala
Naoaki Okazaki
Timothy Baldwin
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting"
30 / 30 papers shown
Title
Plan-and-Refine: Diverse and Comprehensive Retrieval-Augmented Generation
Alireza Salemi
Chris Samarinas
Hamed Zamani
24
0
0
10 Apr 2025
Intent-Aware Self-Correction for Mitigating Social Biases in Large Language Models
Panatchakorn Anantaprayoon
Masahiro Kaneko
Naoaki Okazaki
LRM
KELM
50
0
0
08 Mar 2025
Sensing and Steering Stereotypes: Extracting and Applying Gender Representation Vectors in LLMs
Hannah Cyberey
Yangfeng Ji
David E. Evans
LLMSV
56
1
0
27 Feb 2025
BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages
Junho Myung
Nayeon Lee
Yi Zhou
Jiho Jin
Rifki Afina Putri
...
Seid Muhie Yimam
Mohammad Taher Pilehvar
N. Ousidhoum
Jose Camacho-Collados
Alice H. Oh
87
32
0
17 Jan 2025
Different Bias Under Different Criteria: Assessing Bias in LLMs with a Fact-Based Approach
Changgeon Ko
Jisu Shin
Hoyun Song
Jeongyeon Seo
Jong C. Park
62
0
0
26 Nov 2024
Are Large Language Models Ready for Travel Planning?
Ruiping Ren
Xing Yao
Shu Cole
Haining Wang
18
0
0
22 Oct 2024
Evaluating Gender Bias of LLMs in Making Morality Judgements
Divij Bajaj
Yuanyuan Lei
Jonathan Tong
Ruihong Huang
32
2
0
13 Oct 2024
A Comprehensive Survey of Bias in LLMs: Current Landscape and Future Directions
Rajesh Ranjan
Shailja Gupta
Surya Narayan Singh
31
8
0
24 Sep 2024
A Multi-LLM Debiasing Framework
Deonna M. Owens
Ryan A. Rossi
Sungchul Kim
Tong Yu
Franck Dernoncourt
Xiang Chen
Ruiyi Zhang
Jiuxiang Gu
Hanieh Deilamsalehy
Nedim Lipka
26
3
0
20 Sep 2024
Fairness in Large Language Models in Three Hours
Thang Doan Viet
Zichong Wang
Minh Nhat Nguyen
Wenbin Zhang
33
8
0
02 Aug 2024
BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs
Zhiting Fan
Ruizhe Chen
Ruiling Xu
Zuozhu Liu
KELM
16
15
0
14 Jul 2024
Breaking Bias, Building Bridges: Evaluation and Mitigation of Social Biases in LLMs via Contact Hypothesis
Chahat Raj
A. Mukherjee
Aylin Caliskan
Antonios Anastasopoulos
Ziwei Zhu
30
8
0
02 Jul 2024
CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models
Song Wang
Peng Wang
Tong Zhou
Yushun Dong
Zhen Tan
Jundong Li
CoGe
44
6
0
02 Jul 2024
Significance of Chain of Thought in Gender Bias Mitigation for English-Dravidian Machine Translation
Lavanya Prahallad
Radhika Mamidi
AI4CE
LRM
14
1
0
30 May 2024
Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes
Damin Zhang
Yi Zhang
Geetanjali Bihani
Julia Taylor Rayz
46
2
0
06 May 2024
Prompting Techniques for Reducing Social Bias in LLMs through System 1 and System 2 Cognitive Processes
M. Kamruzzaman
Gene Louis Kim
29
15
0
26 Apr 2024
Sampling-based Pseudo-Likelihood for Membership Inference Attacks
Masahiro Kaneko
Youmi Ma
Yuki Wata
Naoaki Okazaki
19
9
0
17 Apr 2024
The Impact of Unstated Norms in Bias Analysis of Language Models
Farnaz Kohankhaki
D. B. Emerson
David B. Emerson
Laleh Seyyed-Kalantari
Faiza Khan Khattak
50
1
0
04 Apr 2024
Fairness in Large Language Models: A Taxonomic Survey
Zhibo Chu
Zichong Wang
Wenbin Zhang
AILaw
41
31
0
31 Mar 2024
A Little Leak Will Sink a Great Ship: Survey of Transparency for Large Language Models from Start to Finish
Masahiro Kaneko
Timothy Baldwin
PILM
23
3
0
24 Mar 2024
Eagle: Ethical Dataset Given from Real Interactions
Masahiro Kaneko
Danushka Bollegala
Timothy Baldwin
38
3
0
22 Feb 2024
Evaluating Gender Bias of Pre-trained Language Models in Natural Language Inference by Considering All Labels
Panatchakorn Anantaprayoon
Masahiro Kaneko
Naoaki Okazaki
56
16
0
18 Sep 2023
PlatoLM: Teaching LLMs in Multi-Round Dialogue via a User Simulator
Chuyi Kong
Yaxin Fan
Xiang Wan
Feng Jiang
Benyou Wang
25
5
0
21 Aug 2023
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
291
4,048
0
24 May 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
297
3,163
0
21 Mar 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,730
0
04 Mar 2022
BBQ: A Hand-Built Bias Benchmark for Question Answering
Alicia Parrish
Angelica Chen
Nikita Nangia
Vishakh Padmakumar
Jason Phang
Jana Thompson
Phu Mon Htut
Sam Bowman
212
364
0
15 Oct 2021
Socially Aware Bias Measurements for Hindi Language Representations
Vijit Malik
Sunipa Dev
A. Nishi
Nanyun Peng
Kai-Wei Chang
53
36
0
15 Oct 2021
Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP
Timo Schick
Sahana Udupa
Hinrich Schütze
257
374
0
28 Feb 2021
Debiasing Pre-trained Contextualised Embeddings
Masahiro Kaneko
Danushka Bollegala
210
138
0
23 Jan 2021
1