ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.03521
  4. Cited By
RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of
  Conversational Language Models

RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models

7 June 2021
Soumya Barikeri
Anne Lauscher
Ivan Vulić
Goran Glavas
ArXivPDFHTML

Papers citing "RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models"

50 / 116 papers shown
Title
FAIR Enough: How Can We Develop and Assess a FAIR-Compliant Dataset for
  Large Language Models' Training?
FAIR Enough: How Can We Develop and Assess a FAIR-Compliant Dataset for Large Language Models' Training?
Shaina Raza
Shardul Ghuge
Chen Ding
Elham Dolatabadi
D. Pandya
SyDa
21
9
0
19 Jan 2024
A Group Fairness Lens for Large Language Models
A Group Fairness Lens for Large Language Models
Guanqun Bi
Lei Shen
Yuqiang Xie
Yanan Cao
Tiangang Zhu
Xiao-feng He
ALM
34
4
0
24 Dec 2023
Topic Bias in Emotion Classification
Topic Bias in Emotion Classification
Maximilian Wegge
Roman Klinger
CVBM
18
0
0
14 Dec 2023
GPTBIAS: A Comprehensive Framework for Evaluating Bias in Large Language
  Models
GPTBIAS: A Comprehensive Framework for Evaluating Bias in Large Language Models
Jiaxu Zhao
Meng Fang
Shirui Pan
Wenpeng Yin
Mykola Pechenizkiy
ELM
24
11
0
11 Dec 2023
A Survey on Large Language Model (LLM) Security and Privacy: The Good,
  the Bad, and the Ugly
A Survey on Large Language Model (LLM) Security and Privacy: The Good, the Bad, and the Ugly
Yifan Yao
Jinhao Duan
Kaidi Xu
Yuanfang Cai
Eric Sun
Yue Zhang
PILM
ELM
39
475
0
04 Dec 2023
Tackling Bias in Pre-trained Language Models: Current Trends and
  Under-represented Societies
Tackling Bias in Pre-trained Language Models: Current Trends and Under-represented Societies
Vithya Yogarajan
Gillian Dobbie
Te Taka Keegan
R. Neuwirth
ALM
43
11
0
03 Dec 2023
PEFTDebias : Capturing debiasing information using PEFTs
PEFTDebias : Capturing debiasing information using PEFTs
Sumit Agarwal
Aditya Srikanth Veerubhotla
Srijan Bansal
14
3
0
01 Dec 2023
Current Topological and Machine Learning Applications for Bias Detection
  in Text
Current Topological and Machine Learning Applications for Bias Detection in Text
Colleen Farrelly
Yashbir Singh
Quincy A. Hathaway
Gunnar Carlsson
Ashok Choudhary
Rahul Paul
Gianfranco Doretto
Yassine Himeur
Shadi Atalla
W. Mansoor
29
4
0
22 Nov 2023
Prompt-based Pseudo-labeling Strategy for Sample-Efficient
  Semi-Supervised Extractive Summarization
Prompt-based Pseudo-labeling Strategy for Sample-Efficient Semi-Supervised Extractive Summarization
Gaurav Sahu
Olga Vechtomova
I. Laradji
39
1
0
16 Nov 2023
Social Bias Probing: Fairness Benchmarking for Language Models
Social Bias Probing: Fairness Benchmarking for Language Models
Marta Marchiori Manerba
Karolina Stañczak
Riccardo Guidotti
Isabelle Augenstein
27
16
0
15 Nov 2023
Alquist 5.0: Dialogue Trees Meet Generative Models. A Novel Approach for
  Enhancing SocialBot Conversations
Alquist 5.0: Dialogue Trees Meet Generative Models. A Novel Approach for Enhancing SocialBot Conversations
Ondrej Kobza
Jan Cuhel
Tommaso Gargiani
David Herel
Petr Marek
24
3
0
24 Oct 2023
PromptMix: A Class Boundary Augmentation Method for Large Language Model
  Distillation
PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation
Gaurav Sahu
Olga Vechtomova
Dzmitry Bahdanau
I. Laradji
VLM
55
24
0
22 Oct 2023
Mitigating Bias for Question Answering Models by Tracking Bias Influence
Mitigating Bias for Question Answering Models by Tracking Bias Influence
Mingyu Derek Ma
Jiun-Yu Kao
Arpit Gupta
Yu-Hsiang Lin
Wenbo Zhao
Tagyoung Chung
Wei Wang
Kai-Wei Chang
Nanyun Peng
32
4
0
13 Oct 2023
The potential of large language models for improving probability
  learning: A study on ChatGPT3.5 and first-year computer engineering students
The potential of large language models for improving probability learning: A study on ChatGPT3.5 and first-year computer engineering students
Angel Udias
A. Alonso-Ayuso
Ignacio Sanchez
Sonia Hernandez
Maria Eugenia Castellanos
R. M. Diez
Emilio Lopez Cano
22
1
0
09 Oct 2023
Unlocking Bias Detection: Leveraging Transformer-Based Models for
  Content Analysis
Unlocking Bias Detection: Leveraging Transformer-Based Models for Content Analysis
Shaina Raza
Oluwanifemi Bamgbose
Veronica Chatrath
Shardul Ghuge
Yan Sidyakin
Abdullah Y. Muaad
16
11
0
30 Sep 2023
SafetyBench: Evaluating the Safety of Large Language Models
SafetyBench: Evaluating the Safety of Large Language Models
Zhexin Zhang
Leqi Lei
Lindong Wu
Rui Sun
Yongkang Huang
Chong Long
Xiao Liu
Xuanyu Lei
Jie Tang
Minlie Huang
LRM
LM&MA
ELM
39
90
0
13 Sep 2023
Sensitivity, Performance, Robustness: Deconstructing the Effect of
  Sociodemographic Prompting
Sensitivity, Performance, Robustness: Deconstructing the Effect of Sociodemographic Prompting
Tilman Beck
Hendrik Schuff
Anne Lauscher
Iryna Gurevych
40
32
0
13 Sep 2023
Bias Testing and Mitigation in LLM-based Code Generation
Bias Testing and Mitigation in LLM-based Code Generation
Dong Huang
Qingwen Bu
Jie M. Zhang
Xiaofei Xie
Junjie Chen
Heming Cui
45
20
0
03 Sep 2023
Bias and Fairness in Large Language Models: A Survey
Bias and Fairness in Large Language Models: A Survey
Isabel O. Gallegos
Ryan A. Rossi
Joe Barrow
Md Mehrab Tanjim
Sungchul Kim
Franck Dernoncourt
Tong Yu
Ruiyi Zhang
Nesreen Ahmed
AILaw
26
490
0
02 Sep 2023
AI in the Gray: Exploring Moderation Policies in Dialogic Large Language
  Models vs. Human Answers in Controversial Topics
AI in the Gray: Exploring Moderation Policies in Dialogic Large Language Models vs. Human Answers in Controversial Topics
V. Ghafouri
Vibhor Agarwal
Yong Zhang
Nishanth R. Sastry
Jose Such
Guillermo Suarez-Tangil
AI4MH
23
21
0
28 Aug 2023
A Survey on Fairness in Large Language Models
A Survey on Fairness in Large Language Models
Yingji Li
Mengnan Du
Rui Song
Xin Wang
Ying Wang
ALM
52
59
0
20 Aug 2023
Gender-tuning: Empowering Fine-tuning for Debiasing Pre-trained Language
  Models
Gender-tuning: Empowering Fine-tuning for Debiasing Pre-trained Language Models
Somayeh Ghanbarzadeh
Yan-ping Huang
Hamid Palangi
R. C. Moreno
Hamed Khanpour
32
12
0
20 Jul 2023
Mitigating Bias in Conversations: A Hate Speech Classifier and Debiaser
  with Prompts
Mitigating Bias in Conversations: A Hate Speech Classifier and Debiaser with Prompts
Shaina Raza
Chen Ding
D. Pandya
FaML
16
2
0
14 Jul 2023
Exposing Bias in Online Communities through Large-Scale Language Models
Exposing Bias in Online Communities through Large-Scale Language Models
Celine Wald
Lukas Pfahler
13
6
0
04 Jun 2023
Marked Personas: Using Natural Language Prompts to Measure Stereotypes
  in Language Models
Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models
Myra Cheng
Esin Durmus
Dan Jurafsky
25
174
0
29 May 2023
Stereotypes and Smut: The (Mis)representation of Non-cisgender
  Identities by Text-to-Image Models
Stereotypes and Smut: The (Mis)representation of Non-cisgender Identities by Text-to-Image Models
Eddie L. Ungless
Bjorn Ross
Anne Lauscher
28
32
0
26 May 2023
What about em? How Commercial Machine Translation Fails to Handle
  (Neo-)Pronouns
What about em? How Commercial Machine Translation Fails to Handle (Neo-)Pronouns
Anne Lauscher
Debora Nozza
Archie Crowley
E. Miltersen
Dirk Hovy
24
21
0
25 May 2023
Trade-Offs Between Fairness and Privacy in Language Modeling
Trade-Offs Between Fairness and Privacy in Language Modeling
Cleo Matzken
Steffen Eger
Ivan Habernal
SILM
41
6
0
24 May 2023
Gender Biases in Automatic Evaluation Metrics for Image Captioning
Gender Biases in Automatic Evaluation Metrics for Image Captioning
Haoyi Qiu
Zi-Yi Dou
Tianlu Wang
Asli Celikyilmaz
Nanyun Peng
EGVM
26
14
0
24 May 2023
Word Embeddings Are Steers for Language Models
Word Embeddings Are Steers for Language Models
Chi Han
Jialiang Xu
Manling Li
Yi Ren Fung
Chenkai Sun
Nan Jiang
Tarek F. Abdelzaher
Heng Ji
LLMSV
29
27
0
22 May 2023
CHBias: Bias Evaluation and Mitigation of Chinese Conversational
  Language Models
CHBias: Bias Evaluation and Mitigation of Chinese Conversational Language Models
Jiaxu Zhao
Meng Fang
Zijing Shi
Yitong Li
Ling-Hao Chen
Mykola Pechenizkiy
22
20
0
18 May 2023
"I'm fully who I am": Towards Centering Transgender and Non-Binary
  Voices to Measure Biases in Open Language Generation
"I'm fully who I am": Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation
Anaelia Ovalle
Palash Goyal
Jwala Dhamala
Zachary Jaggers
Kai-Wei Chang
Aram Galstyan
R. Zemel
Rahul Gupta
25
61
0
17 May 2023
A General-Purpose Multilingual Document Encoder
A General-Purpose Multilingual Document Encoder
Onur Galoglu
Robert Litschko
Goran Glavas
34
2
0
11 May 2023
Introducing MBIB -- the first Media Bias Identification Benchmark Task
  and Dataset Collection
Introducing MBIB -- the first Media Bias Identification Benchmark Task and Dataset Collection
Martin Wessel
Tomávs Horych
Terry Ruas
Akiko Aizawa
Bela Gipp
Timo Spinde
26
21
0
25 Apr 2023
Effectiveness of Debiasing Techniques: An Indigenous Qualitative
  Analysis
Effectiveness of Debiasing Techniques: An Indigenous Qualitative Analysis
Vithya Yogarajan
Gillian Dobbie
Henry Gouk
14
3
0
17 Apr 2023
Logic Against Bias: Textual Entailment Mitigates Stereotypical Sentence
  Reasoning
Logic Against Bias: Textual Entailment Mitigates Stereotypical Sentence Reasoning
Hongyin Luo
James R. Glass
NAI
23
7
0
10 Mar 2023
Toward Fairness in Text Generation via Mutual Information Minimization
  based on Importance Sampling
Toward Fairness in Text Generation via Mutual Information Minimization based on Importance Sampling
Rui Wang
Pengyu Cheng
Ricardo Henao
12
8
0
25 Feb 2023
In What Languages are Generative Language Models the Most Formal?
  Analyzing Formality Distribution across Languages
In What Languages are Generative Language Models the Most Formal? Analyzing Formality Distribution across Languages
Asim Ersoy
Gerson Vizcarra
T. Mayeesha
Benjamin Muller
26
2
0
23 Feb 2023
Counter-GAP: Counterfactual Bias Evaluation through Gendered Ambiguous
  Pronouns
Counter-GAP: Counterfactual Bias Evaluation through Gendered Ambiguous Pronouns
Zhongbin Xie
Vid Kocijan
Thomas Lukasiewicz
Oana-Maria Camburu
8
2
0
11 Feb 2023
FineDeb: A Debiasing Framework for Language Models
FineDeb: A Debiasing Framework for Language Models
Akash Saravanan
Dhruv Mullick
Habibur Rahman
Nidhi Hegde
FedML
AI4CE
18
4
0
05 Feb 2023
Using In-Context Learning to Improve Dialogue Safety
Using In-Context Learning to Improve Dialogue Safety
Nicholas Meade
Spandana Gella
Devamanyu Hazarika
Prakhar Gupta
Di Jin
Siva Reddy
Yang Liu
Dilek Z. Hakkani-Tür
30
38
0
02 Feb 2023
DialGuide: Aligning Dialogue Model Behavior with Developer Guidelines
DialGuide: Aligning Dialogue Model Behavior with Developer Guidelines
Prakhar Gupta
Yang Liu
Di Jin
Behnam Hedayatnia
Spandana Gella
Sijia Liu
P. Lange
Julia Hirschberg
Dilek Z. Hakkani-Tür
30
5
0
20 Dec 2022
Religion and Spirituality on Social Media in the Aftermath of the Global
  Pandemic
Religion and Spirituality on Social Media in the Aftermath of the Global Pandemic
O. Aduragba
Alexandra I. Cristea
Pete Phillips
Jonas Kurlberg
Jialin Yu
18
2
0
11 Dec 2022
Towards Robust NLG Bias Evaluation with Syntactically-diverse Prompts
Towards Robust NLG Bias Evaluation with Syntactically-diverse Prompts
Arshiya Aggarwal
Jiao Sun
Nanyun Peng
9
6
0
03 Dec 2022
Conceptor-Aided Debiasing of Large Language Models
Conceptor-Aided Debiasing of Large Language Models
Yifei Li
Lyle Ungar
João Sedoc
14
4
0
20 Nov 2022
Does Debiasing Inevitably Degrade the Model Performance
Does Debiasing Inevitably Degrade the Model Performance
Yiran Liu
Xiao-Yang Liu
Haotian Chen
Yang Yu
33
2
0
14 Nov 2022
Bridging Fairness and Environmental Sustainability in Natural Language
  Processing
Bridging Fairness and Environmental Sustainability in Natural Language Processing
Marius Hessenthaler
Emma Strubell
Dirk Hovy
Anne Lauscher
21
8
0
08 Nov 2022
Weakly Supervised Data Augmentation Through Prompting for Dialogue
  Understanding
Weakly Supervised Data Augmentation Through Prompting for Dialogue Understanding
Maximillian Chen
Alexandros Papangelis
Chenyang Tao
Andrew Rosenbaum
Seokhwan Kim
Yang Liu
Zhou Yu
Dilek Z. Hakkani-Tür
39
32
0
25 Oct 2022
BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for
  Text Generation
BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation
Tianxiang Sun
Junliang He
Xipeng Qiu
Xuanjing Huang
24
44
0
14 Oct 2022
BlenderBot 3: a deployed conversational agent that continually learns to
  responsibly engage
BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage
Kurt Shuster
Jing Xu
M. Komeili
Da Ju
Eric Michael Smith
...
Naman Goyal
Arthur Szlam
Y-Lan Boureau
Melanie Kambadur
Jason Weston
LM&Ro
KELM
35
233
0
05 Aug 2022
Previous
123
Next