1905.12516
Racial Bias in Hate Speech and Abusive Language Detection Datasets
29 May 2019
Thomas Davidson
Debasmita Bhattacharya
Ingmar Weber
Papers citing
"Racial Bias in Hate Speech and Abusive Language Detection Datasets"
44 / 94 papers shown
CO-STAR: Conceptualisation of Stereotypes for Analysis and Reasoning
Teyun Kwon
Anandha Gopalan
35
2
0
01 Dec 2021
Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection
Maarten Sap
Swabha Swayamdipta
Laura Vianna
Xuhui Zhou
Yejin Choi
Noah A. Smith
46
269
0
15 Nov 2021
Cross-lingual Hate Speech Detection using Transformer Models
Teodor Tita
A. Zubiaga
22
13
0
01 Nov 2021
Perceptual Score: What Data Modalities Does Your Model Perceive?
Itai Gat
Idan Schwartz
Alex Schwing
39
30
0
27 Oct 2021
BBQ: A Hand-Built Bias Benchmark for Question Answering
Alicia Parrish
Angelica Chen
Nikita Nangia
Vishakh Padmakumar
Jason Phang
Jana Thompson
Phu Mon Htut
Sam Bowman
223
381
0
15 Oct 2021
Mitigating Racial Biases in Toxic Language Detection with an Equity-Based Ensemble Framework
Matan Halevy
Camille Harris
A. Bruckman
Diyi Yang
A. Howard
42
35
0
27 Sep 2021
Latent Hatred: A Benchmark for Understanding Implicit Hate Speech
Mai Elsherief
Caleb Ziems
D. Muchlinski
Vaishnavi Anupindi
Jordyn Seybolt
M. D. Choudhury
Diyi Yang
106
239
0
11 Sep 2021
Does Pretraining for Summarization Require Knowledge Transfer?
Kundan Krishna
Jeffrey P. Bigham
Zachary Chase Lipton
30
36
0
10 Sep 2021
Detecting Inspiring Content on Social Media
Oana Ignat
Y-Lan Boureau
Jane A. Yu
A. Halevy
24
6
0
06 Sep 2021
Dataset for Identification of Homophobia and Transophobia in Multilingual YouTube Comments
Bharathi Raja Chakravarthi
R. Priyadharshini
Rahul Ponnusamy
Prasanna Kumar Kumaresan
Kayalvizhi Sampath
D. Thenmozhi
S. Thangasamy
Rajendran Nallathambi
John P. McCrae
26
90
0
01 Sep 2021
Overview of the HASOC track at FIRE 2020: Hate Speech and Offensive Content Identification in Indo-European Languages
Thomas Mandl
Sandip J Modha
Gautam Kishore Shahi
Prasenjit Majumder
Mohana Dave
Daksh Patel
Chintak Mandalia
Aditya Patel
91
171
0
12 Aug 2021
On Measures of Biases and Harms in NLP
Sunipa Dev
Emily Sheng
Jieyu Zhao
Aubrie Amstutz
Jiao Sun
...
M. Sanseverino
Jiin Kim
Akihiro Nishi
Nanyun Peng
Kai-Wei Chang
33
80
0
07 Aug 2021
Cross-lingual Capsule Network for Hate Speech Detection in Social Media
Aiqi Jiang
A. Zubiaga
23
14
0
06 Aug 2021
Unsupervised Domain Adaptation for Hate Speech Detection Using a Data Augmentation Approach
Sheikh Muhammad Sarwar
Vanessa Murdock
39
20
0
27 Jul 2021
Quantifying Social Biases in NLP: A Generalization and Empirical Comparison of Extrinsic Fairness Metrics
Paula Czarnowska
Yogarshi Vyas
Kashif Shah
21
104
0
28 Jun 2021
A Survey of Race, Racism, and Anti-Racism in NLP
Anjalie Field
Su Lin Blodgett
Zeerak Talat
Yulia Tsvetkov
42
122
0
21 Jun 2021
Understanding and Evaluating Racial Biases in Image Captioning
Dora Zhao
Angelina Wang
Olga Russakovsky
30
134
0
16 Jun 2021
Designing Toxic Content Classification for a Diversity of Perspectives
Deepak Kumar
Patrick Gage Kelley
Sunny Consolvo
Joshua Mason
Elie Bursztein
Zakir Durumeric
Kurt Thomas
Michael C. Bailey
19
105
0
04 Jun 2021
DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts
Alisa Liu
Maarten Sap
Ximing Lu
Swabha Swayamdipta
Chandra Bhagavatula
Noah A. Smith
Yejin Choi
MU
46
361
0
07 May 2021
Reliability Testing for Natural Language Processing Systems
Samson Tan
Chenyu You
K. Baxter
Araz Taeihagh
G. Bennett
Min-Yen Kan
22
39
0
06 May 2021
Cross-lingual hate speech detection based on multilingual domain-specific word embeddings
Aymé Arango
Jorge A. Pérez
Bárbara Poblete
30
9
0
30 Apr 2021
A Neighbourhood Framework for Resource-Lean Content Flagging
Sheikh Muhammad Sarwar
Dimitrina Zlatkova
Momchil Hardalov
Yoan Dinkov
Isabelle Augenstein
Preslav Nakov
24
5
0
31 Mar 2021
Detecting Hate Speech with GPT-3
Ke-Li Chiu
Annie Collins
Rohan Alexander
AILaw
25
108
0
23 Mar 2021
Towards generalisable hate speech detection: a review on obstacles and solutions
Wenjie Yin
A. Zubiaga
117
164
0
17 Feb 2021
Re-imagining Algorithmic Fairness in India and Beyond
Nithya Sambasivan
Erin Arnesen
Ben Hutchinson
Tulsee Doshi
Vinodkumar Prabhakaran
FaML
17
174
0
25 Jan 2021
Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection
Bertie Vidgen
Tristan Thrush
Zeerak Talat
Douwe Kiela
34
245
0
31 Dec 2020
HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection
Binny Mathew
Punyajoy Saha
Seid Muhie Yimam
Chris Biemann
Pawan Goyal
Animesh Mukherjee
47
551
0
18 Dec 2020
Towards Ethics by Design in Online Abusive Content Detection
S. Kiritchenko
I. Nejadgholi
24
13
0
28 Oct 2020
HateBERT: Retraining BERT for Abusive Language Detection in English
Tommaso Caselli
Valerio Basile
Jelena Mitrović
Michael Granitzer
24
359
0
23 Oct 2020
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models
Samuel Gehman
Suchin Gururangan
Maarten Sap
Yejin Choi
Noah A. Smith
58
1,134
0
24 Sep 2020
Hate Speech Detection and Racial Bias Mitigation in Social Media based on BERT model
Marzieh Mozafari
R. Farahbakhsh
Noel Crespi
12
211
0
14 Aug 2020
CausaLM: Causal Model Explanation Through Counterfactual Language Models
Amir Feder
Nadav Oved
Uri Shalit
Roi Reichart
CML
LRM
49
157
0
27 May 2020
Towards Socially Responsible AI: Cognitive Bias-Aware Multi-Objective Learning
Procheta Sen
Debasis Ganguly
27
18
0
14 May 2020
Intersectional Bias in Hate Speech and Abusive Language Datasets
Jae-Yeon Kim
Carlos Ortiz
S. Nam
Sarah Santiago
V. Datta
17
45
0
12 May 2020
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
Douwe Kiela
Hamed Firooz
Aravind Mohan
Vedanuj Goswami
Amanpreet Singh
Pratik Ringshia
Davide Testuggine
45
582
0
10 May 2020
Cyberbullying Detection with Fairness Constraints
O. Gencoglu
19
48
0
09 May 2020
Detecting East Asian Prejudice on Social Media
Bertie Vidgen
Austin Botelho
David A. Broniatowski
E. Guest
Matthew Hall
Helen Z. Margetts
Rebekah Tromble
Zeerak Talat
Scott A. Hale
19
97
0
08 May 2020
Social Biases in NLP Models as Barriers for Persons with Disabilities
Ben Hutchinson
Vinodkumar Prabhakaran
Emily L. Denton
Kellie Webster
Yu Zhong
Stephen Denuyl
28
302
0
02 May 2020
Multilingual Twitter Corpus and Baselines for Evaluating Demographic Bias in Hate Speech Recognition
Xiaolei Huang
Linzi Xing
Franck Dernoncourt
Michael J. Paul
16
87
0
24 Feb 2020
Social Bias Frames: Reasoning about Social and Power Implications of Language
Maarten Sap
Saadia Gabriel
Lianhui Qin
Dan Jurafsky
Noah A. Smith
Yejin Choi
42
486
0
10 Nov 2019
Privacy Enhanced Multimodal Neural Representations for Emotion Recognition
Mimansa Jaiswal
E. Provost
42
73
0
29 Oct 2019
A BERT-Based Transfer Learning Approach for Hate Speech Detection in Online Social Media
Marzieh Mozafari
R. Farahbakhsh
Noel Crespi
14
344
0
28 Oct 2019
Does BERT Make Any Sense? Interpretable Word Sense Disambiguation with Contextualized Embeddings
Gregor Wiedemann
Steffen Remus
Avi Chawla
Chris Biemann
27
174
0
23 Sep 2019
Empirical Analysis of Multi-Task Learning for Reducing Model Bias in Toxic Comment Detection
Ameya Vaidya
Feng Mai
Yue Ning
115
21
0
21 Sep 2019