Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1909.04251
Cited By
A Benchmark Dataset for Learning to Intervene in Online Hate Speech
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
10 September 2019
Jing Qian
Anna Bethke
Yinyin Liu
E. Belding-Royer
William Yang Wang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"A Benchmark Dataset for Learning to Intervene in Online Hate Speech"
50 / 119 papers shown
Improving Cross-Domain Hate Speech Generalizability with Emotion Knowledge
Pacific Asia Conference on Language, Information and Computation (PACLIC), 2023
Shi Yin Hong
Susan Gauch
257
2
0
24 Nov 2023
Latent Feature-based Data Splits to Improve Generalisation Evaluation: A Hate Speech Detection Case Study
Maike Zufle
Verna Dankers
Ivan Titov
242
0
0
16 Nov 2023
People Make Better Edits: Measuring the Efficacy of LLM-Generated Counterfactually Augmented Data for Harmful Language Detection
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Indira Sen
Dennis Assenmacher
Mattia Samory
Isabelle Augenstein
Wil M.P. van der Aalst
Claudia Wagner
459
31
0
02 Nov 2023
Beyond Denouncing Hate: Strategies for Countering Implied Biases and Stereotypes in Language
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Kumail Alhamoud
Emily Allaway
Akhila Yerukola
Laura Vianna
Sarah-Jane Leslie
Maarten Sap
127
27
0
31 Oct 2023
Text-Transport: Toward Learning Causal Effects of Natural Language
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Victoria Lin
Louis-Philippe Morency
Eli Ben-Michael
217
6
0
31 Oct 2023
LLMs and Finetuning: Benchmarking cross-domain performance for hate speech detection
Communication Systems and Applications (CSA), 2023
Ahmad Nasir
Aadish Sharma
Kokil Jaidka
Saifuddin Ahmed
291
5
0
29 Oct 2023
StyloMetrix: An Open-Source Multilingual Tool for Representing Stylometric Vectors
Inez Okulska
Daria Stetsenko
Anna Kołos
Agnieszka Karlinska
Kinga Glkabiñska
Adam Nowakowski
201
21
0
22 Sep 2023
On the Challenges of Building Datasets for Hate Speech Detection
Vitthal Bhandari
215
1
0
06 Sep 2023
Weigh Your Own Words: Improving Hate Speech Counter Narrative Generation via Attention Regularization
Helena Bonaldi
Giuseppe Attanasio
Debora Nozza
Marco Guerini
245
12
0
05 Sep 2023
BAN-PL: a Novel Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl web service
International Conference on Language Resources and Evaluation (LREC), 2023
Anna Kołos
Inez Okulska
Kinga Głąbińska
Agnieszka Karlinska
Emilia Wisnios
Paweł Ellerik
Andrzej Prałat
213
4
0
21 Aug 2023
Studying Socially Unacceptable Discourse Classification (SUD) through different eyes: "Are we on the same page ?"
Bruno Machado Carneiro
Michele Linardi
Julien Longhi
150
3
0
08 Aug 2023
Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media
AAAI Conference on Artificial Intelligence (AAAI), 2023
Liam Hebert
Gaurav Sahu
Yuxuan Guo
Nanda Kishore Sreenivas
Lukasz Golab
Robin Cohen
301
17
0
18 Jul 2023
Understanding Counterspeech for Online Harm Mitigation
Yi-Ling Chung
Gavin Abercrombie
Florence E. Enock
Jonathan Bright
Verena Rieser
166
25
0
01 Jul 2023
Identity Construction in a Misogynist Incels Forum
Michael Miller Yoder
C. Perry
D. W. Brown
Kathleen M. Carley
Meredith L. Pruden
221
8
0
27 Jun 2023
PEACE: Cross-Platform Hate Speech Detection- A Causality-guided Framework
Paras Sheth
Tharindu Kumarage
Raha Moraffah
Amanat Chadha
Huan Liu
239
13
0
15 Jun 2023
Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Rishabh Gupta
Shaily Desai
Manvi Goel
Anil Bandhakavi
Tanmoy Chakraborty
Md. Shad Akhtar
189
28
0
23 May 2023
Towards Countering Essentialism through Social Bias Reasoning
Emily Allaway
Nina Taneja
Sarah-Jane Leslie
Maarten Sap
115
5
0
28 Mar 2023
On the rise of fear speech in online social media
Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2023
Punyajoy Saha
Kiran Garimella
Narla Komal Kalyan
Saurabh Kumar Pandey
Pauras Mangesh Meher
Binny Mathew
Animesh Mukherjee
84
31
0
18 Mar 2023
Reinforcement Learning-based Counter-Misinformation Response Generation: A Case Study of COVID-19 Vaccine Misinformation
The Web Conference (WWW), 2023
Bing He
M. Ahamad
Srijan Kumar
OffRL
187
58
0
11 Mar 2023
SemEval-2023 Task 10: Explainable Detection of Online Sexism
International Workshop on Semantic Evaluation (SemEval), 2023
Hannah Rose Kirk
Wenjie Yin
Bertie Vidgen
Paul Röttger
299
144
0
07 Mar 2023
CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sreyan Ghosh
Manan Suri
Purva Chiniya
Utkarsh Tyagi
Sonal Kumar
Dinesh Manocha
236
19
0
02 Mar 2023
Multilingual Content Moderation: A Case Study on Reddit
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Meng Ye
Karan Sikka
Katherine Atwell
Sabit Hassan
Ajay Divakaran
Malihe Alikhani
AI4MH
177
11
0
19 Feb 2023
Vicarious Offense and Noise Audit of Offensive Speech Classifiers: Unifying Human and Machine Disagreement on What is Offensive
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Tharindu Cyril Weerasooriya
Sujan Dutta
Tharindu Ranasinghe
Marcos Zampieri
Christopher Homan
Ashiqur R. KhudaBukhsh
AAML
521
21
0
29 Jan 2023
Using Selective Masking as a Bridge between Pre-training and Fine-tuning
Tanish Lad
Himanshu Maheshwari
Shreyas Kottukkal
R. Mamidi
139
3
0
24 Nov 2022
Cross-Platform and Cross-Domain Abusive Language Detection with Supervised Contrastive Learning
Md. Tawkat Islam Khondaker
Muhammad Abdul-Mageed
L. Lakshmanan
99
2
0
11 Nov 2022
Human-Machine Collaboration Approaches to Build a Dialogue Dataset for Hate Speech Countering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Helena Bonaldi
Sara Dellantonio
Serra Sinem Tekiroğlu
Marco Guerini
201
56
0
07 Nov 2022
Quantifying How Hateful Communities Radicalize Online Users
International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 2022
Matheus Schmitz
Keith Burghardt
Goran Murić
136
17
0
19 Sep 2022
Reliable Decision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild
Web Search and Data Mining (WSDM), 2022
Donghyun Son
Byounggyu Lew
Kwanghee Choi
Yongsu Baek
Seungwoo Choi
Beomjun Shin
S. Ha
Buru Chang
362
13
0
16 Aug 2022
Parsimonious Argument Annotations for Hate Speech Counter-narratives
D. Furman
Pablo E. Torres
José Raúl Rodríguez Rodríguez
Lautaro Martínez
Laura Alonso Alemany
Diego Letzen
María Vanina Martínez
101
1
0
01 Aug 2022
ELF22: A Context-based Counter Trolling Dataset to Combat Internet Trolls
International Conference on Language Resources and Evaluation (LREC), 2022
Huije Lee
Young Ju Na
Hoyun Song
Jisu Shin
Jong C. Park
202
9
0
30 Jul 2022
Hate Speech Criteria: A Modular Approach to Task-Specific Hate Speech Definitions
Urja Khurana
I. Vermeulen
Eric T. Nalisnick
M. V. Noorloos
Antske Fokkens
AILaw
129
23
0
30 Jun 2022
Hate Speech and Counter Speech Detection: Conversational Context Does Matter
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Xinchen Yu
Eduardo Blanco
Lingzi Hong
153
53
0
13 Jun 2022
Proactively Reducing the Hate Intensity of Online Posts via Hate Speech Normalization
Knowledge Discovery and Data Mining (KDD), 2022
Sarah Masud
Manjot Bedi
Mohammad Aflah Khan
Md. Shad Akhtar
Tanmoy Chakraborty
164
31
0
08 Jun 2022
CounterGeDi: A controllable approach to generate polite, detoxified and emotional counterspeech
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Punyajoy Saha
Kanishk Singh
Adarsh Kumar
Binny Mathew
Animesh Mukherjee
177
44
0
09 May 2022
Human-AI Collaboration via Conditional Delegation: A Case Study of Content Moderation
International Conference on Human Factors in Computing Systems (CHI), 2022
Vivian Lai
Samuel Carton
Rajat Bhatnagar
Vera Liao
Yunfeng Zhang
Chenhao Tan
316
162
0
25 Apr 2022
CRUSH: Contextually Regularized and User anchored Self-supervised Hate speech Detection
Souvic Chakraborty
Parag Dutta
Sumegh Roychowdhury
Animesh Mukherjee
152
8
0
13 Apr 2022
Korean Online Hate Speech Dataset for Multilabel Classification: How Can Social Science Improve Dataset on Hate Speech?
Taeyoung Kang
Eunrang Kwon
Junbum Lee
Youngeun Nam
Junmo Song
JeongKyu Suh
121
14
0
07 Apr 2022
Using Pre-Trained Language Models for Producing Counter Narratives Against Hate Speech: a Comparative Study
Findings (Findings), 2022
Serra Sinem Tekiroğlu
Helena Bonaldi
Margherita Fanton
Marco Guerini
259
55
0
04 Apr 2022
Beyond Plain Toxic: Detection of Inappropriate Statements on Flammable Topics for the Russian Language
N. Babakov
V. Logacheva
Sergey Petrakov
152
3
0
04 Mar 2022
Counter Hate Speech in Social Media: A Survey
Dana Alsagheer
Hadi Mansourifar
W. Shi
121
12
0
21 Feb 2022
Going Extreme: Comparative Analysis of Hate Speech in Parler and Gab
Abraham Israeli
Oren Tsur
156
1
0
27 Jan 2022
Leveraging Transformers for Hate Speech Detection in Conversational Code-Mixed Tweets
Fire (FIRE), 2021
Zaki Mustafa Farooqi
Sreyan Ghosh
R. Shah
132
38
0
18 Dec 2021
Revisiting Contextual Toxicity Detection in Conversations
Atijit Anuchitanukul
Julia Ive
Lucia Specia
289
18
0
24 Nov 2021
Sexism Identification in Tweets and Gabs using Deep Neural Networks
Amikul Kalra
A. Zubiaga
66
14
0
05 Nov 2021
Multilingual Counter Narrative Type Classification
Yi-Ling Chung
Marco Guerini
Rodrigo Agerri
757
17
0
28 Sep 2021
Countering Online Hate Speech: An NLP Perspective
Mudit Chaudhary
Chandni Saxena
Helen Meng
125
21
0
07 Sep 2021
Dataset for Identification of Homophobia and Transophobia in Multilingual YouTube Comments
Bharathi Raja Chakravarthi
R. Priyadharshini
Rahul Ponnusamy
Prasanna Kumar Kumaresan
Kayalvizhi Sampath
D. Thenmozhi
S. Thangasamy
Rajendran Nallathambi
John P. Mccrae
121
98
0
01 Sep 2021
TweetBLM: A Hate Speech Dataset and Analysis of Black Lives Matter-related Microblogs on Twitter
Sumit Kumar
Raj Ratn Pranesh
105
20
0
27 Aug 2021
Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset to Fight Online Hate Speech
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Margherita Fanton
Helena Bonaldi
Serra Sinem Tekiroğlu
Marco Guerini
178
128
0
19 Jul 2021
Empowering NGOs in Countering Online Hate Messages
Yi-Ling Chung
Serra Sinem Tekiroğlu
Sara Tonelli
Marco Guerini
111
25
0
06 Jul 2021
Previous
1
2
3
Next
Page 2 of 3