Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2012.12305
Cited By
v1
v2 (latest)
Confronting Abusive Language Online: A Survey from the Ethical and Human Rights Perspective
Journal of Artificial Intelligence Research (JAIR), 2020
22 December 2020
S. Kiritchenko
I. Nejadgholi
Kathleen C. Fraser
AILaw
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Confronting Abusive Language Online: A Survey from the Ethical and Human Rights Perspective"
41 / 41 papers shown
Beating Harmful Stereotypes Through Facts: RAG-based Counter-speech Generation
Greta Damo
Elena Cabrio
S. Villata
124
0
0
14 Oct 2025
Toxicity in Online Platforms and AI Systems: A Survey of Needs, Challenges, Mitigations, and Future Directions
Expert systems with applications (ESWA), 2025
Smita Khapre
Melkamu Mersha
Hassan Shakil
Jonali Baruah
Jugal Kalita
216
4
0
29 Sep 2025
Conversations Gone Awry, But Then? Evaluating Conversational Forecasting Models
Son Quoc Tran
Tushaar Gangavarapu
Nicholas Chernogor
Jonathan P. Chang
Cristian Danescu-Niculescu-Mizil
AI4TS
206
0
0
25 Jul 2025
Cracking the Code: Enhancing Implicit Hate Speech Detection through Coding Classification
Lu Wei
Liangzhi Li
Tong Xiang
Xiao Liu
Noa Garcia
262
2
0
05 Jun 2025
WildFireCan-MMD: A Multimodal Dataset for Classification of User-Generated Content During Wildfires in Canada
Braeden Sherritt
Isar Nejadgholi
Efstratios Aivaliotis
Khaled Mslmani
Marzieh Amini
VLM
622
0
0
17 Apr 2025
Tackling Social Bias against the Poor: A Dataset and Taxonomy on Aporophobia
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Georgina Curto
S. Kiritchenko
Muhammad Hammad Fahim Siddiqui
I. Nejadgholi
Kathleen C. Fraser
196
1
0
17 Apr 2025
From Intrinsic Toxicity to Reception-Based Toxicity: A Contextual Framework for Prediction and Evaluation
Sergey Berezin
R. Farahbakhsh
Noel Crespi
371
1
0
20 Mar 2025
Northeastern Uni at Multilingual Counterspeech Generation: Enhancing Counter Speech Generation with LLM Alignment through Direct Preference Optimization
Sahil Wadhwa
Chengtian Xu
Haoming Chen
Aakash Mahalingam
Akankshya Kar
Divya Chaudhary
272
4
0
19 Dec 2024
KidLM: Advancing Language Models for Children -- Early Insights and Future Directions
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Mir Tafseer Nayeem
Davood Rafiei
ALM
337
12
0
04 Oct 2024
Knowledge-Aware Conversation Derailment Forecasting Using Graph Convolutional Networks
Enas Altarawneh
Ameeta Agrawal
Michael R. M. Jenkin
Manos Papagelis
377
0
0
24 Aug 2024
Navigating LLM Ethics: Advancements, Challenges, and Future Directions
AI and Ethics (AI & Ethics), 2024
Junfeng Jiao
S. Afroogh
Yiming Xu
Connor Phillips
AILaw
728
80
0
14 May 2024
Exploring Boundaries and Intensities in Offensive and Hate Speech: Unveiling the Complex Spectrum of Social Media Discourse
Abinew Ali Ayele
Esubalew alemneh Jalew
Adem Chanie Ali
Seid Muhie Yimam
Christian Biemann
197
6
0
18 Apr 2024
D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation
Aida Mostafazadeh Davani
Mark Díaz
Dylan K. Baker
Vinodkumar Prabhakaran
281
11
0
16 Apr 2024
NLP for Counterspeech against Hate: A Survey and How-To Guide
Helena Bonaldi
Yi-Ling Chung
Gavin Abercrombie
Marco Guerini
AAML
468
31
0
29 Mar 2024
GPT-HateCheck: Can LLMs Write Better Functional Tests for Hate Speech Detection?
Yiping Jin
Leo Wanner
A. Shvets
308
5
0
23 Feb 2024
Quantifying Stereotypes in Language
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2024
Yang Liu
240
5
0
28 Jan 2024
A Critical Reflection on the Use of Toxicity Detection Algorithms in Proactive Content Moderation Systems
Mark Warner
Angelika Strohmayer
Matthew Higgs
Lynne Coventry
369
12
0
19 Jan 2024
Key to Kindness: Reducing Toxicity In Online Discourse Through Proactive Content Moderation in a Mobile Keyboard
Mark Warner
Angelika Strohmayer
Matthew Higgs
Husnain Rafiq
Liying Yang
Lynne Coventry
266
3
0
19 Jan 2024
Cross-lingual Offensive Language Detection: A Systematic Review of Datasets, Transfer Approaches and Challenges
Aiqi Jiang
A. Zubiaga
AAML
401
7
0
17 Jan 2024
Consolidating Strategies for Countering Hate Speech Using Persuasive Dialogues
ICON (ICON), 2024
Sougata Saha
Rohini Srihari
206
4
0
15 Jan 2024
Disentangling Perceptions of Offensiveness: Cultural and Moral Correlates
Aida Mostafazadeh Davani
Mark Díaz
Dylan K. Baker
Vinodkumar Prabhakaran
AAML
243
36
0
11 Dec 2023
Conversation Derailment Forecasting with Graph Convolutional Networks
Enas Altarawneh
Ammeta Agrawal
Michael R. M. Jenkin
Manos Papagelis
245
4
0
22 Jun 2023
Toxic comments reduce the activity of volunteer editors on Wikipedia
PNAS Nexus (PNAS Nexus), 2023
Ivan Smirnov
Camelia Oprea
Markus Strohmaier
KELM
193
5
0
26 Apr 2023
The crime of being poor
Georgina Curto
S. Kiritchenko
I. Nejadgholi
Kathleen C. Fraser
340
4
0
24 Mar 2023
Interactive Text Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Felix Faltings
Michel Galley
Baolin Peng
Kianté Brantley
Weixin Cai
Yizhe Zhang
Jianfeng Gao
Bill Dolan
380
0
0
02 Mar 2023
Leveraging World Knowledge in Implicit Hate Speech Detection
Jessica Lin
201
11
0
28 Dec 2022
Foveate, Attribute, and Rationalize: Towards Physically Safe and Trustworthy AI
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Alex Mei
Sharon Levy
William Yang Wang
347
8
0
19 Dec 2022
Undesirable Biases in NLP: Addressing Challenges of Measurement
Oskar van der Wal
Dominik Bachmann
Alina Leidinger
L. Maanen
Willem H. Zuidema
K. Schulz
533
8
0
24 Nov 2022
Human-Machine Collaboration Approaches to Build a Dialogue Dataset for Hate Speech Countering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Helena Bonaldi
Sara Dellantonio
Serra Sinem Tekiroğlu
Marco Guerini
238
57
0
07 Nov 2022
Mitigating Covertly Unsafe Text within Natural Language Systems
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Alex Mei
Anisha Kabir
Sharon Levy
Melanie Subbiah
Emily Allaway
J. Judge
D. Patton
Bruce Bimber
Kathleen McKeown
William Yang Wang
373
13
0
17 Oct 2022
Metaphorical Paraphrase Generation: Feeding Metaphorical Language Models with Literal Texts
Giorgio Ottolina
John Pavlopoulos
225
1
0
10 Oct 2022
Hate Speech Criteria: A Modular Approach to Task-Specific Hate Speech Definitions
Urja Khurana
I. Vermeulen
Eric T. Nalisnick
M. V. Noorloos
Antske Fokkens
AILaw
177
25
0
30 Jun 2022
Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Esma Balkir
I. Nejadgholi
Kathleen C. Fraser
S. Kiritchenko
FAtt
231
33
0
06 May 2022
Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
I. Nejadgholi
Kathleen C. Fraser
S. Kiritchenko
167
27
0
05 Apr 2022
Using Pre-Trained Language Models for Producing Counter Narratives Against Hate Speech: a Comparative Study
Findings (Findings), 2022
Serra Sinem Tekiroğlu
Helena Bonaldi
Margherita Fanton
Marco Guerini
329
57
0
04 Apr 2022
Dynamic Forecasting of Conversation Derailment
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Yova Kementchedjhieva
Anders Søgaard
AI4TS
116
18
0
11 Oct 2021
Countering Online Hate Speech: An NLP Perspective
Mudit Chaudhary
Chandni Saxena
Helen Meng
154
22
0
07 Sep 2021
SWSR: A Chinese Dataset and Lexicon for Online Sexism Detection
Aiqi Jiang
Xiaohan Yang
Yang Liu
A. Zubiaga
247
100
0
06 Aug 2021
Your fairness may vary: Pretrained language model fairness in toxic text classification
Ioana Baldini
Dennis L. Wei
Karthikeyan N. Ramamurthy
Mikhail Yurochkin
Moninder Singh
446
57
0
03 Aug 2021
Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset to Fight Online Hate Speech
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Margherita Fanton
Helena Bonaldi
Serra Sinem Tekiroğlu
Marco Guerini
240
133
0
19 Jul 2021
A Legal Approach to Hate Speech: Operationalizing the EU's Legal Framework against the Expression of Hatred as an NLP Task
Frederike Zufall
Marius Hamacher
Katharina Kloppenborg
Torsten Zesch
AILaw
191
18
0
07 Apr 2020
1
Page 1 of 1