v1v2 (latest)

Confronting Abusive Language Online: A Survey from the Ethical and Human Rights Perspective

Journal of Artificial Intelligence Research (JAIR), 2020

22 December 2020

Papers citing "Confronting Abusive Language Online: A Survey from the Ethical and Human Rights Perspective"

41 / 41 papers shown

Beating Harmful Stereotypes Through Facts: RAG-based Counter-speech Generation

Greta Damo

Elena Cabrio

S. Villata

124

14 Oct 2025

Toxicity in Online Platforms and AI Systems: A Survey of Needs, Challenges, Mitigations, and Future DirectionsExpert systems with applications (ESWA), 2025

216

29 Sep 2025

Conversations Gone Awry, But Then? Evaluating Conversational Forecasting Models

Cristian Danescu-Niculescu-Mizil

AI4TS

206

25 Jul 2025

Cracking the Code: Enhancing Implicit Hate Speech Detection through Coding Classification

262

05 Jun 2025

WildFireCan-MMD: A Multimodal Dataset for Classification of User-Generated Content During Wildfires in Canada

Braeden Sherritt

Isar Nejadgholi

Efstratios Aivaliotis

Khaled Mslmani

Marzieh Amini

VLM

622

17 Apr 2025

Tackling Social Bias against the Poor: A Dataset and Taxonomy on AporophobiaNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

Georgina Curto

S. Kiritchenko

Muhammad Hammad Fahim Siddiqui

I. Nejadgholi

Kathleen C. Fraser

196

17 Apr 2025

From Intrinsic Toxicity to Reception-Based Toxicity: A Contextual Framework for Prediction and Evaluation

Sergey Berezin

R. Farahbakhsh

Noel Crespi

371

20 Mar 2025

Northeastern Uni at Multilingual Counterspeech Generation: Enhancing Counter Speech Generation with LLM Alignment through Direct Preference Optimization

272

19 Dec 2024

KidLM: Advancing Language Models for Children -- Early Insights and Future DirectionsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Mir Tafseer Nayeem

Davood Rafiei

ALM

337

04 Oct 2024

Knowledge-Aware Conversation Derailment Forecasting Using Graph Convolutional Networks

377

24 Aug 2024

Navigating LLM Ethics: Advancements, Challenges, and Future DirectionsAI and Ethics (AI & Ethics), 2024

728

14 May 2024

Exploring Boundaries and Intensities in Offensive and Hate Speech: Unveiling the Complex Spectrum of Social Media Discourse

Abinew Ali Ayele

Esubalew alemneh Jalew

Adem Chanie Ali

Seid Muhie Yimam

Christian Biemann

197

18 Apr 2024

D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation

Aida Mostafazadeh Davani

Mark Díaz

Dylan K. Baker

Vinodkumar Prabhakaran

281

16 Apr 2024

NLP for Counterspeech against Hate: A Survey and How-To Guide

468

29 Mar 2024

GPT-HateCheck: Can LLMs Write Better Functional Tests for Hate Speech Detection?

Yiping Jin

Leo Wanner

A. Shvets

308

23 Feb 2024

Quantifying Stereotypes in LanguageConference of the European Chapter of the Association for Computational Linguistics (EACL), 2024

Yang Liu

240

28 Jan 2024

A Critical Reflection on the Use of Toxicity Detection Algorithms in Proactive Content Moderation Systems

369

19 Jan 2024

Key to Kindness: Reducing Toxicity In Online Discourse Through Proactive Content Moderation in a Mobile Keyboard

266

19 Jan 2024

Cross-lingual Offensive Language Detection: A Systematic Review of Datasets, Transfer Approaches and Challenges

Aiqi Jiang

A. Zubiaga

AAML

401

17 Jan 2024

Consolidating Strategies for Countering Hate Speech Using Persuasive DialoguesICON (ICON), 2024

Sougata Saha

Rohini Srihari

206

15 Jan 2024

Disentangling Perceptions of Offensiveness: Cultural and Moral Correlates

Aida Mostafazadeh Davani

Mark Díaz

Dylan K. Baker

Vinodkumar Prabhakaran

AAML

243

11 Dec 2023

Conversation Derailment Forecasting with Graph Convolutional Networks

245

22 Jun 2023

Toxic comments reduce the activity of volunteer editors on WikipediaPNAS Nexus (PNAS Nexus), 2023

193

26 Apr 2023

The crime of being poor

340

24 Mar 2023

Interactive Text GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

380

02 Mar 2023

Leveraging World Knowledge in Implicit Hate Speech Detection

Jessica Lin

201

28 Dec 2022

Foveate, Attribute, and Rationalize: Towards Physically Safe and Trustworthy AIAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Alex Mei

Sharon Levy

William Yang Wang

347

19 Dec 2022

Undesirable Biases in NLP: Addressing Challenges of Measurement

533

24 Nov 2022

Human-Machine Collaboration Approaches to Build a Dialogue Dataset for Hate Speech CounteringConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Helena Bonaldi

Sara Dellantonio

Serra Sinem Tekiroğlu

Marco Guerini

238

07 Nov 2022

Mitigating Covertly Unsafe Text within Natural Language SystemsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Kathleen McKeown

373

17 Oct 2022

Metaphorical Paraphrase Generation: Feeding Metaphorical Language Models with Literal Texts

Giorgio Ottolina

John Pavlopoulos

225

10 Oct 2022

Hate Speech Criteria: A Modular Approach to Task-Specific Hate Speech Definitions

177

30 Jun 2022

Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech DetectionNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022

231

06 May 2022

Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation VectorsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

I. Nejadgholi

Kathleen C. Fraser

S. Kiritchenko

167

05 Apr 2022

Using Pre-Trained Language Models for Producing Counter Narratives Against Hate Speech: a Comparative StudyFindings (Findings), 2022

Serra Sinem Tekiroğlu

Helena Bonaldi

Margherita Fanton

Marco Guerini

329

04 Apr 2022

Dynamic Forecasting of Conversation DerailmentConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Yova Kementchedjhieva

Anders Søgaard

AI4TS

116

11 Oct 2021

Countering Online Hate Speech: An NLP Perspective

Mudit Chaudhary

Chandni Saxena

Helen Meng

154

07 Sep 2021

SWSR: A Chinese Dataset and Lexicon for Online Sexism Detection

Aiqi Jiang

Xiaohan Yang

Yang Liu

A. Zubiaga

247

100

06 Aug 2021

Your fairness may vary: Pretrained language model fairness in toxic text classification

Ioana Baldini

Dennis L. Wei

Karthikeyan N. Ramamurthy

Mikhail Yurochkin

Moninder Singh

446

03 Aug 2021

Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset to Fight Online Hate SpeechAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Margherita Fanton

Helena Bonaldi

Serra Sinem Tekiroğlu

Marco Guerini

240

133

19 Jul 2021

A Legal Approach to Hate Speech: Operationalizing the EU's Legal Framework against the Expression of Hatred as an NLP Task

Frederike Zufall

Marius Hamacher

Katharina Kloppenborg

Torsten Zesch

AILaw

191

07 Apr 2020