Human-Machine Collaboration Approaches to Build a Dialogue Dataset for Hate Speech Countering

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

7 November 2022

Helena Bonaldi

Sara Dellantonio

Serra Sinem Tekiroğlu

Marco Guerini

ArXiv (abs)PDF HTML

Papers citing "Human-Machine Collaboration Approaches to Build a Dialogue Dataset for Hate Speech Countering"

33 / 33 papers shown

SPOT: An Annotated French Corpus and Benchmark for Detecting Critical Interventions in Online Conversations

Manon Berriche

Célia Nouri

Chloé Clavel

Jean-Philippe Cointet

176

10 Nov 2025

Beating Harmful Stereotypes Through Facts: RAG-based Counter-speech Generation

Greta Damo

Elena Cabrio

S. Villata

111

14 Oct 2025

Can NLP Tackle Hate Speech in the Real World? Stakeholder-Informed Feedback and Survey on Counterspeech

110

06 Aug 2025

EMBRACE: Shaping Inclusive Opinion Representation by Aligning Implicit Conversations with Social Norms

Abeer Aldayel

Areej Alokaili

162

27 Jul 2025

Think Like a Person Before Responding: A Multi-Faceted Evaluation of Persona-Guided LLMs for Countering Hate

Mikel K. Ngueajio

Flor Miriam Plaza del Arco

Yi-Ling Chung

D. Rawat

Amanda Cercas Curry

192

04 Jun 2025

Counterspeech the ultimate shield! Multi-Conditioned Counterspeech Generation through Attributed Prefix LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Aswini Kumar Padhi

Anil Bandhakavi

Tanmoy Chakraborty

450

17 May 2025

Debunking with Dialogue? Exploring AI-Generated Counterspeech to Challenge Conspiracy Theories

Mareike Lisker

Christina Gottschalk

Helena Mihaljević

209

23 Apr 2025

Policy Learning with a Natural Language Action Space: A Causal Approach

218

24 Feb 2025

Echoes of Discord: Forecasting Hater Reactions to CounterspeechNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

897

17 Feb 2025

ReZG: Retrieval-Augmented Zero-Shot Counter Narrative Generation for Hate Speech

150

31 Dec 2024

Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech CounteringConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Helena Bonaldi

Greta Damo

Nicolás Benjamín Ocampo

Elena Cabrio

S. Villata

Marco Guerini

172

04 Oct 2024

Decoding Hate: Exploring Language Models' Reactions to Hate SpeechNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

Paloma Piot

Javier Parapar

271

01 Oct 2024

A LLM-Based Ranking Method for the Evaluation of Automatic Counter-Narrative Generation

I. Zubiaga

A. Soroa

Rodrigo Agerri

222

21 Jun 2024

NLP Systems That Can't Tell Use from Mention Censor Counterspeech, but Teaching the Distinction HelpsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

Kristina Gligorić

Myra Cheng

Lucia Zheng

Esin Durmus

Dan Jurafsky

219

02 Apr 2024

Revealing Trends in Datasets from the 2022 ACL and EMNLP Conferences

Jesse Atuhurra

Hidetaka Kamigaito

350

31 Mar 2024

Causal Inference for Human-Language Model Collaboration

Bohan Zhang

Yixin Wang

Paramveer S. Dhillon

199

30 Mar 2024

NLP for Counterspeech against Hate: A Survey and How-To Guide

303

29 Mar 2024

Outcome-Constrained Large Language Models for Countering Hate Speech

305

25 Mar 2024

Hatred Stems from Ignorance! Distillation of the Persuasion Modes in Countering Conversational Hate Speech

Ghadi Alyahya

Abeer Aldayel

210

18 Mar 2024

A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models

Jaylen Jones

Lingbo Mo

Eric Fosler-Lussier

Huan Sun

307

18 Feb 2024

Navigating the OverKill in Large Language Models

Xuanjing Huang

Dahua Lin

215

31 Jan 2024

Alternative Speech: Complementary Method to Counter-Narrative for Better Discourse

198

26 Jan 2024

Consolidating Strategies for Countering Hate Speech Using Persuasive DialoguesICON (ICON), 2024

Sougata Saha

Rohini Srihari

149

15 Jan 2024

DisCGen: A Framework for Discourse-Informed Counterspeech GenerationInternational Joint Conference on Natural Language Processing (IJCNLP), 2023

Sabit Hassan

Malihe Alikhani

225

29 Nov 2023

Beyond Denouncing Hate: Strategies for Countering Implied Biases and Stereotypes in LanguageConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

124

31 Oct 2023

Weigh Your Own Words: Improving Hate Speech Counter Narrative Generation via Attention Regularization

242

05 Sep 2023

Let the Models Respond: Interpreting Language Model Detoxification Through the Lens of Prompt Dependence

141

01 Sep 2023

Does Collaborative Human-LM Dialogue Generation Help Information Extraction from Human Dialogues?

Bo-Ru Lu

Tao Yu

Mari Ostendorf

229

13 Jul 2023

Understanding Counterspeech for Online Harm Mitigation

166

01 Jul 2023

Response Generation in Longitudinal Dialogues: Which Knowledge Representation Helps?

Seyed Mahed Mousavi

Simone Caldarella

Giuseppe Riccardi

207

25 May 2023

Hate Speech Targets Detection in Parler using BERT

146

03 Apr 2023

CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic NetworkConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

233

02 Mar 2023

Using In-Context Learning to Improve Dialogue SafetyConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Siva Reddy

Yang Liu

Dilek Z. Hakkani-Tür

247

02 Feb 2023