Using Pre-Trained Language Models for Producing Counter Narratives Against Hate Speech: a Comparative Study

4 April 2022

Serra Sinem Tekiroğlu

Papers citing "Using Pre-Trained Language Models for Producing Counter Narratives Against Hate Speech: a Comparative Study"

23 / 23 papers shown

Title
Debunking with Dialogue? Exploring AI-Generated Counterspeech to Challenge Conspiracy Theories Mareike Lisker Christina Gottschalk Helena Mihaljević 33 0 0 23 Apr 2025
Echoes of Discord: Forecasting Hater Reactions to Counterspeech Xiaoying Song Sharon Lisseth Perez Xinchen Yu Eduardo Blanco Lingzi Hong 121 0 0 17 Feb 2025
Assessing the Human Likeness of AI-Generated Counterspeech Xiaoying Song Sujana Mamidisetty Eduardo Blanco Lingzi Hong 21 1 0 14 Oct 2024
Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering Helena Bonaldi Greta Damo Nicolás Benjamín Ocampo Elena Cabrio S. Villata Marco Guerini 38 4 0 04 Oct 2024
Decoding Hate: Exploring Language Models' Reactions to Hate Speech Paloma Piot Javier Parapar 43 1 0 01 Oct 2024
A LLM-Based Ranking Method for the Evaluation of Automatic Counter-Narrative Generation I. Zubiaga A. Soroa Rodrigo Agerri 34 4 0 21 Jun 2024
Fine-tuning with HED-IT: The impact of human post-editing for dialogical language models Daniela Occhipinti Michele Marchi Irene Mondella Huiyuan Lai F. Dell’Orletta Malvina Nissim Marco Guerini 23 1 0 11 Jun 2024
NLP for Counterspeech against Hate: A Survey and How-To Guide Helena Bonaldi Yi-Ling Chung Gavin Abercrombie Marco Guerini AAML 31 13 0 29 Mar 2024
Outcome-Constrained Large Language Models for Countering Hate Speech Lingzi Hong Pengcheng Luo Eduardo Blanco Xiaoying Song 36 6 0 25 Mar 2024
On Zero-Shot Counterspeech Generation by LLMs Punyajoy Saha Aalok Agrawal Abhik Jana Chris Biemann Animesh Mukherjee 30 12 0 22 Mar 2024
Basque and Spanish Counter Narrative Generation: Data Creation and Evaluation Jaione Bengoetxea Yi-Ling Chung Marco Guerini Rodrigo Agerri 44 4 0 14 Mar 2024
A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models Jaylen Jones Lingbo Mo Eric Fosler-Lussier Huan Sun 48 3 0 18 Feb 2024
Low-Resource Counterspeech Generation for Indic Languages: The Case of Bengali and Hindi Mithun Das Saurabh Kumar Pandey Shivansh Sethi Punyajoy Saha Animesh Mukherjee 25 2 0 11 Feb 2024
Automatic Evaluation of Generative Models with Instruction Tuning Shuhaib Mehri Vered Shwartz ELM ALM 8 1 0 30 Oct 2023
HateRephrase: Zero- and Few-Shot Reduction of Hate Intensity in Online Posts using Large Language Models Vibhor Agarwal Yu Chen Nishanth R. Sastry 16 6 0 21 Oct 2023
Weigh Your Own Words: Improving Hate Speech Counter Narrative Generation via Attention Regularization Helena Bonaldi Giuseppe Attanasio Debora Nozza Marco Guerini 16 6 0 05 Sep 2023
Let the Models Respond: Interpreting Language Model Detoxification Through the Lens of Prompt Dependence Daniel Scalena Gabriele Sarti Malvina Nissim Elisabetta Fersini 11 0 0 01 Sep 2023
Understanding Counterspeech for Online Harm Mitigation Yi-Ling Chung Gavin Abercrombie Florence E. Enock Jonathan Bright Verena Rieser 25 16 0 01 Jul 2023
COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements Xuhui Zhou Haojie Zhu Akhila Yerukola Thomas Davidson Jena D. Hwang Swabha Swayamdipta Maarten Sap 19 33 0 03 Jun 2023
Response Generation in Longitudinal Dialogues: Which Knowledge Representation Helps? Seyed Mahed Mousavi Simone Caldarella Giuseppe Riccardi 24 5 0 25 May 2023
Manifestations of Xenophobia in AI Systems Nenad Tomašev J. L. Maynard Iason Gabriel 24 9 0 15 Dec 2022
Human-Machine Collaboration Approaches to Build a Dialogue Dataset for Hate Speech Countering Helena Bonaldi Sara Dellantonio Serra Sinem Tekiroğlu Marco Guerini 21 41 0 07 Nov 2022
Deep Reinforcement Learning for Dialogue Generation Jiwei Li Will Monroe Alan Ritter Michel Galley Jianfeng Gao Dan Jurafsky 198 1,327 0 05 Jun 2016