ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.01440
  4. Cited By
Using Pre-Trained Language Models for Producing Counter Narratives
  Against Hate Speech: a Comparative Study

Using Pre-Trained Language Models for Producing Counter Narratives Against Hate Speech: a Comparative Study

4 April 2022
Serra Sinem Tekiroğlu
Helena Bonaldi
Margherita Fanton
Marco Guerini
ArXivPDFHTML

Papers citing "Using Pre-Trained Language Models for Producing Counter Narratives Against Hate Speech: a Comparative Study"

23 / 23 papers shown
Title
Debunking with Dialogue? Exploring AI-Generated Counterspeech to Challenge Conspiracy Theories
Debunking with Dialogue? Exploring AI-Generated Counterspeech to Challenge Conspiracy Theories
Mareike Lisker
Christina Gottschalk
Helena Mihaljević
33
0
0
23 Apr 2025
Echoes of Discord: Forecasting Hater Reactions to Counterspeech
Echoes of Discord: Forecasting Hater Reactions to Counterspeech
Xiaoying Song
Sharon Lisseth Perez
Xinchen Yu
Eduardo Blanco
Lingzi Hong
121
0
0
17 Feb 2025
Assessing the Human Likeness of AI-Generated Counterspeech
Assessing the Human Likeness of AI-Generated Counterspeech
Xiaoying Song
Sujana Mamidisetty
Eduardo Blanco
Lingzi Hong
21
1
0
14 Oct 2024
Is Safer Better? The Impact of Guardrails on the Argumentative Strength
  of LLMs in Hate Speech Countering
Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering
Helena Bonaldi
Greta Damo
Nicolás Benjamín Ocampo
Elena Cabrio
S. Villata
Marco Guerini
38
4
0
04 Oct 2024
Decoding Hate: Exploring Language Models' Reactions to Hate Speech
Decoding Hate: Exploring Language Models' Reactions to Hate Speech
Paloma Piot
Javier Parapar
43
1
0
01 Oct 2024
A LLM-Based Ranking Method for the Evaluation of Automatic
  Counter-Narrative Generation
A LLM-Based Ranking Method for the Evaluation of Automatic Counter-Narrative Generation
I. Zubiaga
A. Soroa
Rodrigo Agerri
34
4
0
21 Jun 2024
Fine-tuning with HED-IT: The impact of human post-editing for dialogical
  language models
Fine-tuning with HED-IT: The impact of human post-editing for dialogical language models
Daniela Occhipinti
Michele Marchi
Irene Mondella
Huiyuan Lai
F. Dell’Orletta
Malvina Nissim
Marco Guerini
23
1
0
11 Jun 2024
NLP for Counterspeech against Hate: A Survey and How-To Guide
NLP for Counterspeech against Hate: A Survey and How-To Guide
Helena Bonaldi
Yi-Ling Chung
Gavin Abercrombie
Marco Guerini
AAML
31
13
0
29 Mar 2024
Outcome-Constrained Large Language Models for Countering Hate Speech
Outcome-Constrained Large Language Models for Countering Hate Speech
Lingzi Hong
Pengcheng Luo
Eduardo Blanco
Xiaoying Song
36
6
0
25 Mar 2024
On Zero-Shot Counterspeech Generation by LLMs
On Zero-Shot Counterspeech Generation by LLMs
Punyajoy Saha
Aalok Agrawal
Abhik Jana
Chris Biemann
Animesh Mukherjee
30
12
0
22 Mar 2024
Basque and Spanish Counter Narrative Generation: Data Creation and
  Evaluation
Basque and Spanish Counter Narrative Generation: Data Creation and Evaluation
Jaione Bengoetxea
Yi-Ling Chung
Marco Guerini
Rodrigo Agerri
44
4
0
14 Mar 2024
A Multi-Aspect Framework for Counter Narrative Evaluation using Large
  Language Models
A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models
Jaylen Jones
Lingbo Mo
Eric Fosler-Lussier
Huan Sun
48
3
0
18 Feb 2024
Low-Resource Counterspeech Generation for Indic Languages: The Case of
  Bengali and Hindi
Low-Resource Counterspeech Generation for Indic Languages: The Case of Bengali and Hindi
Mithun Das
Saurabh Kumar Pandey
Shivansh Sethi
Punyajoy Saha
Animesh Mukherjee
25
2
0
11 Feb 2024
Automatic Evaluation of Generative Models with Instruction Tuning
Automatic Evaluation of Generative Models with Instruction Tuning
Shuhaib Mehri
Vered Shwartz
ELM
ALM
8
1
0
30 Oct 2023
HateRephrase: Zero- and Few-Shot Reduction of Hate Intensity in Online
  Posts using Large Language Models
HateRephrase: Zero- and Few-Shot Reduction of Hate Intensity in Online Posts using Large Language Models
Vibhor Agarwal
Yu Chen
Nishanth R. Sastry
16
6
0
21 Oct 2023
Weigh Your Own Words: Improving Hate Speech Counter Narrative Generation
  via Attention Regularization
Weigh Your Own Words: Improving Hate Speech Counter Narrative Generation via Attention Regularization
Helena Bonaldi
Giuseppe Attanasio
Debora Nozza
Marco Guerini
16
6
0
05 Sep 2023
Let the Models Respond: Interpreting Language Model Detoxification
  Through the Lens of Prompt Dependence
Let the Models Respond: Interpreting Language Model Detoxification Through the Lens of Prompt Dependence
Daniel Scalena
Gabriele Sarti
Malvina Nissim
Elisabetta Fersini
11
0
0
01 Sep 2023
Understanding Counterspeech for Online Harm Mitigation
Understanding Counterspeech for Online Harm Mitigation
Yi-Ling Chung
Gavin Abercrombie
Florence E. Enock
Jonathan Bright
Verena Rieser
25
16
0
01 Jul 2023
COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive
  Statements
COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements
Xuhui Zhou
Haojie Zhu
Akhila Yerukola
Thomas Davidson
Jena D. Hwang
Swabha Swayamdipta
Maarten Sap
19
33
0
03 Jun 2023
Response Generation in Longitudinal Dialogues: Which Knowledge
  Representation Helps?
Response Generation in Longitudinal Dialogues: Which Knowledge Representation Helps?
Seyed Mahed Mousavi
Simone Caldarella
Giuseppe Riccardi
24
5
0
25 May 2023
Manifestations of Xenophobia in AI Systems
Manifestations of Xenophobia in AI Systems
Nenad Tomašev
J. L. Maynard
Iason Gabriel
24
9
0
15 Dec 2022
Human-Machine Collaboration Approaches to Build a Dialogue Dataset for
  Hate Speech Countering
Human-Machine Collaboration Approaches to Build a Dialogue Dataset for Hate Speech Countering
Helena Bonaldi
Sara Dellantonio
Serra Sinem Tekiroğlu
Marco Guerini
21
41
0
07 Nov 2022
Deep Reinforcement Learning for Dialogue Generation
Deep Reinforcement Learning for Dialogue Generation
Jiwei Li
Will Monroe
Alan Ritter
Michel Galley
Jianfeng Gao
Dan Jurafsky
198
1,327
0
05 Jun 2016
1