2211.03433
Human-Machine Collaboration Approaches to Build a Dialogue Dataset for Hate Speech Countering
7 November 2022
Helena Bonaldi
Sara Dellantonio
Serra Sinem Tekiroğlu
Marco Guerini
Papers citing "Human-Machine Collaboration Approaches to Build a Dialogue Dataset for Hate Speech Countering"
26 / 26 papers shown
Title
Debunking with Dialogue? Exploring AI-Generated Counterspeech to Challenge Conspiracy Theories
Mareike Lisker
Christina Gottschalk
Helena Mihaljević
33
0
0
23 Apr 2025
Policy Learning with a Natural Language Action Space: A Causal Approach
Bohan Zhang
Yixin Wang
Paramveer S. Dhillon
CML
41
0
0
24 Feb 2025
Echoes of Discord: Forecasting Hater Reactions to Counterspeech
Xiaoying Song
Sharon Lisseth Perez
Xinchen Yu
Eduardo Blanco
Lingzi Hong
118
0
0
17 Feb 2025
Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering
Helena Bonaldi
Greta Damo
Nicolás Benjamín Ocampo
Elena Cabrio
S. Villata
Marco Guerini
38
4
0
04 Oct 2024
Decoding Hate: Exploring Language Models' Reactions to Hate Speech
Paloma Piot
Javier Parapar
43
1
0
01 Oct 2024
A LLM-Based Ranking Method for the Evaluation of Automatic Counter-Narrative Generation
I. Zubiaga
A. Soroa
Rodrigo Agerri
34
4
0
21 Jun 2024
NLP Systems That Can't Tell Use from Mention Censor Counterspeech, but Teaching the Distinction Helps
Kristina Gligorić
Myra Cheng
Lucia Zheng
Esin Durmus
Dan Jurafsky
37
8
0
02 Apr 2024
Revealing Trends in Datasets from the 2022 ACL and EMNLP Conferences
Jesse Atuhurra
Hidetaka Kamigaito
36
0
0
31 Mar 2024
Causal Inference for Human-Language Model Collaboration
Bohan Zhang
Yixin Wang
Paramveer S. Dhillon
38
2
0
30 Mar 2024
NLP for Counterspeech against Hate: A Survey and How-To Guide
Helena Bonaldi
Yi-Ling Chung
Gavin Abercrombie
Marco Guerini
AAML
31
13
0
29 Mar 2024
Outcome-Constrained Large Language Models for Countering Hate Speech
Lingzi Hong
Pengcheng Luo
Eduardo Blanco
Xiaoying Song
36
6
0
25 Mar 2024
Hatred Stems from Ignorance! Distillation of the Persuasion Modes in Countering Conversational Hate Speech
Ghadi Alyahya
Abeer Aldayel
38
2
0
18 Mar 2024
A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models
Jaylen Jones
Lingbo Mo
Eric Fosler-Lussier
Huan Sun
48
3
0
18 Feb 2024
Navigating the OverKill in Large Language Models
Chenyu Shi
Xiao Wang
Qiming Ge
Songyang Gao
Xianjun Yang
Tao Gui
Qi Zhang
Xuanjing Huang
Xun Zhao
Dahua Lin
16
11
0
31 Jan 2024
Alternative Speech: Complementary Method to Counter-Narrative for Better Discourse
Seungyoon Lee
Dahyun Jung
Chanjun Park
Seolhwa Lee
Heu-Jeoung Lim
26
1
0
26 Jan 2024
Consolidating Strategies for Countering Hate Speech Using Persuasive Dialogues
Sougata Saha
R. Srihari
25
1
0
15 Jan 2024
DisCGen: A Framework for Discourse-Informed Counterspeech Generation
Sabit Hassan
Malihe Alikhani
38
13
0
29 Nov 2023
Beyond Denouncing Hate: Strategies for Countering Implied Biases and Stereotypes in Language
Jimin Mun
Emily Allaway
Akhila Yerukola
Laura Vianna
Sarah-Jane Leslie
Maarten Sap
16
22
0
31 Oct 2023
Weigh Your Own Words: Improving Hate Speech Counter Narrative Generation via Attention Regularization
Helena Bonaldi
Giuseppe Attanasio
Debora Nozza
Marco Guerini
16
6
0
05 Sep 2023
Let the Models Respond: Interpreting Language Model Detoxification Through the Lens of Prompt Dependence
Daniel Scalena
Gabriele Sarti
Malvina Nissim
Elisabetta Fersini
11
0
0
01 Sep 2023
Does Collaborative Human-LM Dialogue Generation Help Information Extraction from Human Dialogues?
Bo-Ru Lu
Nikita Haduong
Chia-Hsuan Lee
Zeqiu Wu
Hao Cheng
Paul Koester
J. Utke
Tao Yu
Noah A. Smith
Mari Ostendorf
SyDa
47
2
0
13 Jul 2023
Understanding Counterspeech for Online Harm Mitigation
Yi-Ling Chung
Gavin Abercrombie
Florence E. Enock
Jonathan Bright
Verena Rieser
25
16
0
01 Jul 2023
Response Generation in Longitudinal Dialogues: Which Knowledge Representation Helps?
Seyed Mahed Mousavi
Simone Caldarella
Giuseppe Riccardi
24
5
0
25 May 2023
Hate Speech Targets Detection in Parler using BERT
Nadav Schneider
Shimon Shouei
Saleem Ghantous
Elad Feldman
13
4
0
03 Apr 2023
CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network
Sreyan Ghosh
Manan Suri
Purva Chiniya
Utkarsh Tyagi
Sonal Kumar
Dinesh Manocha
21
12
0
02 Mar 2023
Using In-Context Learning to Improve Dialogue Safety
Nicholas Meade
Spandana Gella
Devamanyu Hazarika
Prakhar Gupta
Di Jin
Siva Reddy
Yang Liu
Dilek Z. Hakkani-Tür
25
38
0
02 Feb 2023