v1v2 (latest)

HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

1 November 2023

ArXiv (abs)PDF HTML Github (25★)

Papers citing "HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning"

14 / 14 papers shown

Aligning Attention with Human Rationales for Self-Explaining Hate Speech Detection

Brage Eilertsen

Røskva Bjørgfinsdóttir

Francielle Vargas

Ali Ramezani-Kebrya

112

10 Nov 2025

Toxic Ink on Immutable Paper: Content Moderation for Ethereum Input Data Messages (IDMs)

134

12 Oct 2025

ExPO-HM: Learning to Explain-then-Detect for Hateful Meme Detection

175

08 Oct 2025

Fine-Tuning Large Language Models with QLoRA for Offensive Language Detection in Roman Urdu-English Code-Mixed Text

220

04 Oct 2025

LLM-Based Multi-Task Bangla Hate Speech Detection: Type, Severity, and Target

207

02 Oct 2025

Are LLMs Enough for Hyperpartisan, Fake, Polarized and Harmful Content Detection? Evaluating In-Context Learning vs. Fine-Tuning

Michele Joshua Maggini

Dhia Merzougui

Rabiraj Bandyopadhyay

Gaël Dias

Fabrice Maurel

Pablo Gamallo

189

09 Sep 2025

Specializing General-purpose LLM Embeddings for Implicit Hate Speech Detection across Datasets

167

28 Aug 2025

Argument-Based Consistency in Toxicity Explanations of LLMs

Ramaravind Kommiya Mothilal

Joanna Roy

Syed Ishtiaque Ahmed

Shion Guha

221

23 Jun 2025

Selective Demonstration Retrieval for Improved Implicit Hate Speech Detection

Yumin Kim

Donghoon Shin

279

16 Apr 2025

MemeIntel: Explainable Detection of Propagandistic and Hateful Memes

Mohamed Bayan Kmainasi

332

23 Feb 2025

Towards Efficient and Explainable Hate Speech Detection via Model DistillationEuropean Conference on Information Retrieval (ECIR), 2024

Paloma Piot

Javier Parapar

432

167

18 Dec 2024

CFSafety: Comprehensive Fine-grained Safety Assessment for LLMs

Zhihao Liu

Chenhui Hu

ALM ELM

240

29 Oct 2024

Hate Personified: Investigating the role of LLMs in content moderationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Tanmoy Chakraborty

292

03 Oct 2024

HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive Speech Detection via Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

H. Nghiem

Hal Daumé

446

18 Mar 2024