Hate Speech Dataset from a White Supremacy Forum

12 September 2018

Papers citing "Hate Speech Dataset from a White Supremacy Forum"

50 / 201 papers shown

Defining, Understanding, and Detecting Online Toxicity: Challenges and Machine Learning Approaches

Gautam Kishore Shahi

Tim A. Majchrzak

163

14 Sep 2025

MM-HSD: Multi-Modal Hate Speech Detection in Videos

Berta Céspedes-Sarrias

Carlos Collado-Capell

Pablo Rodenas-Ruiz

Olena Hrynenko

Andrea Cavallaro

130

28 Aug 2025

Mapping Toxic Comments Across Demographics: A Dataset from German Public Broadcasting

Jan Fillies

Michael Peter Hoffmann

192

26 Aug 2025

Scaling Up Active Testing to Large Language Models

Gabrielle Berrada

Jannik Kossen

Muhammed Razzak

Freddie Bickford-Smith

Y. Gal

Tom Rainforth

ALM

211

12 Aug 2025

Towards Safer AI Moderation: Evaluating LLM Moderators Through a Unified Benchmark Dataset and Advocating a Human-First Approach

215

09 Aug 2025

Can NLP Tackle Hate Speech in the Real World? Stakeholder-Informed Feedback and Survey on Counterspeech

162

06 Aug 2025

Web(er) of Hate: A Survey on How Hate Speech Is Typed

Luna Wang

Andrew Caines

Alice Hutchings

185

19 Jun 2025

Rethinking Hate Speech Detection on Social Media: Can LLMs Replace Traditional Models?

Daman Deep Singh

Ramanuj Bhattacharjee

Abhijnan Chakraborty

217

15 Jun 2025

AmpleHate: Amplifying the Attention for Versatile Implicit Hate Detection

561

26 May 2025

Optimization-Inspired Few-Shot Adaptation for Large Language Models

357

25 May 2025

Model Risk Management for Generative AI In Financial Institutions

Anwesha Bhattacharyya

337

19 Mar 2025

Improving Hate Speech Classification with Cross-Taxonomy Dataset Integration

Jan Fillies

Adrian Paschke

247

07 Mar 2025

Towards a Robust Framework for Multimodal Hate Detection: A Study on Video vs. Image-based Content

The Web Conference (WWW), 2025

Girish A. Koushik

Diptesh Kanojia

Helen Treharne

320

11 Feb 2025

Cross-Modal Transfer from Memes to Videos: Addressing Data Scarcity in Hateful Video Detection

The Web Conference (WWW), 2025

Han Wang

Rui Yang Tan

Roy Ka-wei Lee

214

28 Jan 2025

Towards Efficient and Explainable Hate Speech Detection via Model Distillation

European Conference on Information Retrieval (ECIR), 2024

Paloma Piot

Javier Parapar

452

167

18 Dec 2024

A Unified Multi-Task Learning Architecture for Hate Detection Leveraging User-Based Information

ICON (ICON), 2024

Prashant Kapil

Asif Ekbal

300

11 Nov 2024

Task Calibration: Calibrating Large Language Models on Inference Tasks

Yingjie Li

Yun Luo

Xiaotian Xie

Yue Zhang

LRM

286

24 Oct 2024

Mitigating Biases to Embrace Diversity: A Comprehensive Annotation Benchmark for Toxic Language

Xinmeng Hou

289

17 Oct 2024

Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Helena Bonaldi

Greta Damo

Nicolás Benjamín Ocampo

Elena Cabrio

S. Villata

Marco Guerini

253

04 Oct 2024

Calibrate to Discriminate: Improve In-Context Learning with Label-Free Comparative Inference

296

03 Oct 2024

What is the social benefit of hate speech detection research? A Systematic Review

Sidney Gig-Jan Wong

189

26 Sep 2024

An Effective, Robust and Fairness-aware Hate Speech Detection Framework

Guanyi Mou

Kyumin Lee

320

25 Sep 2024

Zero-to-Strong Generalization: Eliciting Strong Capabilities of Large Language Models Iteratively without Gold Labels

International Conference on Computational Linguistics (COLING), 2024

Chaoqun Liu

Qin Chao

Wenxuan Zhang

Xiaobao Wu

Boyang Albert Li

Anh Tuan Luu

Lidong Bing

234

19 Sep 2024

Rethinking KenLM: Good and Bad Model Ensembles for Efficient Text Quality Filtering in Large Web Corpora

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

Yungi Kim

Jihoo Kim

195

15 Sep 2024

LLM-based feature generation from text for interpretable machine learning

Machine-mediated learning (ML), 2024

278

11 Sep 2024

Analysis of Socially Unacceptable Discourse with Zero-shot Learning

220

10 Sep 2024

Identity-related Speech Suppression in Generative AI Content Moderation

Oghenefejiro Isaacs Anigboro

Charlie M. Crawford

Danaé Metaxa

Sorelle A. Friedler

525

09 Sep 2024

Rethinking Backdoor Detection Evaluation for Language Models

372

31 Aug 2024

Promoting Equality in Large Language Models: Identifying and Mitigating the Implicit Bias based on Bayesian Theory

Xihe Qiu

Yinghui Xu

Yuan Qi

263

20 Aug 2024

Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

Verna Dankers

Ivan Titov

315

09 Aug 2024

MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube and Bilibili

ACM Multimedia (MM), 2024

308

28 Jul 2024

Stress-Testing Long-Context Language Models with Lifelong ICL and Task Haystack

Xiaoyue Xu

Qinyuan Ye

Xiang Ren

408

23 Jul 2024

LionGuard: Building a Contextualized Moderation Classifier to Tackle Localized Unsafe Content

Jessica Foo

Shaun Khoo

300

24 Jun 2024

Token-based Decision Criteria Are Suboptimal in In-context Learning

684

24 Jun 2024

COT: A Generative Approach for Hate Speech Counter-Narratives via Contrastive Optimal Transport

323

18 Jun 2024

Estimating the Hallucination Rate of Generative AI

Andrew Jesson

Nicolas Beltran-Velez

578

11 Jun 2024

Expert-Guided Extinction of Toxic Tokens for Debiased Generation

296

29 May 2024

Implicit In-context Learning

International Conference on Learning Representations (ICLR), 2024

Di Liu

401

23 May 2024

The Unseen Targets of Hate -- A Systematic Review of Hateful Communication Datasets

Social Science Computer Review (SSCR), 2024

287

14 May 2024

Large Language Model Enhanced Machine Learning Estimators for Classification

107

08 May 2024

From Languages to Geographies: Towards Evaluating Cultural Bias in Hate Speech Datasets

455

27 Apr 2024

Modeling Emotions and Ethics with Large Language Models

Edward Y. Chang

320

15 Apr 2024

Decomposing Label Space, Format and Discrimination: Rethinking How LLMs Respond and Solve Tasks via In-Context Learning

313

11 Apr 2024

Rectifying Demonstration Shortcut in In-Context Learning

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

445

14 Mar 2024

GreenLLaMA: A Framework for Detoxification with Explanations

Md. Tawkat Islam Khondaker

Muhammad Abdul-Mageed

L. Lakshmanan

25 Feb 2024

NoisyICL: A Little Noise in Model Parameters Calibrates In-context Learning

Yufeng Zhao

Yoshihiro Sakai

Naoya Inoue

365

08 Feb 2024

Online Cascade Learning for Efficient Inference over Streams

Lunyiu Nie

Zhimin Ding

Erdong Hu

Christopher M. Jermaine

Swarat Chaudhuri

465

07 Feb 2024

Less is KEN: a Universal and Simple Non-Parametric Pruning Algorithm for Large Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

Michele Mastromattei

Fabio Massimo Zanzotto

VLM

305

05 Feb 2024

Recent Advances in Hate Speech Moderation: Multimodality and the Role of Large Models

Ming Shan Hee

253

30 Jan 2024

APT-Pipe: A Prompt-Tuning Tool for Social Data Annotation using ChatGPT

The Web Conference (WWW), 2024

Lik-Hang Lee

449

24 Jan 2024