1809.04444
Cited By
Hate Speech Dataset from a White Supremacy Forum
12 September 2018
Ona de Gibert
Naiara Pérez
Aitor García-Pablos
Montse Cuadros
Papers citing "Hate Speech Dataset from a White Supremacy Forum"
50 / 201 papers shown
Defining, Understanding, and Detecting Online Toxicity: Challenges and Machine Learning Approaches
Gautam Kishore Shahi
Tim A. Majchrzak
163
0
0
14 Sep 2025
MM-HSD: Multi-Modal Hate Speech Detection in Videos
Berta Céspedes-Sarrias
Carlos Collado-Capell
Pablo Rodenas-Ruiz
Olena Hrynenko
Andrea Cavallaro
130
6
0
28 Aug 2025
Mapping Toxic Comments Across Demographics: A Dataset from German Public Broadcasting
Jan Fillies
Michael Peter Hoffmann
Rebecca Reichel
Roman Salzwedel
Sven Bodemer
Adrian Paschke
192
1
0
26 Aug 2025
Scaling Up Active Testing to Large Language Models
Gabrielle Berrada
Jannik Kossen
Muhammed Razzak
Freddie Bickford-Smith
Y. Gal
Tom Rainforth
ALM
211
3
0
12 Aug 2025
Towards Safer AI Moderation: Evaluating LLM Moderators Through a Unified Benchmark Dataset and Advocating a Human-First Approach
Naseem Machlovi
Maryam Saleki
Innocent Ababio
Ruhul Amin
215
4
0
09 Aug 2025
Can NLP Tackle Hate Speech in the Real World? Stakeholder-Informed Feedback and Survey on Counterspeech
Tanvi Dinkar
Aiqi Jiang
Simona Frenda
Poppy Gerrard-Abbott
Nancie Gunson
Gavin Abercrombie
Ioannis Konstas
162
0
0
06 Aug 2025
Web(er) of Hate: A Survey on How Hate Speech Is Typed
Luna Wang
Andrew Caines
Alice Hutchings
185
0
0
19 Jun 2025
Rethinking Hate Speech Detection on Social Media: Can LLMs Replace Traditional Models?
Daman Deep Singh
Ramanuj Bhattacharjee
Abhijnan Chakraborty
217
3
0
15 Jun 2025
AmpleHate: Amplifying the Attention for Versatile Implicit Hate Detection
Yejin Lee
Joonghyuk Hahn
Hyeseon Ahn
Yo-Sub Han
561
0
0
26 May 2025
Optimization-Inspired Few-Shot Adaptation for Large Language Models
Boyan Gao
Xin Wang
Jianlong Wu
David A. Clifton
357
0
0
25 May 2025
Model Risk Management for Generative AI In Financial Institutions
Anwesha Bhattacharyya
Ye Yu
Hanyu Yang
Rahul Singh
Tarun Joshi
Jie Chen
Kiran Yalavarthy
AIFin
MedIm
337
3
0
19 Mar 2025
Improving Hate Speech Classification with Cross-Taxonomy Dataset Integration
Jan Fillies
Adrian Paschke
247
1
0
07 Mar 2025
Towards a Robust Framework for Multimodal Hate Detection: A Study on Video vs. Image-based Content
The Web Conference (WWW), 2025
Girish A. Koushik
Diptesh Kanojia
Helen Treharne
320
12
0
11 Feb 2025
Cross-Modal Transfer from Memes to Videos: Addressing Data Scarcity in Hateful Video Detection
The Web Conference (WWW), 2025
Han Wang
Rui Yang Tan
Roy Ka-wei Lee
214
10
0
28 Jan 2025
Towards Efficient and Explainable Hate Speech Detection via Model Distillation
European Conference on Information Retrieval (ECIR), 2024
Paloma Piot
Javier Parapar
452
167
0
18 Dec 2024
A Unified Multi-Task Learning Architecture for Hate Detection Leveraging User-Based Information
ICON (ICON), 2024
Prashant Kapil
Asif Ekbal
300
1
0
11 Nov 2024
Task Calibration: Calibrating Large Language Models on Inference Tasks
Yingjie Li
Yun Luo
Xiaotian Xie
Yue Zhang
LRM
286
2
0
24 Oct 2024
Mitigating Biases to Embrace Diversity: A Comprehensive Annotation Benchmark for Toxic Language
Xinmeng Hou
289
1
0
17 Oct 2024
Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Helena Bonaldi
Greta Damo
Nicolás Benjamín Ocampo
Elena Cabrio
S. Villata
Marco Guerini
253
14
0
04 Oct 2024
Calibrate to Discriminate: Improve In-Context Learning with Label-Free Comparative Inference
Wei Cheng
Tianlu Wang
Yanmin Ji
Fan Yang
Keren Tan
Yiyu Zheng
296
0
0
03 Oct 2024
What is the social benefit of hate speech detection research? A Systematic Review
Sidney Gig-Jan Wong
189
1
0
26 Sep 2024
An Effective, Robust and Fairness-aware Hate Speech Detection Framework
Guanyi Mou
Kyumin Lee
320
5
0
25 Sep 2024
Zero-to-Strong Generalization: Eliciting Strong Capabilities of Large Language Models Iteratively without Gold Labels
International Conference on Computational Linguistics (COLING), 2024
Chaoqun Liu
Qin Chao
Wenxuan Zhang
Xiaobao Wu
Boyang Albert Li
Anh Tuan Luu
Lidong Bing
234
4
0
19 Sep 2024
Rethinking KenLM: Good and Bad Model Ensembles for Efficient Text Quality Filtering in Large Web Corpora
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Yungi Kim
Hyunsoo Ha
Sukyung Lee
Jihoo Kim
Seonghoon Yang
Chanjun Park
195
1
0
15 Sep 2024
LLM-based feature generation from text for interpretable machine learning
Machine-mediated learning (ML), 2024
Vojtěch Balek
Lukáš Sýkora
Vilém Sklenák
Tomáš Kliegr
278
12
0
11 Sep 2024
Analysis of Socially Unacceptable Discourse with Zero-shot Learning
Rayane Ghilene
Dimitra Niaouri
Michele Linardi
Julien Longhi
220
1
0
10 Sep 2024
Identity-related Speech Suppression in Generative AI Content Moderation
Oghenefejiro Isaacs Anigboro
Charlie M. Crawford
Danaé Metaxa
Sorelle A. Friedler
525
3
0
09 Sep 2024
Rethinking Backdoor Detection Evaluation for Language Models
Jun Yan
Wenjie Jacky Mo
Xiang Ren
Robin Jia
ELM
372
5
0
31 Aug 2024
Promoting Equality in Large Language Models: Identifying and Mitigating the Implicit Bias based on Bayesian Theory
Yongxin Deng
Xihe Qiu
Jue Chen
Jing Pan
Chen Jue
Zhijun Fang
Yinghui Xu
Wei Chu
Yuan Qi
263
4
0
20 Aug 2024
Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Verna Dankers
Ivan Titov
315
10
0
09 Aug 2024
MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube and Bilibili
ACM Multimedia (MM), 2024
Han Wang
Tan Rui Yang
Usman Naseem
Roy Ka-wei Lee
308
32
0
28 Jul 2024
Stress-Testing Long-Context Language Models with Lifelong ICL and Task Haystack
Xiaoyue Xu
Qinyuan Ye
Xiang Ren
408
17
0
23 Jul 2024
LionGuard: Building a Contextualized Moderation Classifier to Tackle Localized Unsafe Content
Jessica Foo
Shaun Khoo
300
7
0
24 Jun 2024
Token-based Decision Criteria Are Suboptimal in In-context Learning
Hakaze Cho
Yoshihiro Sakai
Mariko Kato
Kenshiro Tanaka
Akira Ishii
Naoya Inoue
684
7
0
24 Jun 2024
COT: A Generative Approach for Hate Speech Counter-Narratives via Contrastive Optimal Transport
Linhao Zhang
Li Jin
Guangluan Xu
Xiaoyu Li
Xian Sun
323
3
0
18 Jun 2024
Estimating the Hallucination Rate of Generative AI
Andrew Jesson
Nicolas Beltran-Velez
Quentin Chu
Sweta Karlekar
Jannik Kossen
Yarin Gal
John P. Cunningham
David M. Blei
578
35
0
11 Jun 2024
Expert-Guided Extinction of Toxic Tokens for Debiased Generation
Xueyao Sun
Kaize Shi
Haoran Tang
Guandong Xu
Qing Li
MU
296
2
0
29 May 2024
Implicit In-context Learning
International Conference on Learning Representations (ICLR), 2024
Zhuowei Li
Zihao Xu
Ligong Han
Yunhe Gao
Song Wen
Di Liu
Hao Wang
Dimitris N. Metaxas
401
10
0
23 May 2024
The Unseen Targets of Hate -- A Systematic Review of Hateful Communication Datasets
Social Science Computer Review (SSCR), 2024
Zehui Yu
Indira Sen
Dennis Assenmacher
Mattia Samory
Leon Fröhling
Christina Dahn
Debora Nozza
Claudia Wagner
287
11
0
14 May 2024
Large Language Model Enhanced Machine Learning Estimators for Classification
Yuhang Wu
Yingfei Wang
Chu Wang
Zeyu Zheng
107
3
0
08 May 2024
From Languages to Geographies: Towards Evaluating Cultural Bias in Hate Speech Datasets
Manuel Tonneau
Diyi Liu
Samuel Fraiberger
Ralph Schroeder
Scott A. Hale
Paul Röttger
455
23
0
27 Apr 2024
Modeling Emotions and Ethics with Large Language Models
Edward Y. Chang
320
2
0
15 Apr 2024
Decomposing Label Space, Format and Discrimination: Rethinking How LLMs Respond and Solve Tasks via In-Context Learning
Quanyu Long
Yin Wu
Wenya Wang
Sinno Jialin Pan
313
9
0
11 Apr 2024
Rectifying Demonstration Shortcut in In-Context Learning
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Joonwon Jang
Sanghwan Jang
Wonbin Kweon
Minjin Jeon
Hwanjo Yu
445
7
0
14 Mar 2024
GreenLLaMA: A Framework for Detoxification with Explanations
Md. Tawkat Islam Khondaker
Muhammad Abdul-Mageed
L. Lakshmanan
60
14
0
25 Feb 2024
NoisyICL: A Little Noise in Model Parameters Calibrates In-context Learning
Yufeng Zhao
Yoshihiro Sakai
Naoya Inoue
365
8
0
08 Feb 2024
Online Cascade Learning for Efficient Inference over Streams
Lunyiu Nie
Zhimin Ding
Erdong Hu
Christopher M. Jermaine
Swarat Chaudhuri
465
19
0
07 Feb 2024
Less is KEN: a Universal and Simple Non-Parametric Pruning Algorithm for Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Michele Mastromattei
Fabio Massimo Zanzotto
VLM
305
3
0
05 Feb 2024
Recent Advances in Hate Speech Moderation: Multimodality and the Role of Large Models
Ming Shan Hee
Shivam Sharma
Rui Cao
Palash Nandi
Tanmoy Chakraborty
Roy Ka-wei Lee
253
4
0
30 Jan 2024
APT-Pipe: A Prompt-Tuning Tool for Social Data Annotation using ChatGPT
The Web Conference (WWW), 2024
Yiming Zhu
Zhizhuo Yin
Gareth Tyson
Ehsan-ul Haq
Lik-Hang Lee
Pan Hui
ALM
449
16
0
24 Jan 2024