From BERT to Qwen: Hate Detection across architectures

14 July 2025

Ariadna Mon

Saúl Fenollosa

Jon Lecumberri

ArXiv (abs)PDF HTML

Main:3 Pages

5 Figures

Bibliography:1 Pages

1 Tables

Abstract

Online platforms struggle to curb hate speech without over-censoring legitimate discourse. Early bidirectional transformer encoders made big strides, but the arrival of ultra-large autoregressive LLMs promises deeper context-awareness. Whether this extra scale actually improves practical hate-speech detection on real-world text remains unverified. Our study puts this question to the test by benchmarking both model families, classic encoders and next-generation LLMs, on curated corpora of online interactions for hate-speech detection (Hate or No Hate).

View on arXiv

Comments on this paper