Robust Hate Speech Detection in Social Media: A Cross-Dataset Empirical Evaluation

4 July 2023

Papers citing "Robust Hate Speech Detection in Social Media: A Cross-Dataset Empirical Evaluation"

12 / 12 papers shown

Title
Measuring Online Hate on 4chan using Pre-trained Deep Learning Models Adrian Bermudez-Villalva M. Mehrnezhad Ehsan Toreini 35 0 0 30 Mar 2025
DefVerify: Do Hate Speech Models Reflect Their Dataset's Definition? Urja Khurana Eric T. Nalisnick Antske Fokkens 36 1 0 21 Oct 2024
ToxiCraft: A Novel Framework for Synthetic Generation of Harmful Information Zheng Hui Zhaoxiao Guo Hang Zhao Juanyong Duan Congrui Huang 15 6 0 23 Sep 2024
The Unappreciated Role of Intent in Algorithmic Moderation of Social Media Content Xinyu Wang S. Koneru Pranav Narayanan Venkit Brett Frischmann Sarah Rajtmajer 16 0 0 17 May 2024
Probing Critical Learning Dynamics of PLMs for Hate Speech Detection Sarah Masud Mohammad Aflah Khan Vikram Goyal Md. Shad Akhtar Tanmoy Chakraborty 8 0 0 03 Feb 2024
Generative AI for Hate Speech Detection: Evaluation and Findings Sagi Pendzel Tomer Wullach Amir Adler Einat Minkov 16 11 0 16 Nov 2023
LLMs and Finetuning: Benchmarking cross-domain performance for hate speech detection Ahmad Nasir Aadish Sharma Kokil Jaidka Saifuddin Ahmed 22 3 0 29 Oct 2023
Causality Guided Disentanglement for Cross-Platform Hate Speech Detection Paras Sheth Tharindu Kumarage Raha Moraffah Amanat Chadha Huan Liu 13 7 0 03 Aug 2023
DoDo Learning: DOmain-DemOgraphic Transfer in Language Models for Detecting Abuse Targeted at Public Figures Angus R. Williams Hannah Rose Kirk L. Burke Yi-Ling Chung Ivan Debono Pica Johansson Francesca Stevens Jonathan Bright Scott A. Hale 16 1 0 31 Jul 2023
HateModerate: Testing Hate Speech Detectors against Content Moderation Policies Jiangrui Zheng Xueqing Liu Guanqun Yang Mirazul Haque Xing Qian Ravishka Rathnasuriya Wei Yang G. Budhrani 22 3 0 23 Jul 2023
Twitter Topic Classification Dimosthenis Antypas Asahi Ushio Jose Camacho-Collados Leonardo Neves Vítor Silva Francesco Barbieri 13 31 0 20 Sep 2022
Overview of the HASOC track at FIRE 2020: Hate Speech and Offensive Content Identification in Indo-European Languages Thomas Mandl Sandip J Modha Gautam Kishore Shahi Prasenjit Majumder Mohana Dave Daksh Patel Chintak Mandalia Aditya Patel 80 172 0 12 Aug 2021