Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.01680
Cited By
Robust Hate Speech Detection in Social Media: A Cross-Dataset Empirical Evaluation
4 July 2023
Dimosthenis Antypas
Jose Camacho-Collados
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Robust Hate Speech Detection in Social Media: A Cross-Dataset Empirical Evaluation"
12 / 12 papers shown
Title
Measuring Online Hate on 4chan using Pre-trained Deep Learning Models
Adrian Bermudez-Villalva
M. Mehrnezhad
Ehsan Toreini
35
0
0
30 Mar 2025
DefVerify: Do Hate Speech Models Reflect Their Dataset's Definition?
Urja Khurana
Eric T. Nalisnick
Antske Fokkens
36
1
0
21 Oct 2024
ToxiCraft: A Novel Framework for Synthetic Generation of Harmful Information
Zheng Hui
Zhaoxiao Guo
Hang Zhao
Juanyong Duan
Congrui Huang
15
6
0
23 Sep 2024
The Unappreciated Role of Intent in Algorithmic Moderation of Social Media Content
Xinyu Wang
S. Koneru
Pranav Narayanan Venkit
Brett Frischmann
Sarah Rajtmajer
16
0
0
17 May 2024
Probing Critical Learning Dynamics of PLMs for Hate Speech Detection
Sarah Masud
Mohammad Aflah Khan
Vikram Goyal
Md. Shad Akhtar
Tanmoy Chakraborty
8
0
0
03 Feb 2024
Generative AI for Hate Speech Detection: Evaluation and Findings
Sagi Pendzel
Tomer Wullach
Amir Adler
Einat Minkov
16
11
0
16 Nov 2023
LLMs and Finetuning: Benchmarking cross-domain performance for hate speech detection
Ahmad Nasir
Aadish Sharma
Kokil Jaidka
Saifuddin Ahmed
22
3
0
29 Oct 2023
Causality Guided Disentanglement for Cross-Platform Hate Speech Detection
Paras Sheth
Tharindu Kumarage
Raha Moraffah
Amanat Chadha
Huan Liu
13
7
0
03 Aug 2023
DoDo Learning: DOmain-DemOgraphic Transfer in Language Models for Detecting Abuse Targeted at Public Figures
Angus R. Williams
Hannah Rose Kirk
L. Burke
Yi-Ling Chung
Ivan Debono
Pica Johansson
Francesca Stevens
Jonathan Bright
Scott A. Hale
16
1
0
31 Jul 2023
HateModerate: Testing Hate Speech Detectors against Content Moderation Policies
Jiangrui Zheng
Xueqing Liu
Guanqun Yang
Mirazul Haque
Xing Qian
Ravishka Rathnasuriya
Wei Yang
G. Budhrani
22
3
0
23 Jul 2023
Twitter Topic Classification
Dimosthenis Antypas
Asahi Ushio
Jose Camacho-Collados
Leonardo Neves
Vítor Silva
Francesco Barbieri
13
31
0
20 Sep 2022
Overview of the HASOC track at FIRE 2020: Hate Speech and Offensive Content Identification in Indo-European Languages
Thomas Mandl
Sandip J Modha
Gautam Kishore Shahi
Prasenjit Majumder
Mohana Dave
Daksh Patel
Chintak Mandalia
Aditya Patel
80
172
0
12 Aug 2021
1