arXiv:2311.09993
Generative AI for Hate Speech Detection: Evaluation and Findings
16 November 2023
Sagi Pendzel
Tomer Wullach
Amir Adler
Einat Minkov
Papers citing "Generative AI for Hate Speech Detection: Evaluation and Findings"
7 / 7 papers shown
Evolving Hate Speech Online: An Adaptive Framework for Detection and Mitigation
Shiza Ali, Jeremy Blackburn, Gianluca Stringhini
54 | 0 | 0
24 Feb 2025

Re-examining Sexism and Misogyny Classification with Annotator Attitudes
Aiqi Jiang, Nikolas Vitsakis, Tanvi Dinkar, Gavin Abercrombie, Ioannis Konstas
29 | 0 | 0
04 Oct 2024

Watching the Watchers: A Comparative Fairness Audit of Cloud-based Content Moderation Services
David Hartmann, Amin Oueslati, Dimitri Staufer
MLAU
27 | 1 | 0
20 Jun 2024

HateTinyLLM: Hate Speech Detection Using Tiny Large Language Models
Tanmay Sen, Ansuman Das, Mrinmay Sen
26 | 4 | 0
26 Apr 2024

What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection
Shangbin Feng, Herun Wan, Ningnan Wang, Zhaoxuan Tan, Minnan Luo, Yulia Tsvetkov
AAML, DeLMO
14 | 14 | 0
01 Feb 2024

Assessing the impact of contextual information in hate speech detection
Juan Manuel Pérez, Franco Luque, Demián Zayat, Martín Kondratzky, Agustín Moro, ..., Joaquín Zajac, Paula Miguel, Natalia Debandi, Agustin Gravano, Viviana Cotik
19 | 29 | 0
02 Oct 2022

Training language models to follow instructions with human feedback
Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, ..., Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, Ryan J. Lowe
OSLM, ALM
301 | 11,730 | 0
04 Mar 2022