Generative AI for Hate Speech Detection: Evaluation and Findings

16 November 2023

Papers citing "Generative AI for Hate Speech Detection: Evaluation and Findings"

7 / 7 papers shown

Title
Evolving Hate Speech Online: An Adaptive Framework for Detection and Mitigation Shiza Ali Jeremy Blackburn Gianluca Stringhini 54 0 0 24 Feb 2025
Re-examining Sexism and Misogyny Classification with Annotator Attitudes Aiqi Jiang Nikolas Vitsakis Tanvi Dinkar Gavin Abercrombie Ioannis Konstas 29 0 0 04 Oct 2024
Watching the Watchers: A Comparative Fairness Audit of Cloud-based Content Moderation Services David Hartmann Amin Oueslati Dimitri Staufer MLAU 27 1 0 20 Jun 2024
HateTinyLLM : Hate Speech Detection Using Tiny Large Language Models Tanmay Sen Ansuman Das Mrinmay Sen 26 4 0 26 Apr 2024
What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection Shangbin Feng Herun Wan Ningnan Wang Zhaoxuan Tan Minnan Luo Yulia Tsvetkov AAML DeLMO 14 14 0 01 Feb 2024
Assessing the impact of contextual information in hate speech detection Juan Manuel Pérez Franco Luque Demián Zayat Martín Kondratzky Agustín Moro ... Joaquín Zajac Paula Miguel Natalia Debandi Agustin Gravano Viviana Cotik 19 29 0 02 Oct 2022
Training language models to follow instructions with human feedback Long Ouyang Jeff Wu Xu Jiang Diogo Almeida Carroll L. Wainwright ... Amanda Askell Peter Welinder Paul Christiano Jan Leike Ryan J. Lowe OSLM ALM 301 11,730 0 04 Mar 2022