Towards Efficient and Explainable Hate Speech Detection via Model
DistillationEuropean Conference on Information Retrieval (ECIR), 2024 |
Hate Personified: Investigating the role of LLMs in content moderationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 |
HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive
Speech Detection via Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 |