A benchmark for toxic comment classification on Civil Comments dataset

A benchmark for toxic comment classification on Civil Comments dataset

26 January 2023

Corentin Duchene

Pierre Guillaume

Reda Dehak

Papers citing "A benchmark for toxic comment classification on Civil Comments dataset"

5 / 5 papers shown

Title
BadFair: Backdoored Fairness Attacks with Group-conditioned Triggers Jiaqi Xue Qian Lou Mengxin Zheng 26 1 0 23 Oct 2024
When is Multicalibration Post-Processing Necessary? Dutch Hansen Siddartha Devic Preetum Nakkiran Vatsal Sharan 33 4 0 10 Jun 2024
Complexity Matters: Dynamics of Feature Learning in the Presence of Spurious Correlations GuanWen Qiu Da Kuang Surbhi Goel 25 8 0 05 Mar 2024
Measuring Misogyny in Natural Language Generation: Preliminary Results from a Case Study on two Reddit Communities Aaron J. Snoswell Lucinda Nelson Hao Xue Flora D. Salim Nicolas Suzor Jean Burgess 21 2 0 06 Dec 2023
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond Jingfeng Yang Hongye Jin Ruixiang Tang Xiaotian Han Qizhang Feng Haoming Jiang Bing Yin Xia Hu LM&MA 125 619 0 26 Apr 2023