Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2301.11125
Cited By
A benchmark for toxic comment classification on Civil Comments dataset
26 January 2023
Corentin Duchene
Henri Jamet
Pierre Guillaume
Reda Dehak
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A benchmark for toxic comment classification on Civil Comments dataset"
5 / 5 papers shown
Title
BadFair: Backdoored Fairness Attacks with Group-conditioned Triggers
Jiaqi Xue
Qian Lou
Mengxin Zheng
26
1
0
23 Oct 2024
When is Multicalibration Post-Processing Necessary?
Dutch Hansen
Siddartha Devic
Preetum Nakkiran
Vatsal Sharan
33
4
0
10 Jun 2024
Complexity Matters: Dynamics of Feature Learning in the Presence of Spurious Correlations
GuanWen Qiu
Da Kuang
Surbhi Goel
25
8
0
05 Mar 2024
Measuring Misogyny in Natural Language Generation: Preliminary Results from a Case Study on two Reddit Communities
Aaron J. Snoswell
Lucinda Nelson
Hao Xue
Flora D. Salim
Nicolas Suzor
Jean Burgess
21
2
0
06 Dec 2023
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
Jingfeng Yang
Hongye Jin
Ruixiang Tang
Xiaotian Han
Qizhang Feng
Haoming Jiang
Bing Yin
Xia Hu
LM&MA
125
619
0
26 Apr 2023
1