COLD: A Benchmark for Chinese Offensive Language Detection

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
16 January 2022
Jiawen Deng
Jingyan Zhou
Hao Sun
Chujie Zheng
Fei Mi
Helen M. Meng
Shiyu Huang
arXiv: 2201.06025

Papers citing "COLD: A Benchmark for Chinese Offensive Language Detection"

50 / 56 papers shown
Investigating the Impact of Rationales for LLMs on Natural Language Understanding
Wenhang Shi
Shuqing Bian
Yiren Chen
Xinyi Zhang
Zhe Zhao
Pengfei Hu
Wei Lu
Xiaoyong Du
ReLM LRM
84
0
0
19 Oct 2025
From Ground Trust to Truth: Disparities in Offensive Language Judgments on Contemporary Korean Political Discourse
Seunguk Yu
Jungmin Yun
Jinhee Jang
Youngbin Kim
134
1
0
18 Sep 2025
Social Bias in Multilingual Language Models: A Survey
Lance Calvin Lim Gamboa
Yue Feng
Mark Lee
252
0
0
27 Aug 2025
Can NLP Tackle Hate Speech in the Real World? Stakeholder-Informed Feedback and Survey on Counterspeech
Tanvi Dinkar
Aiqi Jiang
Simona Frenda
Poppy Gerrard-Abbott
Nancie Gunson
Gavin Abercrombie
Ioannis Konstas
110
0
0
06 Aug 2025
MMBERT: Scaled Mixture-of-Experts Multimodal BERT for Robust Chinese Hate Speech Detection under Cloaking Perturbations
Qiyao Xue
Yuchen Dou
Ryan Shi
Xiang Li
Wei Gao
MoE
133
1
0
01 Aug 2025
Culture Matters in Toxic Language Detection in Persian
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Zahra Bokaei
Walid Magdy
Bonnie Webber
140
0
0
03 Jun 2025
Unified Game Moderation: Soft-Prompting and LLM-Assisted Label Transfer for Resource-Efficient Toxicity Detection
Zachary Yang
Domenico Tullo
Reihaneh Rabbany
85
3
0
01 Jun 2025
The Hidden Language of Harm: Examining the Role of Emojis in Harmful Online Communication and Content Moderation
Yuhang Zhou
Yimin Xiao
Wei Ai
Ge Gao
181
0
0
31 May 2025
Exploring Multimodal Challenges in Toxic Chinese Detection: Taxonomy, Benchmark, and Findings
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Shujian Yang
Shiyao Cui
Chuanrui Hu
Huaimin Wang
Tianwei Zhang
Minlie Huang
Jialiang Lu
Han Qiu
178
7
0
30 May 2025
Chinese Cyberbullying Detection: Dataset, Method, and Validation
Yi Zhu
Xin Zou
Xindong Wu
235
0
0
27 May 2025
Chinese Toxic Language Mitigation via Sentiment Polarity Consistent Rewrites
Xintong Wang
Yixiao Liu
Jingheng Pan
Liang Ding
Longyue Wang
Chris Biemann
177
0
0
21 May 2025
LLM-C3MOD: A Human-LLM Collaborative System for Cross-Cultural Hate Speech Moderation
Junyeong Park
Seogyeong Jeong
Siyang Song
Yohan Lee
Alice Oh
247
3
0
10 Mar 2025
U-Sticker: A Large-Scale Multi-Domain User Sticker Dataset for Retrieval and Personalization
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Heng Er Metilda Chee
Jiayin Wang
Zhiqiang Guo
Weizhi Ma
Qinglang Guo
Min Zhang
321
1
0
26 Feb 2025
SafeDialBench: A Fine-Grained Safety Benchmark for Large Language Models in Multi-Turn Dialogues with Diverse Jailbreak Attacks
Hongye Cao
Yanming Wang
Sijia Jing
Ziyue Peng
Zhixin Bai
...
Yang Gao
Fanyu Meng
Xi Yang
Chao Deng
Junlan Feng
AAML
490
14
0
16 Feb 2025
SCCD: A Session-based Dataset for Chinese Cyberbullying Detection
International Conference on Computational Linguistics (COLING), 2025
Qingpo Yang
Yakai Chen
Zihui Xu
Yu-ming Shang
Sanchuan Guo
Xi Zhang
292
5
0
28 Jan 2025
ChineseWebText 2.0: Large-Scale High-quality Chinese Web Text with Multi-dimensional and fine-grained information
Wanyue Zhang
Ziyong Li
Wen Yang
Chunlin Leng
Yinan Bai
Qianlong Du
Chengqing Zong
Jiajun Zhang
258
1
0
29 Nov 2024
LongSafety: Enhance Safety for Long-Context LLMs
Mianqiu Huang
Xiaoran Liu
Shaojun Zhou
Mozhi Zhang
Chenkun Tan
...
Zhikai Lei
Linlin Li
Qiang Liu
Yaqian Zhou
Jiaqi Leng
ELM ALM
292
0
0
11 Nov 2024
DeMod: A Holistic Tool with Explainable Detection and Personalized Modification for Toxicity Censorship
Yiming Li
Peng Zhang
Hansu Gu
Tun Lu
Siyuan Qiao
Yubo Shu
Y. Shao
Ning Gu
224
7
0
04 Nov 2024
PclGPT: A Large Language Model for Patronizing and Condescending Language Detection
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Hongbo Wang
Mingda Li
Junyu Lu
Hebin Xia
Liang Yang
Bo Xu
Ruizhu Liu
Hongfei Lin
150
3
0
01 Oct 2024
Edu-Values: Towards Evaluating the Chinese Education Values of Large Language Models
The Web Conference (WWW), 2024
Peiyi Zhang
Yazhou Zhang
Bo Wang
Lu Rong
Jing Qin
Jing Qin
AI4Ed ELM
373
6
0
19 Sep 2024
MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube and Bilibili
ACM Multimedia (MM), 2024
Han Wang
Tan Rui Yang
Usman Naseem
Roy Ka-wei Lee
267
26
0
28 Jul 2024
Purple-teaming LLMs with Adversarial Defender Training
Jingyan Zhou
Kun Li
Junan Li
Jiawen Kang
Minda Hu
Xixin Wu
Helen Meng
AAML
225
1
0
01 Jul 2024
Evaluating Implicit Bias in Large Language Models by Attacking From a Psychometric Perspective
Yuchen Wen
Keping Bi
Wei Chen
Jiafeng Guo
Xueqi Cheng
520
6
0
20 Jun 2024
Quite Good, but Not Enough: Nationality Bias in Large Language Models -- A Case Study of ChatGPT
Shucheng Zhu
Weikang Wang
Ying Liu
263
19
0
11 May 2024
SGHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Singapore
Ri Chi Ng
Nirmalendu Prakash
Ming Shan Hee
K. T. W. Choo
Roy Ka-wei Lee
217
16
0
03 May 2024
Chinese Offensive Language Detection: Current Status and Future Directions
Yunze Xiao
Houda Bouamor
Wajdi Zaghouani
373
3
0
27 Mar 2024
OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and Safety
Chuang Liu
Linhao Yu
Jiaxuan Li
Renren Jin
Yufei Huang
...
Tao Liu
Jinwang Song
Hongying Zan
Sun Li
Deyi Xiong
ELM
332
13
0
18 Mar 2024
Collaborative decoding of critical tokens for boosting factuality of large language models
Lifeng Jin
Baolin Peng
Linfeng Song
Haitao Mi
Ye Tian
Dong Yu
HILM
154
8
0
28 Feb 2024
ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors
Zhexin Zhang
Yida Lu
Jingyuan Ma
Di Zhang
Rui Li
...
Hao Sun
Lei Sha
Zhifang Sui
Hongning Wang
Shiyu Huang
129
47
0
26 Feb 2024
Social Orientation: A New Feature for Dialogue Analysis
Todd Morrill
Zhaoyuan Deng
Yanda Chen
Amith Ananthram
Colin Wayne Leach
Kathleen McKeown
253
4
0
26 Feb 2024
Cross-lingual Offensive Language Detection: A Systematic Review of Datasets, Transfer Approaches and Challenges
Aiqi Jiang
A. Zubiaga
AAML
303
7
0
17 Jan 2024
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems
Tianyu Cui
Yanling Wang
Chuanpu Fu
Yong Xiao
Sijia Li
...
Junwu Xiong
Xinyu Kong
ZuJie Wen
Ke Xu
Qi Li
321
99
0
11 Jan 2024
A Survey of the Evolution of Language Model-Based Dialogue Systems: Data, Task and Models
Hongru Wang
Lingzhi Wang
Yiming Du
Liang Chen
Jing Zhou
Yufei Wang
Kam-Fai Wong
LRM
452
23
0
28 Nov 2023
Can Large Language Models Understand Content and Propagation for Misinformation Detection: An Empirical Study
Mengyang Chen
Lingwei Wei
Han Cao
Wei Zhou
Song Hu
136
6
0
21 Nov 2023
Flames: Benchmarking Value Alignment of LLMs in Chinese
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Kexin Huang
Xiangyang Liu
Qianyu Guo
Tianxiang Sun
Jiawei Sun
...
Yixu Wang
Yan Teng
Xipeng Qiu
Yingchun Wang
Dahua Lin
ALM
412
30
0
12 Nov 2023
Self-Guard: Empower the LLM to Safeguard Itself
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Zezhong Wang
Fangkai Yang
Lu Wang
Lu Wang
Hongru Wang
Liang Chen
Qingwei Lin
Kam-Fai Wong
270
57
0
24 Oct 2023
The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64 Languages
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Chiyu Zhang
Khai Duy Doan
Qisheng Liao
Muhammad Abdul-Mageed
247
8
0
23 Oct 2023
Cultural Compass: Predicting Transfer Learning Success in Offensive Language Detection with Cultural Features
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Li Zhou
Antonia Karamolegkou
Wenyu Chen
Daniel Hershcovich
225
20
0
10 Oct 2023
Adapting Large Language Models for Content Moderation: Pitfalls in Data Engineering and Supervised Fine-tuning
Huan Ma
Changqing Zhang
Huazhu Fu
Peilin Zhao
Bing Wu
OffRL AI4MH
336
32
0
05 Oct 2023
Large Language Model Alignment: A Survey
Shangda Wu
Renren Jin
Yufei Huang
Chuang Liu
Weilong Dong
Zishan Guo
Xinwei Wu
Yan Liu
Deyi Xiong
LM&MA
359
282
0
26 Sep 2023
SafetyBench: Evaluating the Safety of Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Zhexin Zhang
Leqi Lei
Lindong Wu
Rui Sun
Yongkang Huang
Chong Long
Xiao Liu
Xuanyu Lei
Jie Tang
Shiyu Huang
LRM LM&MA ELM
304
169
0
13 Sep 2023
Exploring Cross-Cultural Differences in English Hate Speech Annotations: From Dataset Construction to Analysis
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Nayeon Lee
Chani Jung
Jun-Hee Myung
Jiho Jin
Jose Camacho-Collados
Juho Kim
Alice Oh
314
42
0
31 Aug 2023
Rethinking Machine Ethics -- Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?
Jingyan Zhou
Minda Hu
Junan Li
Xiaoying Zhang
Xixin Wu
Irwin King
Helen M. Meng
LRM
282
38
0
29 Aug 2023
Enhancing Psychological Counseling with Large Language Model: A Multifaceted Decision-Support System for Non-Professionals
Guanghui Fu
Qing Zhao
Jianqiang Li
Dan Luo
Changwei Song
...
Fan Wang
Yan Wang
Lijuan Cheng
Juan Zhang
B. Yang
OffRL
233
43
0
29 Aug 2023
CLEVA: Chinese Language Models EVAluation Platform
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yanyang Li
Jianqiao Zhao
Duo Zheng
Zi-Yuan Hu
Zhi Chen
...
Yongfeng Huang
Shijia Huang
Dahua Lin
Michael R. Lyu
Liwei Wang
ALM ELM
322
16
0
09 Aug 2023
Classifying Crime Types using Judgment Documents from Social Media
Haoxuan Xu
Zeyu He
Mengfan Shen
Songning Lai
Ziqiang Han
Yifan Peng
269
0
0
29 Jun 2023
CBBQ: A Chinese Bias Benchmark Dataset Curated with Human-AI Collaboration for Large Language Models
International Conference on Language Resources and Evaluation (LREC), 2023
Yufei Huang
Deyi Xiong
ALM
302
24
0
28 Jun 2023
KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Application
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Hwaran Lee
Seokhee Hong
Joonsuk Park
Takyoung Kim
Gunhee Kim
Jung-Woo Ha
361
34
0
28 May 2023
Improved Instruction Ordering in Recipe-Grounded Conversation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Duong Minh Le
Ruohao Guo
Wei Xu
Alan Ritter
225
10
0
26 May 2023
Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmarks
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Junyu Lu
Bo Xu
Xiaokun Zhang
C. Min
Liang Yang
Hongfei Lin
165
54
0
08 May 2023