ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.10436
  4. Cited By
Safety Assessment of Chinese Large Language Models

Safety Assessment of Chinese Large Language Models

20 April 2023
Hao Sun
Zhexin Zhang
Jiawen Deng
Jiale Cheng
Shiyu Huang
    ALMELM
ArXiv (abs)PDFHTML

Papers citing "Safety Assessment of Chinese Large Language Models"

9 / 59 papers shown
RoCar: A Relationship Network-based Evaluation Method to Large Language
  Models
RoCar: A Relationship Network-based Evaluation Method to Large Language Models
Ming Wang
Wenfang Wu
Chongyun Gao
Daling Wang
Shi Feng
Yifei Zhang
73
0
0
29 Jul 2023
MediaGPT : A Large Language Model For Chinese Media
MediaGPT : A Large Language Model For Chinese Media
Zhonghao Wang
Zijia Lu
Boshen Jin
Haiying Deng
LM&MA
234
1
0
20 Jul 2023
CValues: Measuring the Values of Chinese Large Language Models from
  Safety to Responsibility
CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility
Guohai Xu
Jiayi Liu
Mingshi Yan
Haotian Xu
Jinghui Si
...
Rong Zhang
Ji Zhang
Chao Peng
Feiyan Huang
Jingren Zhou
ALMELM
256
97
0
19 Jul 2023
BeaverTails: Towards Improved Safety Alignment of LLM via a
  Human-Preference Dataset
BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference DatasetNeural Information Processing Systems (NeurIPS), 2023
Jiaming Ji
Mickel Liu
Juntao Dai
Xuehai Pan
Chi Zhang
Ce Bian
Chi Zhang
Ruiyang Sun
Yizhou Wang
Yaodong Yang
ALM
400
718
0
10 Jul 2023
PromptRobust: Towards Evaluating the Robustness of Large Language Models
  on Adversarial Prompts
PromptRobust: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts
Lingyao Li
Yongfeng Zhang
Jiaheng Zhou
Zichen Wang
Hao Chen
...
Linyi Yang
Weirong Ye
Yue Zhang
Neil Zhenqiang Gong
Xingxu Xie
SILM
430
209
0
07 Jun 2023
Attention Paper: How Generative AI Reshapes Digital Shadow Industry?
Attention Paper: How Generative AI Reshapes Digital Shadow Industry?ACM Turing Celebration Conference (TC), 2023
Qichao Wang
Huan Ma
Wen-Ke Wei
Hang Li
Liang Chen
...
Binwen Zhao
Bo Hu
Shu Zhen Zhang
Zibin Zheng
Bing Wu
180
1
0
26 May 2023
Fairness of ChatGPT
Fairness of ChatGPT
Yunqi Li
Lanjing Zhang
Zelong Li
355
26
0
22 May 2023
Editing Large Language Models: Problems, Methods, and Opportunities
Editing Large Language Models: Problems, Methods, and OpportunitiesConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yunzhi Yao
Peng Wang
Bo Tian
Shuyang Cheng
Zhoubo Li
Shumin Deng
Huajun Chen
Ningyu Zhang
KELM
330
395
0
22 May 2023
A Survey of Safety and Trustworthiness of Large Language Models through
  the Lens of Verification and Validation
A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and ValidationArtificial Intelligence Review (AIR), 2023
Xiaowei Huang
Wenjie Ruan
Wei Huang
Gao Jin
Yizhen Dong
...
Sihao Wu
Peipei Xu
Dengyu Wu
André Freitas
Mustafa A. Mustafa
ALM
351
146
0
19 May 2023
Previous
12