Safety Assessment of Chinese Large Language Models

20 April 2023

Jiale Cheng

Papers citing "Safety Assessment of Chinese Large Language Models"

9 / 59 papers shown

RoCar: A Relationship Network-based Evaluation Method to Large Language Models

Shi Feng

29 Jul 2023

MediaGPT : A Large Language Model For Chinese Media

Haiying Deng

234

20 Jul 2023

CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility

...

Ji Zhang

Jingren Zhou

256

19 Jul 2023

BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference DatasetNeural Information Processing Systems (NeurIPS), 2023

Jiaming Ji

Juntao Dai

Chi Zhang

Chi Zhang

400

718

10 Jul 2023

PromptRobust: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts

Hao Chen

...

Yue Zhang

430

209

07 Jun 2023

Attention Paper: How Generative AI Reshapes Digital Shadow Industry?ACM Turing Celebration Conference (TC), 2023

...

180

26 May 2023

355

22 May 2023

Editing Large Language Models: Problems, Methods, and OpportunitiesConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Peng Wang

Shumin Deng

Huajun Chen

Ningyu Zhang

KELM

330

395

22 May 2023

A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and ValidationArtificial Intelligence Review (AIR), 2023

...

351

146

19 May 2023