Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.20968
Cited By
Beware of Your Po! Measuring and Mitigating AI Safety Risks in Role-Play Fine-Tuning of LLMs
28 February 2025
Weixiang Zhao
Yulin Hu
Yang Deng
Jiahe Guo
Xingyu Sui
Xinyang Han
An Zhang
Yanyan Zhao
Bing Qin
Tat-Seng Chua
Ting Liu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Beware of Your Po! Measuring and Mitigating AI Safety Risks in Role-Play Fine-Tuning of LLMs"
1 / 1 papers shown
Title
IMPersona: Evaluating Individual Level LM Impersonation
Quan Shi
Carlos E. Jimenez
Stephen Dong
Brian Seo
Caden Yao
Adam Kelch
Karthik Narasimhan
21
0
0
06 Apr 2025
1