
Title |
|---|
![]() Targeted Vaccine: Safety Alignment for Large Language Models against Harmful Fine-Tuning via Layer-wise PerturbationIEEE Transactions on Information Forensics and Security (IEEE TIFS), 2024 |
![]() Towards a Unified View of Preference Learning for Large Language Models:
A Survey Bofei Gao Feifan Song Yibo Miao Zefan Cai Zhiyong Yang ...Houfeng Wang Zhifang Sui Peiyi Wang Baobao Chang Baobao Chang |