Title |
---|
Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in Red Teaming GenAI. Ambrish Rawat, Stefan Schoepf, Giulio Zizzo, Giandomenico Cornacchia, Muhammad Zaid Hameed, ..., Elizabeth M. Daly, Mark Purcell, P. Sattigeri, Pin-Yu Chen, Kush R. Varshney |
QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning. Yilun Kong, Hangyu Mao, Qi Zhao, Bin Zhang, Jingqing Ruan, Li Shen, Yongzhe Chang, Xueqian Wang, Rui Zhao, Dacheng Tao |
FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models. Zhongyu Zhao, Menghang Dong, Rongyu Zhang, Wenzhao Zheng, Yunpeng Zhang, Huanrui Yang, Dalong Du, Kurt Keutzer, Shanghang Zhang |
I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm. Yiming Liang, Ge Zhang, Xingwei Qu, Tianyu Zheng, Jiawei Guo, ..., Jiaheng Liu, Chenghua Lin, Lei Ma, Wenhao Huang, Jiajun Zhang |