Title |
---|
ReLU² Wins: Discovering Efficient Activation Functions for Sparse LLMs. Zhengyan Zhang, Yixin Song, Guanghui Yu, Xu Han, Yankai Lin, Chaojun Xiao, Chenyang Song, Zhiyuan Liu, Zeyu Mi, Maosong Sun |
Variator: Accelerating Pre-trained Models with Plug-and-Play Compression Modules. Chaojun Xiao, Yuqi Luo, Wenbin Zhang, Pengle Zhang, Xu Han, ..., Zhengyan Zhang, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie Zhou |
Emergent Modularity in Pre-trained Transformers. Zhengyan Zhang, Zhiyuan Zeng, Yankai Lin, Chaojun Xiao, Xiaozhi Wang, Xu Han, Zhiyuan Liu, Ruobing Xie, Maosong Sun, Jie Zhou |