Title |
---|
ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models. Chenyang Song, Xu Han, Zhengyan Zhang, Shengding Hu, Xiyu Shi, ... Chen Chen, Zhiyuan Liu, Guanglin Li, Tao Yang, Maosong Sun |
Is It a Free Lunch for Removing Outliers during Pretraining? Baohao Liao, Christof Monz |