Title |
---|
![]() CFSP: An Efficient Structured Pruning Framework for LLMs with
Coarse-to-Fine Activation Information Yuxin Wang Minghua Ma Zekun Wang Jingchang Chen Huiming Fan Liping Shan Qing Yang Dongliang Xu Ming Liu Bing Qin |
![]() LLM Inference Unveiled: Survey and Roofline Model Insights Zhihang Yuan Yuzhang Shang Yang Zhou Zhen Dong Zhe Zhou ...Yong Jae Lee Yan Yan Beidi Chen Guangyu Sun Kurt Keutzer |