Title |
---|
![]() LLM Inference Unveiled: Survey and Roofline Model Insights Zhihang Yuan Yuzhang Shang Yang Zhou Zhen Dong Zhe Zhou ...Yong Jae Lee Yan Yan Beidi Chen Guangyu Sun Kurt Keutzer |
![]() Recursion in Recursion: Two-Level Nested Recursion for Length
Generalization with Scalability Jishnu Ray Chowdhury Cornelia Caragea |
![]() Variator: Accelerating Pre-trained Models with Plug-and-Play Compression
Modules Chaojun Xiao Yuqi Luo Wenbin Zhang Pengle Zhang Xu Han ...Zhengyan Zhang Ruobing Xie Zhiyuan Liu Maosong Sun Jie Zhou |