Title |
---|
![]() DroidSpeak: KV Cache Sharing for Cross-LLM Communication and Multi-LLM
Serving Yuhan Liu Esha Choukse Shan Lu Junchen Jiang Madan Musuvathi ...Yihua Cheng Junchen Jiang Shan Lu Madan Musuvathi Esha Choukse |
![]() Distilling Step-by-Step! Outperforming Larger Language Models with Less
Training Data and Smaller Model Sizes Lokesh Nagalapatti Chun-Liang Li Chih-Kuan Yeh Hootan Nakhost Yasuhisa Fujii Alexander Ratner Ranjay Krishna Chen-Yu Lee Tomas Pfister |