Title |
---|
![]() Parallel AutoRegressive Models for Multi-Agent Combinatorial Optimization Federico Berto Chuanbo Hua Laurin Luttmann Jiwoo Son Junyoung Park Kyuree Ahn Changhyun Kwon Lin Xie Jinkyoo Park |
![]() LLM Inference Unveiled: Survey and Roofline Model Insights Zhihang Yuan Yuzhang Shang Yang Zhou Zhen Dong Zhe Zhou ...Yong Jae Lee Yan Yan Beidi Chen Guangyu Sun Kurt Keutzer |
![]() Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster
Speculative Decoding Weilin Zhao Yuxiang Huang Xu Han Wang Xu Chaojun Xiao Xinrong Zhang Yewei Fang Kaihuo Zhang Zhiyuan Liu Maosong Sun |