Title |
---|
![]() Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster
Speculative Decoding Weilin Zhao Yuxiang Huang Xu Han Wang Xu Chaojun Xiao Xinrong Zhang Yewei Fang Kaihuo Zhang Zhiyuan Liu Maosong Sun |
![]() Bench: Extending Long Context Evaluation Beyond 100K Tokens Xinrong Zhang Yingfa Chen Shengding Hu Zihang Xu Junhao Chen ...Xu Han Zhen Leng Thai Shuo Wang Zhiyuan Liu Maosong Sun |