How to Train Long-Context Language Models (Effectively)Annual Meeting of the Association for Computational Linguistics (ACL), 2024 |
MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative DecodingInternational Conference on Learning Representations (ICLR), 2024 Jian Chen Vashisth Tiwari Ranajoy Sadhukhan Zhuoming Chen Jinyuan Shi Ian En-Hsu Yen Ian En-Hsu Yen Avner May Tianqi Chen Beidi Chen |