COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 TrainingInternational Conference on Learning Representations (ICLR), 2024 |
SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference AccelerationInternational Conference on Learning Representations (ICLR), 2024 |
CogVideoX: Text-to-Video Diffusion Models with An Expert TransformerInternational Conference on Learning Representations (ICLR), 2024 Zhuoyi Yang Jiayan Teng Wendi Zheng Ming Ding Shiyu Huang ...Weihan Wang Yean Cheng Xiaotao Gu Yuxiao Dong Jie Tang |