Title |
---|
Effective Long-Context Scaling of Foundation Models. Wenhan Xiong, Jingyu Liu, Igor Molybog, Hejia Zhang, Prajjwal Bhargava, ... Dániel Baráth, Sergey Edunov, Mike Lewis, Sinong Wang, Hao Ma |
CoLT5: Faster Long-Range Transformers with Conditional Computation. Joshua Ainslie, Tao Lei, Michiel de Jong, Santiago Ontañón, Siddhartha Brahma, ... Mandy Guo, James Lee-Thorp, Yi Tay, Yun-hsuan Sung, Sumit Sanghai |