Compressive Transformers for Long-Range Sequence Modelling. International Conference on Learning Representations (ICLR), 2019 |
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. Journal of Machine Learning Research (JMLR), 2019 |