
Title |
|---|
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA. International Conference on Learning Representations (ICLR), 2024 |
MEMORY-VQ: Compression for Tractable Internet-Scale Memory. North American Chapter of the Association for Computational Linguistics (NAACL), 2023. Yury Zemlyanskiy, Michiel de Jong, Luke Vilnis, Santiago Ontañón, William W. Cohen, Sumit Sanghai, Joshua Ainslie |
Pre-computed memory or on-the-fly encoding? A hybrid approach to retrieval augmentation makes the most of your compute. International Conference on Machine Learning (ICML), 2023. Michiel de Jong, Yury Zemlyanskiy, Nicholas FitzGerald, Joshua Ainslie, Sumit Sanghai, Fei Sha, William W. Cohen |