Scalable Efficient Training of Large Language Models with
Low-dimensional Projected AttentionConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 |
ReLoRA: High-Rank Training Through Low-Rank UpdatesInternational Conference on Learning Representations (ICLR), 2023 |