Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws. International Conference on Machine Learning (ICML), 2023
An Empirical Study of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning. IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2023