
![]() Dynamic Low-Rank Sparse Adaptation for Large Language ModelsInternational Conference on Learning Representations (ICLR), 2025 |
![]() How Redundant Is the Transformer Stack in Speech Representation Models?IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |
![]() Layer-wise Importance Matters: Less Memory for Better Performance in
Parameter-efficient Fine-tuning of Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 |
![]() f-Divergence Minimization for Sequence-Level Knowledge DistillationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 |