Long Exposure: Accelerating Parameter-Efficient Fine-Tuning for LLMs under Shadowy SparsityInternational Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2024 |
Small transformer architectures for task switchingInternational Conference on Artificial Neural Networks (ICANN), 2025 |
Compression Method for Deep Diagonal State Space Model Based on Optimal ReductionIEEE Control Systems Letters (L-CSS), 2025 |