
Title |
|---|
![]() Learning from Students: Applying t-Distributions to Explore Accurate and
Efficient Formats for LLMsInternational Conference on Machine Learning (ICML), 2024 |
![]() Local Masking Meets Progressive Freezing: Crafting Efficient Vision
Transformers for Self-Supervised LearningInternational Conference on Machine Vision (ICMV), 2023 |
![]() How to Capture Higher-order Correlations? Generalizing Matrix Softmax
Attention to Kronecker ComputationInternational Conference on Learning Representations (ICLR), 2023 |