RAT: Bridging RNN Efficiency and Attention Accuracy via Chunk-based Sequence ModelingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025 |
AutoMixer: Checkpoint Artifacts as Automatic Data MixersAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Revisiting LoRA through the Lens of Parameter Redundancy: Spectral Encoding HelpsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Understand the Implication: Learning to Think for Pragmatic UnderstandingAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Just Go Parallel: Improving the Multilingual Capabilities of Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |