
Title |
|---|
![]() HOT: Hadamard-based Optimized TrainingComputer Vision and Pattern Recognition (CVPR), 2025 |
![]() BATON: Enhancing Batch-wise Inference Efficiency for Large Language
Models via Dynamic Re-batchingThe Web Conference (WWW), 2024 |
![]() Token-Scaled Logit Distillation for Ternary Weight Generative Language
ModelsNeural Information Processing Systems (NeurIPS), 2023 |