An Efficient Transformer Decoder with Compressed Sub-layersAAAI Conference on Artificial Intelligence (AAAI), 2021 |
Weight Distillation: Transferring the Knowledge in Neural Network
ParametersAnnual Meeting of the Association for Computational Linguistics (ACL), 2020 |
Towards Fully 8-bit Integer Inference for the Transformer ModelInternational Joint Conference on Artificial Intelligence (IJCAI), 2020 |
Neural Machine Translation: Challenges, Progress and FutureScience China Technological Sciences (Sci China Technol Sci), 2020 |