Improved Training of Mixture-of-Experts Language GANsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 |
Guiding Teacher Forcing with Seer Forcing for Neural Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021 |