Investigating Recurrent Transformers with Dynamic Halt Jishnu Ray Chowdhury Cornelia Caragea |
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for
Programming LanguagesAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 |
Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple
TasksAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 |
Rewiring the Transformer with Depth-Wise LSTMsInternational Conference on Language Resources and Evaluation (LREC), 2020 |