Quality Does Matter: A Detailed Look at the Quality and Utility of
Web-Mined Parallel CorporaConference of the European Chapter of the Association for Computational Linguistics (EACL), 2024 |
Data Augmentation to Address Out-of-Vocabulary Problem in Low-Resource
Sinhala-English Neural Machine TranslationPacific Asia Conference on Language, Information and Computation (PACLIC), 2022 |
Samanantar: The Largest Publicly Available Parallel Corpora Collection
for 11 Indic LanguagesTransactions of the Association for Computational Linguistics (TACL), 2021 |