GlotLID: Language Identification for Low-Resource Languages | Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023 |
LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages | Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023 |
Transfer to a Low-Resource Language via Close Relatives: The Case Study on Faroese | Nordic Conference of Computational Linguistics (NODALIDA), 2023 |
PALI: A Language Identification Benchmark for Perso-Arabic Scripts | Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial), 2023 |
Geographic Adaptation of Pretrained Language Models | Transactions of the Association for Computational Linguistics (TACL), 2022 |
A reproduction of Apple's bi-directional LSTM models for language identification in short strings | Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021 |