
Title |
|---|
![]() The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual DatasetNeural Information Processing Systems (NeurIPS), 2023 |
![]() ClArTTS: An Open-Source Classical Arabic Text-to-Speech CorpusInterspeech (Interspeech), 2023 |
![]() In What Languages are Generative Language Models the Most Formal?
Analyzing Formality Distribution across LanguagesConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 |
![]() SAIDS: A Novel Approach for Sentiment Analysis Informed of Dialect and
SarcasmWorkshop on Arabic Natural Language Processing (WANLP), 2023 |
![]() NusaCrowd: Open Source Initiative for Indonesian NLP ResourcesAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 |
![]() Maknuune: A Large Open Palestinian Arabic LexiconWorkshop on Arabic Natural Language Processing (WANLP), 2022 |
![]() One Country, 700+ Languages: NLP Challenges for Underrepresented
Languages and Dialects in IndonesiaAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 |