Revisiting Syllables in Language Modelling and their Application on Low-Resource Machine Translation. International Conference on Computational Linguistics (COLING), 2022
Data-driven Model Generalizability in Crosslinguistic Low-resource Morphological Segmentation. Transactions of the Association for Computational Linguistics (TACL), 2022
A multilabel approach to morphosyntactic probing. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation. Transactions of the Association for Computational Linguistics (TACL), 2021