TokAlign: Efficient Vocabulary Adaptation via Token AlignmentAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Cross-Lingual Optimization for Language Transfer in Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Adapters for Altering LLM Vocabularies: What Languages Benefit the Most?International Conference on Learning Representations (ICLR), 2024 |
Zero-Shot Tokenizer TransferNeural Information Processing Systems (NeurIPS), 2024 |
Distilling Efficient Language-Specific Models for Cross-Lingual TransferAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 |
Mini-Model Adaptation: Efficiently Extending Pretrained Models to New
Languages via Aligned Shallow TrainingAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 |
GreenPLM: Cross-Lingual Transfer of Monolingual Pre-Trained Language
Models at Almost No CostInternational Joint Conference on Artificial Intelligence (IJCAI), 2022 |
Lifting the Curse of Multilinguality by Pre-training Modular
TransformersNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022 |
Language Contamination Helps Explain the Cross-lingual Capabilities of
English Pretrained ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022 Terra Blevins Luke Zettlemoyer |
Oolong: Investigating What Makes Transfer Learning Hard with Controlled
StudiesConference on Empirical Methods in Natural Language Processing (EMNLP), 2022 |
Subword Mapping and Anchoring across LanguagesConference on Empirical Methods in Natural Language Processing (EMNLP), 2021 |
Identifying the Limits of Cross-Domain Knowledge Transfer for Pretrained
ModelsWorkshop on Representation Learning for NLP (RepL4NLP), 2021 |
A Primer in BERTology: What we know about how BERT worksTransactions of the Association for Computational Linguistics (TACL), 2020 |
On the Cross-lingual Transferability of Monolingual RepresentationsAnnual Meeting of the Association for Computational Linguistics (ACL), 2019 |