
| Title | Venue | Year |
|---|---|---|
| AI-generated stories favour stability over change: homogeneity and cultural stereotyping in narratives generated by gpt-4o-mini | Open Research Europe (ORE) | 2025 |
| Beyond Text Compression: Evaluating Tokenizers Across Scales | Annual Meeting of the Association for Computational Linguistics (ACL) | 2025 |
| Minimal Pair-Based Evaluation of Code-Switching | Annual Meeting of the Association for Computational Linguistics (ACL) | 2025 |
| Adversarial Tokenization | Annual Meeting of the Association for Computational Linguistics (ACL) | 2025 |
| Tokenization is Sensitive to Language Variation | Annual Meeting of the Association for Computational Linguistics (ACL) | 2025 |
| DiSCo: Device-Server Collaborative LLM-Based Text Streaming Services | Annual Meeting of the Association for Computational Linguistics (ACL) | 2025 |
| MrT5: Dynamic Token Merging for Efficient Byte-level Language Models | International Conference on Learning Representations (ICLR) | 2024 |
| Assessing Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks | Annual Meeting of the Association for Computational Linguistics (ACL) | 2024 |
| Adapters for Altering LLM Vocabularies: What Languages Benefit the Most? | International Conference on Learning Representations (ICLR) | 2024 |
| From Tokens to Words: On the Inner Lexicon of LLMs | International Conference on Learning Representations (ICLR) | 2024 |
| ExploreSelf: Fostering User-driven Exploration and Reflection on Personal Challenges with Adaptive Guidance by Large Language Models | International Conference on Human Factors in Computing Systems (CHI) | 2024 |
| Where is the signal in tokenization space? | Conference on Empirical Methods in Natural Language Processing (EMNLP) | 2024 |