Language Model Tokenizers Introduce Unfairness Between LanguagesNeural Information Processing Systems (NeurIPS), 2023 |
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large
Language Models in Multilingual LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 |
Measuring Massive Multitask Language UnderstandingInternational Conference on Learning Representations (ICLR), 2020 |
XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training,
Understanding and GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2020 |