XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

15 April 2021

Graham Neubig

Papers citing "XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation"

50 / 147 papers shown

Rethinking what Matters: Effective and Robust Multilingual Realignment for Low-Resource Languages

218

09 Nov 2025

TransAlign: Machine Translation Encoders are Strong Word Aligners, Too

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025

Benedikt Ebing

Christian Goldschmied

Goran Glavaš

159

31 Oct 2025

Modality Matching Matters: Calibrating Language Distances for Cross-Lingual Transfer in URIEL+

198

22 Oct 2025

Model-Based Ranking of Source Languages for Zero-Shot Cross-Lingual Transfer

Abteen Ebrahimi

Adam Wiemerslage

Katharina von der Wense

LRM

222

03 Oct 2025

MENLO: From Preferences to Proficiency -- Evaluating and Modeling Native-like Quality Across 47 Languages

194

30 Sep 2025

Evaluating Language Translation Models by Playing Telephone

Syeda Jannatus Saba

Steven Skiena

148

23 Sep 2025

SinhalaMMLU: A Comprehensive Benchmark for Evaluating Multitask Language Understanding in Sinhala

282

03 Sep 2025

Quantifying Language Disparities in Multilingual Large Language Models

Songbo Hu

Ivan Vulić

Anna Korhonen

150

23 Aug 2025

Cetvel: A Unified Benchmark for Evaluating Language Understanding, Generation and Cultural Capacity of LLMs for Turkish

227

22 Aug 2025

Survey of NLU Benchmarks Diagnosing Linguistic Phenomena: Why not Standardize Diagnostics Benchmarks?

287

27 Jul 2025

IndicRAGSuite: Large-Scale Datasets and a Benchmark for Indian Language RAG Systems

319

02 Jun 2025

Moderating Harm: Benchmarking Large Language Models for Cyberbullying Detection in YouTube Comments

International Journal of Computer Applications (IJCA), 2025

Amel Muminovic

ELM AI4MH

282

25 May 2025

The Devil Is in the Word Alignment Details: On Translation-Based Cross-Lingual Transfer for Token Classification Tasks

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

Benedikt Ebing

Goran Glavaš

408

15 May 2025

Myanmar XNLI: Building a Dataset and Exploring Low-resource Approaches to Natural Language Inference with Myanmar

Language Resources and Evaluation (LRE), 2025

Aung Kyaw Htet

Mark Dras

216

13 Apr 2025

LAG-MMLU: Benchmarking Frontier LLM Understanding in Latvian and Giriama

1.0K

14 Mar 2025

MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation

...

Edison Marrese-Taylor

608

13 Mar 2025

NusaAksara: A Multimodal and Multilingual Benchmark for Preserving Indonesian Indigenous Scripts

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

Muhammad Farid Adilazuarda

408

25 Feb 2025

URIEL+: Enhancing Linguistic Inclusion and Usability in a Typological and Multilingual Knowledge Base

International Conference on Computational Linguistics (COLING), 2024

430

17 Feb 2025

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

International Conference on Learning Representations (ICLR), 2024

...

519

29 Nov 2024

DiffSLT: Enhancing Diversity in Sign Language Translation via Diffusion Model

Pattern Recognition Letters (PR), 2024

326

26 Nov 2024

Cross-lingual Back-Parsing: Utterance Synthesis from Meaning Representation for Zero-Resource Semantic Parsing

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Deokhyung Kang

Seonjeong Hwang

Yunsu Kim

Gary Geunbae Lee

315

01 Oct 2024

XTRUST: On the Multilingual Trustworthiness of Large Language Models

320

24 Sep 2024

Bilingual Evaluation of Language Models on General Knowledge in University Entrance Exams with Minimal Contamination

International Conference on Computational Linguistics (COLING), 2024

Eva Sánchez Salido

Roser Morante

Julio Gonzalo

Guillermo Marco

Jorge Carrillo-de-Albornoz

...

Enrique Amigó

Andrés Fernández

Alejandro Benito-Santos

Adrián Ghajari Espinosa

Victor Fresno

ELM

334

19 Sep 2024

AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs

423

17 Sep 2024

Do Large Language Models Speak All Languages Equally? A Comparative Study in Low-Resource Settings

272

05 Aug 2024

mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

...

Fei Huang

Min Zhang

357

277

29 Jul 2024

sPhinX: Sample Efficient Multilingual Instruction Fine-Tuning Through N-shot Guided Prompting

Sanchit Ahuja

Kumar Tanmay

Hardik Hansrajbhai Chauhan

...

530

13 Jul 2024

Faux Polyglot: A Study on Information Disparity in Multilingual Large Language Models

Nikhil Sharma

Kenton Murray

Ziang Xiao

551

07 Jul 2024

Disce aut Deficere: Evaluating LLMs Proficiency on the INVALSI Italian Benchmark

319

25 Jun 2024

PARIKSHA : A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data

325

21 Jun 2024

On the Evaluation Practices in Multilingual NLP: Can Machine Translation Offer an Alternative to Human Translations?

458

20 Jun 2024

Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Fabian David Schmidt

Philipp Borchert

Ivan Vulić

Goran Glavaš

291

18 Jun 2024

Decoding the Diversity: A Review of the Indic AI Research Landscape

Sankalp KJ

Vinija Jain

S. Bhaduri

Tamoghna Roy

Vasu Sharma

328

13 Jun 2024

MINERS: Multilingual Language Models as Semantic Retrievers

Genta Indra Winata

Ruochen Zhang

David Ifeoluwa Adelani

RALM

491

11 Jun 2024

From Form(s) to Meaning: Probing the Semantic Depths of Language Models Using Multisense Consistency

336

18 Apr 2024

Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers

Zhi Chen

Wanxiang Che

Philip S. Yu

LRM

401

07 Apr 2024

DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages

Antonios Anastasopoulos

259

16 Mar 2024

Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

294

08 Mar 2024

Evaluating the Elementary Multilingual Capabilities of Large Language Models with MultiQ

Paul Röttger

331

06 Mar 2024

Could We Have Had Better Multilingual LLMs If English Was Not the Central Language?

445

21 Feb 2024

ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic

Abdelrahman Boda Sadallah

...

351

20 Feb 2024

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

...

433

192

09 Feb 2024

What is "Typological Diversity" in NLP?

552

06 Feb 2024

Analyzing the Evaluation of Cross-Lingual Knowledge Transfer in Multilingual Language Models

Sara Rajaee

Christof Monz

293

03 Feb 2024

Translation Errors Significantly Impact Low-Resource Languages in Cross-Lingual Learning

Ashish Agrawal

Barah Fazili

Preethi Jyothi

278

03 Feb 2024

ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks

374

29 Jan 2024

Discovering Low-rank Subspaces for Language-agnostic Multilingual Representations

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Zhihui Xie

Handong Zhao

Tong Yu

Shuai Li

287

11 Jan 2024

Understanding LLMs: A Comprehensive Overview from Training to Inference

...

Tuo Zhang

Tianming Liu

502

139

04 Jan 2024

To Translate or Not to Translate: A Systematic Investigation of Translation-Based Cross-Lingual Transfer to Low-Resource Languages

North American Chapter of the Association for Computational Linguistics (NAACL), 2023

Benedikt Ebing

Goran Glavaš

288

15 Nov 2023

PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

451

15 Nov 2023