EXAMS: A Multi-Subject High School Examinations Dataset for Cross-Lingual and Multilingual Question Answering

5 November 2020

ArXiv (abs)PDF HTML Github (44★)

Papers citing "EXAMS: A Multi-Subject High School Examinations Dataset for Cross-Lingual and Multilingual Question Answering"

27 / 27 papers shown

Evaluating Arabic Large Language Models: A Survey of Benchmarks, Methods, and Gaps

Ahmed Alzubaidi

Shaikha Alsuwaidi

Basma El Amel Boussaha

195

15 Oct 2025

Tahakom LLM Guidelines and Recipes: From Pre-training Data to an Arabic LLM

...

251

15 Oct 2025

MeTA-LoRA: Data-Efficient Multi-Task Fine-Tuning for Large Language Models

199

13 Oct 2025

Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale

Hasan Hammoud

Mohammad Zbeeb

Bernard Ghanem

183

17 Sep 2025

Cetvel: A Unified Benchmark for Evaluating Language Understanding, Generation and Cultural Capacity of LLMs for Turkish

218

22 Aug 2025

Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization

289

06 Aug 2025

MultiLoKo: a multilingual local knowledge benchmark for LLMs spanning 31 languages

Dieuwke Hupkes

Nikolay Bogoychev

1.1K

14 Apr 2025

Command R7B Arabic: A Small, Enterprise Focused, Multilingual, and Culturally Aware Arabic LLM

...

342

18 Mar 2025

Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History

1.0K

17 Jan 2025

Enhancing Character-Level Understanding in LLMs through Token Internal Structure LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

591

26 Nov 2024

Bilingual Evaluation of Language Models on General Knowledge in University Entrance Exams with Minimal ContaminationInternational Conference on Computational Linguistics (COLING), 2024

Eva Sánchez Salido

Roser Morante

Julio Gonzalo

Guillermo Marco

Jorge Carrillo-de-Albornoz

...

Enrique Amigó

Andrés Fernández

Alejandro Benito-Santos

Adrián Ghajari Espinosa

Victor Fresno

ELM

329

19 Sep 2024

Bilingual Adaptation of Monolingual Foundation Models

...

Mohamed El Karim Chami

Preslav Nakov

CLL

388

13 Jul 2024

Mitigating Catastrophic Forgetting in Language Transfer via Model Merging

Ce Zhang

461

11 Jul 2024

New Textual Corpora for Serbian Language Modeling

Mihailo Škorić

Nikola Janković

175

15 May 2024

SambaLingo: Teaching Large Language Models New Languages

268

08 Apr 2024

To Translate or Not to Translate: A Systematic Investigation of Translation-Based Cross-Lingual Transfer to Low-Resource LanguagesNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023

Benedikt Ebing

Goran Glavaš

285

15 Nov 2023

Bactrian-X: Multilingual Replicable Instruction-Following Models with Low-Rank Adaptation

263

24 May 2023

Dolphin: A Challenging and Diverse Benchmark for Arabic NLGConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

El Moatez Billah Nagoudi

AbdelRahim Elmadany

Ahmed Oumar El-Shangiti

Muhammad Abdul-Mageed

LM&MA

363

24 May 2023

xPQA: Cross-Lingual Product Question Answering across 12 LanguagesAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Xiaoyu Shen

Akari Asai

Bill Byrne

Adria de Gispert

275

16 May 2023

Multi-stage Distillation Framework for Cross-Lingual Semantic Similarity Matching

162

13 Sep 2022

Investigating Information Inconsistency in Multilingual Open-Domain Question Answering

208

25 May 2022

Leaf: Multiple-Choice Question GenerationEuropean Conference on Information Retrieval (ECIR), 2022

332

22 Jan 2022

DuRecDial 2.0: A Bilingual Parallel Corpus for Conversational Recommendation

Zeming Liu

183

18 Sep 2021

CodeQA: A Question Answering Dataset for Source Code Comprehension

Chenxiao Liu

Xiaojun Wan

238

17 Sep 2021

Self-Teaching Machines to Read and Comprehend with Large-Scale Multi-Subject Question-Answering DataConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Dian Yu

Kai Sun

Dong Yu

Claire Cardie

199

01 Feb 2021

XOR QA: Cross-lingual Open-Retrieval Question Answering

401

175

22 Oct 2020

TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse LanguagesTransactions of the Association for Computational Linguistics (TACL), 2020

683

711

10 Mar 2020