Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2011.03080
Cited By
EXAMS: A Multi-Subject High School Examinations Dataset for Cross-Lingual and Multilingual Question Answering
5 November 2020
Momchil Hardalov
Todor Mihaylov
Dimitrina Zlatkova
Yoan Dinkov
Ivan Koychev
Preslav Nakov
AI4Ed
ELM
Re-assign community
ArXiv (abs)
PDF
HTML
Github (44★)
Papers citing
"EXAMS: A Multi-Subject High School Examinations Dataset for Cross-Lingual and Multilingual Question Answering"
27 / 27 papers shown
Evaluating Arabic Large Language Models: A Survey of Benchmarks, Methods, and Gaps
Ahmed Alzubaidi
Shaikha Alsuwaidi
Basma El Amel Boussaha
Leen AlQadi
Omar Alkaabi
Mohammed Alyafeai
Hamza Alobeidli
Hakim Hacid
ELM
195
2
0
15 Oct 2025
Tahakom LLM Guidelines and Recipes: From Pre-training Data to an Arabic LLM
Areej AlOtaibi
Lina Alyahya
Raghad Alshabanah
Shahad Alfawzan
Shuruq Alarefei
...
Waad Alahmed
Omar Talabay
Jalal Alowibdi
Salem Alelyani
Adel Bibi
251
0
0
15 Oct 2025
MeTA-LoRA: Data-Efficient Multi-Task Fine-Tuning for Large Language Models
Bo Cheng
Xu Wang
Jinda Liu
Yi-Ju Chang
Yuan Wu
MoE
ALM
199
0
0
13 Oct 2025
Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale
Hasan Hammoud
Mohammad Zbeeb
Bernard Ghanem
183
2
0
17 Sep 2025
Cetvel: A Unified Benchmark for Evaluating Language Understanding, Generation and Cultural Capacity of LLMs for Turkish
Yakup Abrek Er
.Ilker Kesen
Gözde Gül Şahin
Aykut Erdem
ELM
VLM
218
4
0
22 Aug 2025
Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization
Negar Foroutan
Clara Meister
Debjit Paul
Joel Niklaus
Sina Ahmadi
Antoine Bosselut
Rico Sennrich
289
6
0
06 Aug 2025
MultiLoKo: a multilingual local knowledge benchmark for LLMs spanning 31 languages
Dieuwke Hupkes
Nikolay Bogoychev
1.1K
15
0
14 Apr 2025
Command R7B Arabic: A Small, Enterprise Focused, Multilingual, and Culturally Aware Arabic LLM
Yazeed Alnumay
Alexandre Barbet
Anna Bialas
William Darling
Shaan Desai
...
Stephanie Howe
Olivia Lasche
Justin Lee
Anirudh Shrinivason
Jennifer Tracey
342
7
0
18 Mar 2025
Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History
Yevhen Kostiuk
O. Vitman
Łukasz Gagała
Artur Kiulian
ELM
1.0K
2
0
17 Jan 2025
Enhancing Character-Level Understanding in LLMs through Token Internal Structure Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Zhu Xu
Zhiqiang Zhao
Zihan Zhang
Yuchi Liu
Quanwei Shen
Fei Liu
Yu Kuang
Jian He
Conglin Liu
591
4
0
26 Nov 2024
Bilingual Evaluation of Language Models on General Knowledge in University Entrance Exams with Minimal Contamination
International Conference on Computational Linguistics (COLING), 2024
Eva Sánchez Salido
Roser Morante
Julio Gonzalo
Guillermo Marco
Jorge Carrillo-de-Albornoz
...
Enrique Amigó
Andrés Fernández
Alejandro Benito-Santos
Adrián Ghajari Espinosa
Victor Fresno
ELM
329
2
0
19 Sep 2024
Bilingual Adaptation of Monolingual Foundation Models
Gurpreet Gosal
Yishi Xu
Gokul Ramakrishnan
Rituraj Joshi
Avraham Sheinin
...
Rahul Pal
Parvez Mullah
Soundar Doraiswamy
Mohamed El Karim Chami
Preslav Nakov
CLL
388
6
0
13 Jul 2024
Mitigating Catastrophic Forgetting in Language Transfer via Model Merging
Anton Alexandrov
Veselin Raychev
Mark Niklas Muller
Ce Zhang
Martin Vechev
Kristina Toutanova
MoMe
CLL
KELM
461
40
0
11 Jul 2024
New Textual Corpora for Serbian Language Modeling
Mihailo Škorić
Nikola Janković
175
2
0
15 May 2024
SambaLingo: Teaching Large Language Models New Languages
Zoltan Csaki
Bo Li
Jonathan Li
Qiantong Xu
Pian Pawakapan
Leon Zhang
Yun Du
Hengyu Zhao
Changran Hu
Urmish Thakker
268
14
0
08 Apr 2024
To Translate or Not to Translate: A Systematic Investigation of Translation-Based Cross-Lingual Transfer to Low-Resource Languages
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Benedikt Ebing
Goran Glavaš
285
10
0
15 Nov 2023
Bactrian-X: Multilingual Replicable Instruction-Following Models with Low-Rank Adaptation
Jinyan Su
Fajri Koto
Minghao Wu
Alham Fikri Aji
Timothy Baldwin
ALM
263
86
0
24 May 2023
Dolphin: A Challenging and Diverse Benchmark for Arabic NLG
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
El Moatez Billah Nagoudi
AbdelRahim Elmadany
Ahmed Oumar El-Shangiti
Muhammad Abdul-Mageed
LM&MA
363
29
0
24 May 2023
xPQA: Cross-Lingual Product Question Answering across 12 Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Xiaoyu Shen
Akari Asai
Bill Byrne
Adria de Gispert
275
11
0
16 May 2023
Multi-stage Distillation Framework for Cross-Lingual Semantic Similarity Matching
Kunbo Ding
Weijie Liu
Yuejian Fang
Zhe Zhao
Qi Ju
Xuefeng Yang
162
2
0
13 Sep 2022
Investigating Information Inconsistency in Multilingual Open-Domain Question Answering
Shramay Palta
Haozhe An
Yifan Yang
Shuaiyi Huang
Maharshi Gor
208
1
0
25 May 2022
Leaf: Multiple-Choice Question Generation
European Conference on Information Retrieval (ECIR), 2022
Kristiyan Vachev
Momchil Hardalov
Georgi Karadzhov
Georgi Georgiev
Ivan Koychev
Preslav Nakov
AI4Ed
332
30
0
22 Jan 2022
DuRecDial 2.0: A Bilingual Parallel Corpus for Conversational Recommendation
Zeming Liu
Haifeng Wang
Zheng-Yu Niu
Hua Wu
Wanxiang Che
183
74
0
18 Sep 2021
CodeQA: A Question Answering Dataset for Source Code Comprehension
Chenxiao Liu
Xiaojun Wan
238
46
0
17 Sep 2021
Self-Teaching Machines to Read and Comprehend with Large-Scale Multi-Subject Question-Answering Data
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Dian Yu
Kai Sun
Dong Yu
Claire Cardie
199
8
0
01 Feb 2021
XOR QA: Cross-lingual Open-Retrieval Question Answering
Akari Asai
Jungo Kasai
J. Clark
Kenton Lee
Eunsol Choi
Hannaneh Hajishirzi
401
175
0
22 Oct 2020
TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages
Transactions of the Association for Computational Linguistics (TACL), 2020
J. Clark
Eunsol Choi
Michael Collins
Dan Garrette
Tom Kwiatkowski
Vitaly Nikolaev
J. Palomaki
683
711
0
10 Mar 2020
1
Page 1 of 1