XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
3 April 2020
Yaobo Liang
Nan Duan
Yeyun Gong
Ning Wu
Fenfei Guo
Weizhen Qi
Ming Gong
Linjun Shou
Daxin Jiang
Guihong Cao
Xiaodong Fan
Bruce Zhang
Rahul Agrawal
Edward Cui
Sining Wei
Taroon Bharti
Ying Qiao
Jiun-Hung Chen
Winnie Wu
Shuguang Liu
Fan Yang
Daniel Fernando Campos
Rangan Majumder
Ming Zhou
ELM, VLM

Papers citing "XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation"

50 / 222 papers shown
Operator-Theoretic Framework for Gradient-Free Federated Learning
Mohit Kumar
Mathias Brucker
Alexander Valentinitsch
Adnan Husaković
Ali Abbas
Manuela Geiß
Bernhard A. Moser
FedML
279
0
0
30 Nov 2025
Cetvel: A Unified Benchmark for Evaluating Language Understanding, Generation and Cultural Capacity of LLMs for Turkish
Yakup Abrek Er
İlker Kesen
Gözde Gül Şahin
Aykut Erdem
ELM, VLM
230
4
0
22 Aug 2025
TASE: Token Awareness and Structured Evaluation for Multilingual Language Models
Chenzhuo Zhao
Xinda Wang
Yue Huang
Junting Lu
Ziqian Liu
LRM
140
1
0
07 Aug 2025
Survey of NLU Benchmarks Diagnosing Linguistic Phenomena: Why not Standardize Diagnostics Benchmarks?
Khloud Al Jallad
Nada Ghneim
Ghaida Rebdawi
LM&MA, ELM
287
0
0
27 Jul 2025
A Culturally-Rich Romanian NLP Dataset from "Who Wants to Be a Millionaire?" Videos
Alexandru-Gabriel Ganea
Antonia-Adelina Popovici
Adrian-Marius Dumitran
249
0
0
06 Jun 2025
BnMMLU: Measuring Massive Multitask Language Understanding in Bengali
Saman Sarker Joy
Swakkhar Shatabda
ELM
231
2
0
25 May 2025
MAPS: A Multilingual Benchmark for Agent Performance and Security
Omer Hofman
Jonathan Brokman
Oren Rachmil
Shamik Bose
Vikas Pahuja
Toshiya Shimizu
Trisha Starostina
Kelly Marchisio
Seraphina Goldfarb-Tarrant
Roman Vainshtein
300
2
0
21 May 2025
New Encoders for German Trained from Scratch: Comparing ModernGBERT with Converted LLM2Vec Models
Julia Wunderle
Jan Pfister
Fotis Jannidis
Andreas Hotho
OffRL
364
3
0
19 May 2025
ReLI: A Language-Agnostic Approach to Human-Robot Interaction
Linus Nwankwo
Bjoern Ellensohn
Ozan Özdenizci
Elmar Rueckert
LM&Ro
718
1
0
03 May 2025
A Survey on Parameter-Efficient Fine-Tuning for Foundation Models in Federated Learning
Jieming Bian
Yuanzhe Peng
Lei Wang
Yin Huang
Jie Xu
FedML
421
14
0
29 Apr 2025
Command R7B Arabic: A Small, Enterprise Focused, Multilingual, and Culturally Aware Arabic LLM
Yazeed Alnumay
Alexandre Barbet
Anna Bialas
William Darling
Shaan Desai
...
Stephanie Howe
Olivia Lasche
Justin Lee
Anirudh Shrinivason
Jennifer Tracey
353
10
0
18 Mar 2025
TLUE: A Tibetan Language Understanding Evaluation Benchmark
Fan Gao
Cheng Huang
Nyima Tashi
Xiangxiang Wang
Thupten Tsering
...
Gadeng Luosang
Rinchen Dongrub
Dorje Tashi
Xiao Feng
Yongbin Yu
ELM
625
7
0
15 Mar 2025
LAG-MMLU: Benchmarking Frontier LLM Understanding in Latvian and Giriama
Naome A. Etori
Kevin Lu
Randu Karisa
Arturs Kanepajs
LRM, ELM
1.0K
4
0
14 Mar 2025
MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation
Weihao Xuan
Rui Yang
Heli Qi
Qingcheng Zeng
Yunze Xiao
...
Edison Marrese-Taylor
Shijian Lu
Yusuke Iwasawa
Yutaka Matsuo
Irene Li
ELM
608
50
0
13 Mar 2025
EuroBERT: Scaling Multilingual Encoders for European Languages
Nicolas Boizard
Hippolyte Gisserot-Boukhlef
Duarte M. Alves
André F. T. Martins
Ayoub Hammal
...
Maxime Peyrard
Nuno M. Guerreiro
Patrick Fernandes
Ricardo Rei
Pierre Colombo
1.2K
23
0
07 Mar 2025
Where Are We? Evaluating LLM Performance on African Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Ife Adebara
Hawau Olamide Toyin
Nahom Tesfu Ghebremichael
AbdelRahim Elmadany
Muhammad Abdul-Mageed
443
11
0
26 Feb 2025
NusaAksara: A Multimodal and Multilingual Benchmark for Preserving Indonesian Indigenous Scripts
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Muhammad Farid Adilazuarda
M. Wijanarko
Lucky Susanto
Khumaisa Nur'aini
Derry Wijaya
Alham Fikri Aji
409
4
0
25 Feb 2025
KazMMLU: Evaluating Language Models on Kazakh, Russian, and Regional Knowledge of Kazakhstan
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Mukhammed Togmanov
Nurdaulet Mukhituly
Diana Turmakhan
Jonibek Mansurov
Maiya Goloburda
...
Nurkhan Laiyk
Alham Fikri Aji
Ekaterina Kochmar
Preslav Nakov
Fajri Koto
ELM
291
10
0
18 Feb 2025
IndicMMLU-Pro: Benchmarking Indic Large Language Models on Multi-Task Language Understanding
Sankalp KJ
Ashutosh Kumar
Laxmaan Balaji
Nikunj Kotecha
Vinija Jain
Vasu Sharma
S. Bhaduri
ELM
1.1K
9
0
27 Jan 2025
SailCompass: Towards Reproducible and Robust Evaluation for Southeast Asian Languages
Jia Guo
Longxu Dou
Guangtao Zeng
Stanley Kok
Wei Lu
Qian Liu
ELM, LRM
347
3
0
02 Dec 2024
ChemTEB: Chemical Text Embedding Benchmark, an Overview of Embedding Models Performance & Efficiency on a Specific Domain
Ali Shiraee Kasmaee
Mohammad Khodadad
Mohammad Arshi Saloot
Nick Sherck
Stephen Dokas
H. Mahyar
Soheila Samiee
ELM
1.4K
11
0
30 Nov 2024
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
International Conference on Learning Representations (ICLR), 2024
Angelika Romanou
Negar Foroutan
Anna Sotnikova
Zeming Chen
Sree Harsha Nelaturu
...
Mike Zhang
Imanol Schlag
Marzieh Fadaee
Sara Hooker
Antoine Bosselut
ELM
519
44
0
29 Nov 2024
USTCCTSU at SemEval-2024 Task 1: Reducing Anisotropy for Cross-lingual Semantic Textual Relatedness Task
International Workshop on Semantic Evaluation (SemEval), 2024
Jianjian Li
Shengwei Liang
Yong Liao
Hongping Deng
Haiyang Yu
415
2
0
28 Nov 2024
LLäMmlein: Transparent, Compact and Competitive German-Only Language Models from Scratch
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Jan Pfister
Julia Wunderle
Andreas Hotho
620
2
0
17 Nov 2024
Delta: A Cloud-assisted Data Enrichment Framework for On-Device Continual Learning
ACM/IEEE International Conference on Mobile Computing and Networking (MobiCom), 2024
Chen Gong
Zhenzhe Zheng
Fan Wu
Xiaofeng Jia
Guihai Chen
LMTD, FedML
423
12
0
24 Oct 2024
VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic Reasoning Tasks
Shailaja Keyur Sampat
Mutsumi Nakamura
Shankar Kailas
Kartik Aggarwal
Mandy Zhou
Yezhou Yang
Chitta Baral
MLLM, CoGe, ReLM, VLM, LRM
244
1
0
17 Oct 2024
XTRUST: On the Multilingual Trustworthiness of Large Language Models
Yahan Li
Yi Wang
Yi-Ju Chang
Yuan Wu
LRM, HILM
326
2
0
24 Sep 2024
AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs
Basel Mousi
Nadir Durrani
Fatema Ahmad
Md. Arid Hasan
Maram Hasanain
Tameem Kabbani
Fahim Dalvi
Shammur A. Chowdhury
Firoj Alam
424
52
0
17 Sep 2024
SpeciaLex: A Benchmark for In-Context Specialized Lexicon Learning
Joseph Marvin Imperial
Harish Tayyar Madabushi
234
1
0
18 Jul 2024
Faux Polyglot: A Study on Information Disparity in Multilingual Large Language Models
Nikhil Sharma
Kenton Murray
Ziang Xiao
551
5
0
07 Jul 2024
Multilingual Trolley Problems for Language Models
Zhijing Jin
Sydney Levine
Max Kleiman-Weiner
Giorgio Piatti
Jiarui Liu
...
András Strausz
Mrinmaya Sachan
Amélie Reymond
Yejin Choi
Bernhard Schölkopf
LRM
470
0
0
02 Jul 2024
PARIKSHA : A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data
Ishaan Watts
Varun Gumma
Aditya Yadavalli
Vivek Seshadri
Manohar Swaminathan
Sunayana Sitaram
ELM
325
29
0
21 Jun 2024
On the Evaluation Practices in Multilingual NLP: Can Machine Translation Offer an Alternative to Human Translations?
Rochelle Choenni
Sara Rajaee
Christof Monz
Ekaterina Shutova
458
6
0
20 Jun 2024
Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model
Runzhe Zhan
Xinyi Yang
Yang Li
Lidia S. Chao
Yue Zhang
417
12
0
25 Apr 2024
Translation of Multifaceted Data without Re-Training of Machine Translation Systems
Hyeonseok Moon
Seungyoon Lee
Seongtae Hong
Seungjun Lee
Chanjun Park
Heu-Jeoung Lim
207
0
0
25 Apr 2024
TartuNLP @ SIGTYP 2024 Shared Task: Adapting XLM-RoBERTa for Ancient and Historical Languages
Aleksei Dorkin
Kairit Sirts
162
3
0
19 Apr 2024
From Form(s) to Meaning: Probing the Semantic Depths of Language Models Using Multisense Consistency
Xenia Ohmer
Elia Bruni
Dieuwke Hupkes
AI4CE
336
11
0
18 Apr 2024
PORTULAN ExtraGLUE Datasets and Models: Kick-starting a Benchmark for the Neural Processing of Portuguese
T. Osório
Bernardo Leite
Henrique Lopes Cardoso
Luís Gomes
João Rodrigues
Rodrigo Santos
António Branco
393
6
0
08 Apr 2024
Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers
Libo Qin
Qiguang Chen
Yuhang Zhou
Zhi Chen
Hai-Tao Zheng
Lizi Liao
Min Li
Wanxiang Che
Philip S. Yu
LRM
401
60
0
07 Apr 2024
MaiNLP at SemEval-2024 Task 1: Analyzing Source Language Selection in Cross-Lingual Textual Relatedness
International Workshop on Semantic Evaluation (SemEval), 2024
Shijia Zhou
Huangyan Shan
Barbara Plank
Robert Litschko
276
2
0
03 Apr 2024
Can Machine Translation Bridge Multilingual Pretraining and Cross-lingual Transfer Learning?
Shaoxiong Ji
Timothee Mickus
Vincent Segonne
Jörg Tiedemann
CLL
290
7
0
25 Mar 2024
VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding
Phong Nguyen-Thuan Do
Son Quoc Tran
Phu Gia Hoang
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
ELM
284
9
0
23 Mar 2024
DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Fahim Faisal
Orevaoghene Ahia
Aarohi Srivastava
Kabir Ahuja
David Chiang
Yulia Tsvetkov
Antonios Anastasopoulos
260
56
0
16 Mar 2024
CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean
International Conference on Language Resources and Evaluation (LREC), 2024
Eunsu Kim
Juyoung Suk
Philhoon Oh
Haneul Yoo
Hyunjung Shim
Alice Oh
ELM
572
62
0
11 Mar 2024
Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Arijit Nag
Animesh Mukherjee
Niloy Ganguly
Soumen Chakrabarti
294
10
0
08 Mar 2024
A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data Sets
Tanja Samardzic
Ximena Gutierrez-Vasques
Christian Bentz
Steven Moran
Olga Pelloni
282
14
0
06 Mar 2024
Evaluating the Elementary Multilingual Capabilities of Large Language Models with MultiQ
Carolin Holtermann
Paul Röttger
Timm Dill
Anne Lauscher
ELM, LRM
333
36
0
06 Mar 2024
Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey
Dinh-Viet-Toan Le
Louis Bigo
Mikaela Keller
Dorien Herremans
MedIm
261
32
0
27 Feb 2024
$C^3$: Confidence Calibration Model Cascade for Inference-Efficient Cross-Lingual Natural Language Understanding
Taixi Lu
Haoyu Wang
Huajie Shao
Jing Gao
Huaxiu Yao
191
0
0
25 Feb 2024
ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic
Fajri Koto
Jinyan Su
Sara Shatnawi
Jad Doughman
Abdelrahman Boda Sadallah
...
Neha Sengupta
Shady Shehata
Farah E. Shamout
Preslav Nakov
Timothy Baldwin
ELM, LRM
351
85
0
20 Feb 2024