2003.11080
XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization
24 March 2020
Junjie Hu
Sebastian Ruder
Aditya Siddhant
Graham Neubig
Orhan Firat
Melvin Johnson
ELM
Papers citing
"XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization"
50 / 659 papers shown
Title
MCL@IITK at SemEval-2021 Task 2: Multilingual and Cross-lingual Word-in-Context Disambiguation using Augmented Data, Signals, and Transformers
Rohan Gupta
Jay Mundra
Deepak Mahajan
Ashutosh Modi
12
3
0
04 Apr 2021
UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training
Mingyang Zhou
Luowei Zhou
Shuohang Wang
Yu Cheng
Linjie Li
Zhou Yu
Jingjing Liu
MLLM
VLM
18
89
0
01 Apr 2021
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
Oleksiy Syvokon
O. Nahorna
16
32
0
31 Mar 2021
NaijaNER: Comprehensive Named Entity Recognition for 5 Nigerian Languages
W. Oyewusi
Olubayo Adekanmbi
Ife Okoh
Vitus Onuigwe
M. Salami
Opeyemi Osakuade
Sharon Ibejih
U. Musa
13
11
0
30 Mar 2021
Are Multilingual Models Effective in Code-Switching?
Genta Indra Winata
Samuel Cahyawijaya
Zihan Liu
Zhaojiang Lin
Andrea Madotto
Pascale Fung
15
70
0
24 Mar 2021
NaturalProofs: Mathematical Theorem Proving in Natural Language
Sean Welleck
Jiacheng Liu
Ronan Le Bras
Hannaneh Hajishirzi
Yejin Choi
Kyunghyun Cho
AIMat
24
62
0
24 Mar 2021
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
Julia Kreutzer
Isaac Caswell
Lisa Wang
Ahsan Wahab
D. Esch
...
Duygu Ataman
Orevaoghene Ahia
Oghenefego Ahia
Sweta Agrawal
Mofetoluwa Adeyemi
20
265
0
22 Mar 2021
MasakhaNER: Named Entity Recognition for African Languages
David Ifeoluwa Adelani
Jade Z. Abbott
Graham Neubig
Daniel D'souza
Julia Kreutzer
...
T. Diop
A. Diallo
Adewale Akinfaderin
T. Marengereke
Salomey Osei
22
185
0
22 Mar 2021
MuRIL: Multilingual Representations for Indian Languages
Simran Khanuja
Diksha Bansal
Sarvesh Mehtani
Savya Khosla
Atreyee Dey
...
Shachi Dave
Shruti Gupta
Subhash Chandra Bose Gali
Vishnu Subramanian
Partha P. Talukdar
20
275
0
19 Mar 2021
Graph Convolutional Network for Swahili News Classification
Alexandros Kastanos
Tyler Martin
GNN
42
3
0
16 Mar 2021
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models
Po-Yao (Bernie) Huang
Mandela Patrick
Junjie Hu
Graham Neubig
Florian Metze
Alexander G. Hauptmann
MLLM
VLM
19
56
0
16 Mar 2021
Multi-view Subword Regularization
Xinyi Wang
Sebastian Ruder
Graham Neubig
11
45
0
15 Mar 2021
LightMBERT: A Simple Yet Effective Method for Multilingual BERT Distillation
Xiaoqi Jiao
Yichun Yin
Lifeng Shang
Xin Jiang
Xiao Chen
Linlin Li
Fang Wang
Qun Liu
11
9
0
11 Mar 2021
Majority Voting with Bidirectional Pre-translation For Bitext Retrieval
Alex Jones
Derry Wijaya
6
6
0
10 Mar 2021
Vyākarana: A Colorless Green Benchmark for Syntactic Evaluation in Indic Languages
Rajaswa Patil
Jasleen Dhillon
Siddhant Mahurkar
Saumitra Kulkarni
M. Malhotra
V. Baths
10
1
0
01 Mar 2021
Unbiased Sentence Encoder For Large-Scale Multi-lingual Search Engines
Mahdi Hajiaghayi
Monir Hajiaghayi
Mark R. Bolin
11
0
0
01 Mar 2021
RuSentEval: Linguistic Source, Encoder Force!
Vladislav Mikhailov
Ekaterina Taktasheva
Elina Sigdel
Ekaterina Artemova
VLM
11
6
0
28 Feb 2021
Bilingual Language Modeling, A transfer learning technique for Roman Urdu
Usama Khalid
M. O. Beg
Muhammad Umair Arshad
11
3
0
22 Feb 2021
Revisiting Language Encoding in Learning Multilingual Representations
Shengjie Luo
Kaiyuan Gao
Shuxin Zheng
Guolin Ke
Di He
Liwei Wang
Tie-Yan Liu
26
2
0
16 Feb 2021
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
Sebastian Gehrmann
Tosin P. Adewumi
Karmanya Aggarwal
Pawan Sasanka Ammanamanchi
Aremu Anuoluwapo
...
Nishant Subramani
Wei-ping Xu
Diyi Yang
Akhila Yerukola
Jiawei Zhou
VLM
246
283
0
02 Feb 2021
Multilingual LAMA: Investigating Knowledge in Multilingual Pretrained Language Models
Nora Kassner
Philipp Dufter
Hinrich Schütze
15
132
0
01 Feb 2021
Does Typological Blinding Impede Cross-Lingual Sharing?
Johannes Bjerva
Isabelle Augenstein
11
17
0
28 Jan 2021
First Align, then Predict: Understanding the Cross-Lingual Ability of Multilingual BERT
Benjamin Muller
Yanai Elazar
Benoît Sagot
Djamé Seddah
LRM
16
71
0
26 Jan 2021
Attention Can Reflect Syntactic Structure (If You Let It)
Vinit Ravishankar
Artur Kulmizev
Mostafa Abdou
Anders Søgaard
Joakim Nivre
13
32
0
26 Jan 2021
Meta-Learning for Effective Multi-task and Multilingual Modelling
Ishan Tarunesh
Sushil Khyalia
Vishwajeet Kumar
Ganesh Ramakrishnan
P. Jyothi
31
16
0
25 Jan 2021
Distilling Large Language Models into Tiny and Effective Students using pQRNN
P. Kaliamoorthi
Aditya Siddhant
Edward Li
Melvin Johnson
MQ
8
17
0
21 Jan 2021
Word Alignment by Fine-tuning Embeddings on Parallel Corpora
Zi-Yi Dou
Graham Neubig
90
256
0
20 Jan 2021
Coarse and Fine-Grained Hostility Detection in Hindi Posts using Fine Tuned Multilingual Embeddings
Arkadipta De
E Venkatesh
Kaushal Kumar Maurya
M. Desarkar
15
7
0
13 Jan 2021
A Closer Look at Few-Shot Crosslingual Transfer: The Choice of Shots Matters
Mengjie Zhao
Yi Zhu
Ehsan Shareghi
Ivan Vulić
Roi Reichart
Anna Korhonen
Hinrich Schütze
11
64
0
31 Dec 2020
ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora
Ouyang Xuan
Shuohuan Wang
Chao Pang
Yu Sun
Hao Tian
Hua-Hong Wu
Haifeng Wang
51
100
0
31 Dec 2020
How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models
Phillip Rust
Jonas Pfeiffer
Ivan Vulić
Sebastian Ruder
Iryna Gurevych
69
234
0
31 Dec 2020
UNKs Everywhere: Adapting Multilingual Language Models to New Scripts
Jonas Pfeiffer
Ivan Vulić
Iryna Gurevych
Sebastian Ruder
11
126
0
31 Dec 2020
Universal Sentence Representation Learning with Conditional Masked Language Model
Ziyi Yang
Yinfei Yang
Daniel Matthew Cer
Jax Law
Eric F. Darve
SSL
11
57
0
28 Dec 2020
ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic
Muhammad Abdul-Mageed
AbdelRahim Elmadany
El Moatez Billah Nagoudi
VLM
60
447
0
27 Dec 2020
Continual Lifelong Learning in Natural Language Processing: A Survey
Magdalena Biesialska
Katarzyna Biesialska
Marta R. Costa-jussá
KELM
CLL
16
214
0
17 Dec 2020
WILDS: A Benchmark of in-the-Wild Distribution Shifts
Pang Wei Koh
Shiori Sagawa
Henrik Marklund
Sang Michael Xie
Marvin Zhang
...
A. Kundaje
Emma Pierson
Sergey Levine
Chelsea Finn
Percy Liang
OOD
45
1,371
0
14 Dec 2020
Orthogonal Language and Task Adapters in Zero-Shot Cross-Lingual Transfer
M. Vidoni
Ivan Vulić
Goran Glavas
29
27
0
11 Dec 2020
ParsiNLU: A Suite of Language Understanding Challenges for Persian
Daniel Khashabi
Arman Cohan
Siamak Shakeri
Pedram Hosseini
Pouya Pezeshkpour
...
Niloofar Safi Samghabadi
Mahsa Shafaei
Saber Sheybani
Ali Tazarv
Yadollah Yaghoobzadeh
13
39
0
11 Dec 2020
Automatically Identifying Language Family from Acoustic Examples in Low Resource Scenarios
Peter Wu
Yifan Zhong
A. Black
13
3
0
01 Dec 2020
GLGE: A New General Language Generation Evaluation Benchmark
Dayiheng Liu
Yu Yan
Yeyun Gong
Weizhen Qi
Hang Zhang
...
Jiancheng Lv
Ruofei Zhang
Winnie Wu
Ming Zhou
Nan Duan
ELM
28
66
0
24 Nov 2020
Multilingual AMR-to-Text Generation
Angela Fan
Claire Gardent
4
32
0
10 Nov 2020
Low-Resource Adaptation of Neural NLP Models
Farhad Nooralahzadeh
17
0
0
09 Nov 2020
EXAMS: A Multi-Subject High School Examinations Dataset for Cross-Lingual and Multilingual Question Answering
Momchil Hardalov
Todor Mihaylov
Dimitrina Zlatkova
Yoan Dinkov
Ivan Koychev
Preslav Nakov
AI4Ed
ELM
23
50
0
05 Nov 2020
IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP
Fajri Koto
Afshin Rahimi
Jey Han Lau
Timothy Baldwin
17
254
0
02 Nov 2020
VECO: Variable and Flexible Cross-lingual Pre-training for Language Understanding and Generation
Fuli Luo
Wei Wang
Jiahao Liu
Yijia Liu
Bin Bi
Songfang Huang
Fei Huang
Luo Si
26
51
0
30 Oct 2020
RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark
Tatiana Shavrina
Alena Fenogenova
Anton A. Emelyanov
Denis Shevelev
Ekaterina Artemova
Valentin Malykh
Vladislav Mikhailov
Maria Tikhonova
Andrey Chertok
Andrey Evlampiev
VLM
ELM
17
81
0
29 Oct 2020
Learning Contextualised Cross-lingual Word Embeddings and Alignments for Extremely Low-Resource Languages Using Parallel Corpora
Takashi Wada
Tomoharu Iwata
Yuji Matsumoto
Timothy Baldwin
Jey Han Lau
25
6
0
27 Oct 2020
Language ID in the Wild: Unexpected Challenges on the Path to a Thousand-Language Web Text Corpus
Isaac Caswell
Theresa Breiner
D. Esch
Ankur Bapna
19
87
0
27 Oct 2020
Rethinking embedding coupling in pre-trained language models
Hyung Won Chung
Thibault Févry
Henry Tsai
Melvin Johnson
Sebastian Ruder
93
142
0
24 Oct 2020
Improving Multilingual Models with Language-Clustered Vocabularies
Hyung Won Chung
Dan Garrette
Kiat Chuan Tan
Jason Riesa
VLM
58
65
0
24 Oct 2020