Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2010.11934
Cited By
v1
v2
v3 (latest)
mT5: A massively multilingual pre-trained text-to-text transformer
22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (4 upvotes)
Papers citing
"mT5: A massively multilingual pre-trained text-to-text transformer"
50 / 1,563 papers shown
One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Alham Fikri Aji
Genta Indra Winata
Fajri Koto
Samuel Cahyawijaya
Ade Romadhony
...
David Moeljadi
Radityo Eko Prasojo
Timothy Baldwin
Jey Han Lau
Sebastian Ruder
226
130
0
24 Mar 2022
Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation
Interspeech (Interspeech), 2022
Ye Jia
Yifan Ding
Ankur Bapna
Colin Cherry
Yu Zhang
Alexis Conneau
Nobuyuki Morioka
232
24
0
24 Mar 2022
Ensembling and Knowledge Distilling of Large Sequence Taggers for Grammatical Error Correction
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
M. Tarnavskyi
Artem Chernodub
Kostiantyn Omelianchuk
3DV
147
27
0
24 Mar 2022
Probing for Labeled Dependency Trees
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Max Müller-Eberstein
Rob van der Goot
Barbara Plank
138
9
0
24 Mar 2022
Multilingual CheckList: Generation and Evaluation
Karthikeyan K
Shaily Bhatt
Pankaj Singh
Somak Aditya
Sandipan Dandapat
Sunayana Sitaram
Monojit Choudhary
ELM
311
2
0
24 Mar 2022
A Survey on Cross-Lingual Summarization
Transactions of the Association for Computational Linguistics (TACL), 2022
Jiaan Wang
Fandong Meng
Duo Zheng
Yunlong Liang
Zhixu Li
Jianfeng Qu
Jie Zhou
AILaw
169
74
0
23 Mar 2022
DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Zheng Li
Zijian Wang
Ming Tan
Ramesh Nallapati
Parminder Bhatia
Andrew O. Arnold
Bing Xiang
Dan Roth
MQ
169
46
0
21 Mar 2022
AraBART: a Pretrained Arabic Sequence-to-Sequence Model for Abstractive Summarization
Workshop on Arabic Natural Language Processing (WANLP), 2022
Moussa Kamal Eddine
Nadi Tomeh
Farah E. Shamout
Joseph Le Roux
Michalis Vazirgiannis
214
62
0
21 Mar 2022
Match the Script, Adapt if Multilingual: Analyzing the Effect of Multilingual Pretraining on Cross-lingual Transferability
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Yoshinari Fujinuma
Jordan L. Boyd-Graber
Katharina Kann
AAML
258
29
0
21 Mar 2022
On Robust Prefix-Tuning for Text Classification
International Conference on Learning Representations (ICLR), 2022
Zonghan Yang
Yang Liu
VLM
182
23
0
19 Mar 2022
Pretraining with Artificial Language: Studying Transferable Knowledge in Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Ryokan Ri
Yoshimasa Tsuruoka
226
33
0
19 Mar 2022
Meta-X
N
L
G
_{NLG}
N
L
G
: A Meta-Learning Approach Based on Language Clustering for Zero-Shot Cross-Lingual Transfer and Generation
Findings (Findings), 2022
Kaushal Kumar Maurya
M. Desarkar
231
9
0
19 Mar 2022
Challenges and Strategies in Cross-Cultural NLP
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Daniel Hershcovich
Stella Frank
Heather Lent
Miryam de Lhoneux
Mostafa Abdou
...
Ruixiang Cui
Constanza Fierro
Katerina Margatina
Phillip Rust
Anders Søgaard
343
232
0
18 Mar 2022
Towards Lithuanian grammatical error correction
Lukas Stankevivcius
Mantas Lukovsevivcius
3DV
134
5
0
18 Mar 2022
Coloring the Blank Slate: Pre-training Imparts a Hierarchical Inductive Bias to Sequence-to-sequence Models
Findings (Findings), 2022
Aaron Mueller
Robert Frank
Tal Linzen
Luheng Wang
Sebastian Schuster
AIMat
225
36
0
17 Mar 2022
Pre-Trained Multilingual Sequence-to-Sequence Models: A Hope for Low-Resource Language Translation?
Findings (Findings), 2022
E. Lee
Sarubi Thillainathan
Shravan Nayak
Surangika Ranathunga
David Ifeoluwa Adelani
Ruisi Su
Arya D. McCarthy
VLM
354
51
0
16 Mar 2022
MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages
Findings (Findings), 2022
Zhiruo Wang
Grace Cuenca
Shuyan Zhou
Frank F. Xu
Graham Neubig
223
60
0
16 Mar 2022
Multilingual Generative Language Models for Zero-Shot Cross-Lingual Event Argument Extraction
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Kuan-Hao Huang
I-Hung Hsu
Premkumar Natarajan
Kai-Wei Chang
Nanyun Peng
164
76
0
15 Mar 2022
Improving Word Translation via Two-Stage Contrastive Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Yaoyiran Li
Fangyu Liu
Nigel Collier
Anna Korhonen
Ivan Vulić
333
31
0
15 Mar 2022
Does Corpus Quality Really Matter for Low-Resource Languages?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Mikel Artetxe
Itziar Aldabe
Rodrigo Agerri
Olatz Perez-de-Viñaspre
Aitor Soroa Etxabe
227
21
0
15 Mar 2022
ViWOZ: A Multi-Domain Task-Oriented Dialogue Systems Dataset For Low-resource Language
Phi Nguyen Van
Tung Cao Hoang
Dũng Nguyễn Mạnh
Q. Minh
Long Tran Quoc
150
4
0
15 Mar 2022
Can Synthetic Translations Improve Bitext Quality?
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Eleftheria Briakou
Marine Carpuat
144
6
0
15 Mar 2022
VAST: The Valence-Assessing Semantics Test for Contextualizing Language Models
AAAI Conference on Artificial Intelligence (AAAI), 2022
Robert Wolfe
Aylin Caliskan
113
15
0
14 Mar 2022
CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual Entailment
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Haoyu Song
Li Dong
Weinan Zhang
Ting Liu
Furu Wei
VLM
CLIP
218
158
0
14 Mar 2022
Active Evaluation: Efficient NLG Evaluation with Few Pairwise Comparisons
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Akash Kumar Mohankumar
Mitesh M. Khapra
ELM
AAML
209
8
0
11 Mar 2022
IndicNLG Benchmark: Multilingual Datasets for Diverse NLG Tasks in Indic Languages
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Aman Kumar
Himani Shrotriya
P. Sahu
Mary Dabre
Ratish Puduppully
Anoop Kunchukuttan
Amogh Mishra
Mitesh M. Khapra
Pratyush Kumar
263
51
0
10 Mar 2022
IT5: Text-to-text Pretraining for Italian Language Understanding and Generation
International Conference on Language Resources and Evaluation (LREC), 2022
Gabriele Sarti
Malvina Nissim
AILaw
255
51
0
07 Mar 2022
Mukayese: Turkish NLP Strikes Back
Findings (Findings), 2022
Ali Safaya
Emirhan Kurtulucs
Arda Goktougan
Deniz Yuret
233
28
0
02 Mar 2022
SemSup: Semantic Supervision for Simple and Scalable Zero-shot Generalization
Austin W. Hanjie
Ameet Deshpande
Karthik Narasimhan
VLM
323
2
0
26 Feb 2022
Morphology Without Borders: Clause-Level Morphology
Transactions of the Association for Computational Linguistics (TACL), 2022
Omer Goldman
Reut Tsarfaty
AILaw
167
3
0
25 Feb 2022
Using natural language prompts for machine translation
Xavier Garcia
Orhan Firat
AI4CE
221
38
0
23 Feb 2022
A New Generation of Perspective API: Efficient Multilingual Character-level Transformers
Knowledge Discovery and Data Mining (KDD), 2022
Alyssa Lees
Vinh Q. Tran
Yi Tay
Jeffrey Scott Sorensen
Jai Gupta
Donald Metzler
Lucy Vasserman
226
255
0
22 Feb 2022
CALCS 2021 Shared Task: Machine Translation for Code-Switched Data
Shuguang Chen
Gustavo Aguilar
A. Srinivasan
Mona T. Diab
Thamar Solorio
181
17
0
19 Feb 2022
ST-MoE: Designing Stable and Transferable Sparse Expert Models
Barret Zoph
Irwan Bello
Sameer Kumar
Nan Du
Yanping Huang
J. Dean
Noam M. Shazeer
W. Fedus
MoE
422
298
0
17 Feb 2022
Sequence-to-Sequence Resources for Catalan
Ona de Gibert
Ksenia Kharitonova
B. Figueras
Jordi Armengol-Estapé
Maite Melero
62
0
0
14 Feb 2022
Integrating question answering and text-to-SQL in Portuguese
International Conference on Computational Processing of the Portuguese Language (PROPOR), 2022
M. M. José
M. A. José
Denis Deratani Mauá
Fabio Gagliardi Cozman
LMTD
171
4
0
08 Feb 2022
Cedille: A large autoregressive French language model
Martin Müller
Florian Laurent
197
23
0
07 Feb 2022
mSLAM: Massively multilingual joint pre-training for speech and text
Ankur Bapna
Colin Cherry
Yu Zhang
Ye Jia
Melvin Johnson
Yong Cheng
Simran Khanuja
Jason Riesa
Alexis Conneau
VLM
175
122
0
03 Feb 2022
Examining Scaling and Transfer of Language Model Architectures for Machine Translation
International Conference on Machine Learning (ICML), 2022
Biao Zhang
Behrooz Ghorbani
Ankur Bapna
Yong Cheng
Xavier Garcia
Jonathan Shen
Orhan Firat
277
29
0
01 Feb 2022
XAlign: Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages
The Web Conference (WWW), 2022
Tushar Abhishek
Shivprasad Sagare
Bhavyajeet Singh
Anubhav Sharma
Manish Gupta
Vasudeva Varma
166
10
0
01 Feb 2022
Cross-Lingual Dialogue Dataset Creation via Outline-Based Generation
Transactions of the Association for Computational Linguistics (TACL), 2022
Olga Majewska
E. Razumovskaia
Edoardo Ponti
Ivan Vulić
Anna Korhonen
262
30
0
31 Jan 2022
Correcting diacritics and typos with a ByT5 transformer model
Applied Sciences (Appl. Sci.), 2022
Lukas Stankevicius
M. Lukoševičius
J. Kapočiūtė-Dzikienė
Monika Briediene
Tomas Krilavičius
194
24
0
31 Jan 2022
Schema-Free Dependency Parsing via Sequence Generation
Boda Lin
Zijun Yao
Jiaxin Shi
S. Cao
Binghao Tang
Si Li
Yong Luo
Juanzi Li
Lei Hou
138
0
0
28 Jan 2022
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
International Conference on Language Resources and Evaluation (LREC), 2022
Julien Abadji
Pedro Ortiz Suarez
Laurent Romary
Benoît Sagot
CLL
206
193
0
17 Jan 2022
A Warm Start and a Clean Crawled Corpus -- A Recipe for Good Language Models
International Conference on Language Resources and Evaluation (LREC), 2022
Vésteinn Snæbjarnarson
Haukur Barri Símonarson
Pétur Orri Ragnarsson
Svanhvít Lilja Ingólfsdóttir
H. Jónsson
Vilhjálmur Þorsteinsson
H. Einarsson
273
31
0
14 Jan 2022
Pretrained Language Models for Text Generation: A Survey
ACM Computing Surveys (ACM CSUR), 2022
Junyi Li
Tianyi Tang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
AI4CE
519
263
0
14 Jan 2022
CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark
Yuan Yao
Qingxiu Dong
Jian Guan
Boxi Cao
Zhengyan Zhang
...
Zhiyuan Liu
Xianpei Han
Erhong Yang
Zhifang Sui
Maosong Sun
ALM
ELM
226
22
0
27 Dec 2021
CABACE: Injecting Character Sequence Information and Domain Knowledge for Enhanced Acronym and Long-Form Extraction
Nithish Kannen
Divyanshu Sheth
Abhranil Chandra
Shubhraneel Pal
144
1
0
25 Dec 2021
Few-shot Learning with Multilingual Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Xi Lin
Todor Mihaylov
Mikel Artetxe
Tianlu Wang
Shuohui Chen
...
Luke Zettlemoyer
Zornitsa Kozareva
Mona T. Diab
Ves Stoyanov
Xian Li
BDL
ELM
LRM
359
355
0
20 Dec 2021
CrossSum: Beyond English-Centric Cross-Lingual Summarization for 1,500+ Language Pairs
Abhik Bhattacharjee
Tahmid Hasan
Wasi Uddin Ahmad
Yuan-Fang Li
Yong-Bin Kang
Rifat Shahriyar
RALM
ELM
184
49
0
16 Dec 2021
Previous
1
2
3
...
27
28
29
30
31
32
Next