ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.11934
  4. Cited By
mT5: A massively multilingual pre-trained text-to-text transformer
v1v2v3 (latest)

mT5: A massively multilingual pre-trained text-to-text transformer

22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel
ArXiv (abs)PDFHTMLHuggingFace (4 upvotes)

Papers citing "mT5: A massively multilingual pre-trained text-to-text transformer"

50 / 1,563 papers shown
One Country, 700+ Languages: NLP Challenges for Underrepresented
  Languages and Dialects in Indonesia
One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in IndonesiaAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Alham Fikri Aji
Genta Indra Winata
Fajri Koto
Samuel Cahyawijaya
Ade Romadhony
...
David Moeljadi
Radityo Eko Prasojo
Timothy Baldwin
Jey Han Lau
Sebastian Ruder
226
130
0
24 Mar 2022
Leveraging unsupervised and weakly-supervised data to improve direct
  speech-to-speech translation
Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translationInterspeech (Interspeech), 2022
Ye Jia
Yifan Ding
Ankur Bapna
Colin Cherry
Yu Zhang
Alexis Conneau
Nobuyuki Morioka
232
24
0
24 Mar 2022
Ensembling and Knowledge Distilling of Large Sequence Taggers for
  Grammatical Error Correction
Ensembling and Knowledge Distilling of Large Sequence Taggers for Grammatical Error CorrectionAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
M. Tarnavskyi
Artem Chernodub
Kostiantyn Omelianchuk
3DV
147
27
0
24 Mar 2022
Probing for Labeled Dependency Trees
Probing for Labeled Dependency TreesAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Max Müller-Eberstein
Rob van der Goot
Barbara Plank
138
9
0
24 Mar 2022
Multilingual CheckList: Generation and Evaluation
Multilingual CheckList: Generation and Evaluation
Karthikeyan K
Shaily Bhatt
Pankaj Singh
Somak Aditya
Sandipan Dandapat
Sunayana Sitaram
Monojit Choudhary
ELM
311
2
0
24 Mar 2022
A Survey on Cross-Lingual Summarization
A Survey on Cross-Lingual SummarizationTransactions of the Association for Computational Linguistics (TACL), 2022
Jiaan Wang
Fandong Meng
Duo Zheng
Yunlong Liang
Zhixu Li
Jianfeng Qu
Jie Zhou
AILaw
169
74
0
23 Mar 2022
DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and
  Quantization
DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and QuantizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Zheng Li
Zijian Wang
Ming Tan
Ramesh Nallapati
Parminder Bhatia
Andrew O. Arnold
Bing Xiang
Dan Roth
MQ
169
46
0
21 Mar 2022
AraBART: a Pretrained Arabic Sequence-to-Sequence Model for Abstractive
  Summarization
AraBART: a Pretrained Arabic Sequence-to-Sequence Model for Abstractive SummarizationWorkshop on Arabic Natural Language Processing (WANLP), 2022
Moussa Kamal Eddine
Nadi Tomeh
Farah E. Shamout
Joseph Le Roux
Michalis Vazirgiannis
214
62
0
21 Mar 2022
Match the Script, Adapt if Multilingual: Analyzing the Effect of
  Multilingual Pretraining on Cross-lingual Transferability
Match the Script, Adapt if Multilingual: Analyzing the Effect of Multilingual Pretraining on Cross-lingual TransferabilityAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Yoshinari Fujinuma
Jordan L. Boyd-Graber
Katharina Kann
AAML
258
29
0
21 Mar 2022
On Robust Prefix-Tuning for Text Classification
On Robust Prefix-Tuning for Text ClassificationInternational Conference on Learning Representations (ICLR), 2022
Zonghan Yang
Yang Liu
VLM
182
23
0
19 Mar 2022
Pretraining with Artificial Language: Studying Transferable Knowledge in
  Language Models
Pretraining with Artificial Language: Studying Transferable Knowledge in Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Ryokan Ri
Yoshimasa Tsuruoka
226
33
0
19 Mar 2022
Meta-X$_{NLG}$: A Meta-Learning Approach Based on Language Clustering
  for Zero-Shot Cross-Lingual Transfer and Generation
Meta-XNLG_{NLG}NLG​: A Meta-Learning Approach Based on Language Clustering for Zero-Shot Cross-Lingual Transfer and GenerationFindings (Findings), 2022
Kaushal Kumar Maurya
M. Desarkar
231
9
0
19 Mar 2022
Challenges and Strategies in Cross-Cultural NLP
Challenges and Strategies in Cross-Cultural NLPAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Daniel Hershcovich
Stella Frank
Heather Lent
Miryam de Lhoneux
Mostafa Abdou
...
Ruixiang Cui
Constanza Fierro
Katerina Margatina
Phillip Rust
Anders Søgaard
343
232
0
18 Mar 2022
Towards Lithuanian grammatical error correction
Towards Lithuanian grammatical error correction
Lukas Stankevivcius
Mantas Lukovsevivcius
3DV
134
5
0
18 Mar 2022
Coloring the Blank Slate: Pre-training Imparts a Hierarchical Inductive
  Bias to Sequence-to-sequence Models
Coloring the Blank Slate: Pre-training Imparts a Hierarchical Inductive Bias to Sequence-to-sequence ModelsFindings (Findings), 2022
Aaron Mueller
Robert Frank
Tal Linzen
Luheng Wang
Sebastian Schuster
AIMat
225
36
0
17 Mar 2022
Pre-Trained Multilingual Sequence-to-Sequence Models: A Hope for
  Low-Resource Language Translation?
Pre-Trained Multilingual Sequence-to-Sequence Models: A Hope for Low-Resource Language Translation?Findings (Findings), 2022
E. Lee
Sarubi Thillainathan
Shravan Nayak
Surangika Ranathunga
David Ifeoluwa Adelani
Ruisi Su
Arya D. McCarthy
VLM
354
51
0
16 Mar 2022
MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages
MCoNaLa: A Benchmark for Code Generation from Multiple Natural LanguagesFindings (Findings), 2022
Zhiruo Wang
Grace Cuenca
Shuyan Zhou
Frank F. Xu
Graham Neubig
223
60
0
16 Mar 2022
Multilingual Generative Language Models for Zero-Shot Cross-Lingual
  Event Argument Extraction
Multilingual Generative Language Models for Zero-Shot Cross-Lingual Event Argument ExtractionAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Kuan-Hao Huang
I-Hung Hsu
Premkumar Natarajan
Kai-Wei Chang
Nanyun Peng
164
76
0
15 Mar 2022
Improving Word Translation via Two-Stage Contrastive Learning
Improving Word Translation via Two-Stage Contrastive LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Yaoyiran Li
Fangyu Liu
Nigel Collier
Anna Korhonen
Ivan Vulić
333
31
0
15 Mar 2022
Does Corpus Quality Really Matter for Low-Resource Languages?
Does Corpus Quality Really Matter for Low-Resource Languages?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Mikel Artetxe
Itziar Aldabe
Rodrigo Agerri
Olatz Perez-de-Viñaspre
Aitor Soroa Etxabe
227
21
0
15 Mar 2022
ViWOZ: A Multi-Domain Task-Oriented Dialogue Systems Dataset For
  Low-resource Language
ViWOZ: A Multi-Domain Task-Oriented Dialogue Systems Dataset For Low-resource Language
Phi Nguyen Van
Tung Cao Hoang
Dũng Nguyễn Mạnh
Q. Minh
Long Tran Quoc
150
4
0
15 Mar 2022
Can Synthetic Translations Improve Bitext Quality?
Can Synthetic Translations Improve Bitext Quality?Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Eleftheria Briakou
Marine Carpuat
144
6
0
15 Mar 2022
VAST: The Valence-Assessing Semantics Test for Contextualizing Language
  Models
VAST: The Valence-Assessing Semantics Test for Contextualizing Language ModelsAAAI Conference on Artificial Intelligence (AAAI), 2022
Robert Wolfe
Aylin Caliskan
113
15
0
14 Mar 2022
CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual
  Entailment
CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual EntailmentAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Haoyu Song
Li Dong
Weinan Zhang
Ting Liu
Furu Wei
VLMCLIP
218
158
0
14 Mar 2022
Active Evaluation: Efficient NLG Evaluation with Few Pairwise
  Comparisons
Active Evaluation: Efficient NLG Evaluation with Few Pairwise ComparisonsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Akash Kumar Mohankumar
Mitesh M. Khapra
ELMAAML
209
8
0
11 Mar 2022
IndicNLG Benchmark: Multilingual Datasets for Diverse NLG Tasks in Indic
  Languages
IndicNLG Benchmark: Multilingual Datasets for Diverse NLG Tasks in Indic LanguagesConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Aman Kumar
Himani Shrotriya
P. Sahu
Mary Dabre
Ratish Puduppully
Anoop Kunchukuttan
Amogh Mishra
Mitesh M. Khapra
Pratyush Kumar
263
51
0
10 Mar 2022
IT5: Text-to-text Pretraining for Italian Language Understanding and
  Generation
IT5: Text-to-text Pretraining for Italian Language Understanding and GenerationInternational Conference on Language Resources and Evaluation (LREC), 2022
Gabriele Sarti
Malvina Nissim
AILaw
255
51
0
07 Mar 2022
Mukayese: Turkish NLP Strikes Back
Mukayese: Turkish NLP Strikes BackFindings (Findings), 2022
Ali Safaya
Emirhan Kurtulucs
Arda Goktougan
Deniz Yuret
233
28
0
02 Mar 2022
SemSup: Semantic Supervision for Simple and Scalable Zero-shot
  Generalization
SemSup: Semantic Supervision for Simple and Scalable Zero-shot Generalization
Austin W. Hanjie
Ameet Deshpande
Karthik Narasimhan
VLM
323
2
0
26 Feb 2022
Morphology Without Borders: Clause-Level Morphology
Morphology Without Borders: Clause-Level MorphologyTransactions of the Association for Computational Linguistics (TACL), 2022
Omer Goldman
Reut Tsarfaty
AILaw
167
3
0
25 Feb 2022
Using natural language prompts for machine translation
Using natural language prompts for machine translation
Xavier Garcia
Orhan Firat
AI4CE
221
38
0
23 Feb 2022
A New Generation of Perspective API: Efficient Multilingual
  Character-level Transformers
A New Generation of Perspective API: Efficient Multilingual Character-level TransformersKnowledge Discovery and Data Mining (KDD), 2022
Alyssa Lees
Vinh Q. Tran
Yi Tay
Jeffrey Scott Sorensen
Jai Gupta
Donald Metzler
Lucy Vasserman
226
255
0
22 Feb 2022
CALCS 2021 Shared Task: Machine Translation for Code-Switched Data
CALCS 2021 Shared Task: Machine Translation for Code-Switched Data
Shuguang Chen
Gustavo Aguilar
A. Srinivasan
Mona T. Diab
Thamar Solorio
181
17
0
19 Feb 2022
ST-MoE: Designing Stable and Transferable Sparse Expert Models
ST-MoE: Designing Stable and Transferable Sparse Expert Models
Barret Zoph
Irwan Bello
Sameer Kumar
Nan Du
Yanping Huang
J. Dean
Noam M. Shazeer
W. Fedus
MoE
422
298
0
17 Feb 2022
Sequence-to-Sequence Resources for Catalan
Sequence-to-Sequence Resources for Catalan
Ona de Gibert
Ksenia Kharitonova
B. Figueras
Jordi Armengol-Estapé
Maite Melero
62
0
0
14 Feb 2022
Integrating question answering and text-to-SQL in Portuguese
Integrating question answering and text-to-SQL in PortugueseInternational Conference on Computational Processing of the Portuguese Language (PROPOR), 2022
M. M. José
M. A. José
Denis Deratani Mauá
Fabio Gagliardi Cozman
LMTD
171
4
0
08 Feb 2022
Cedille: A large autoregressive French language model
Cedille: A large autoregressive French language model
Martin Müller
Florian Laurent
197
23
0
07 Feb 2022
mSLAM: Massively multilingual joint pre-training for speech and text
mSLAM: Massively multilingual joint pre-training for speech and text
Ankur Bapna
Colin Cherry
Yu Zhang
Ye Jia
Melvin Johnson
Yong Cheng
Simran Khanuja
Jason Riesa
Alexis Conneau
VLM
175
122
0
03 Feb 2022
Examining Scaling and Transfer of Language Model Architectures for
  Machine Translation
Examining Scaling and Transfer of Language Model Architectures for Machine TranslationInternational Conference on Machine Learning (ICML), 2022
Biao Zhang
Behrooz Ghorbani
Ankur Bapna
Yong Cheng
Xavier Garcia
Jonathan Shen
Orhan Firat
277
29
0
01 Feb 2022
XAlign: Cross-lingual Fact-to-Text Alignment and Generation for
  Low-Resource Languages
XAlign: Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource LanguagesThe Web Conference (WWW), 2022
Tushar Abhishek
Shivprasad Sagare
Bhavyajeet Singh
Anubhav Sharma
Manish Gupta
Vasudeva Varma
166
10
0
01 Feb 2022
Cross-Lingual Dialogue Dataset Creation via Outline-Based Generation
Cross-Lingual Dialogue Dataset Creation via Outline-Based GenerationTransactions of the Association for Computational Linguistics (TACL), 2022
Olga Majewska
E. Razumovskaia
Edoardo Ponti
Ivan Vulić
Anna Korhonen
262
30
0
31 Jan 2022
Correcting diacritics and typos with a ByT5 transformer model
Correcting diacritics and typos with a ByT5 transformer modelApplied Sciences (Appl. Sci.), 2022
Lukas Stankevicius
M. Lukoševičius
J. Kapočiūtė-Dzikienė
Monika Briediene
Tomas Krilavičius
194
24
0
31 Jan 2022
Schema-Free Dependency Parsing via Sequence Generation
Schema-Free Dependency Parsing via Sequence Generation
Boda Lin
Zijun Yao
Jiaxin Shi
S. Cao
Binghao Tang
Si Li
Yong Luo
Juanzi Li
Lei Hou
138
0
0
28 Jan 2022
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
Towards a Cleaner Document-Oriented Multilingual Crawled CorpusInternational Conference on Language Resources and Evaluation (LREC), 2022
Julien Abadji
Pedro Ortiz Suarez
Laurent Romary
Benoît Sagot
CLL
206
193
0
17 Jan 2022
A Warm Start and a Clean Crawled Corpus -- A Recipe for Good Language
  Models
A Warm Start and a Clean Crawled Corpus -- A Recipe for Good Language ModelsInternational Conference on Language Resources and Evaluation (LREC), 2022
Vésteinn Snæbjarnarson
Haukur Barri Símonarson
Pétur Orri Ragnarsson
Svanhvít Lilja Ingólfsdóttir
H. Jónsson
Vilhjálmur Þorsteinsson
H. Einarsson
273
31
0
14 Jan 2022
Pretrained Language Models for Text Generation: A Survey
Pretrained Language Models for Text Generation: A SurveyACM Computing Surveys (ACM CSUR), 2022
Junyi Li
Tianyi Tang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
AI4CE
519
263
0
14 Jan 2022
CUGE: A Chinese Language Understanding and Generation Evaluation
  Benchmark
CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark
Yuan Yao
Qingxiu Dong
Jian Guan
Boxi Cao
Zhengyan Zhang
...
Zhiyuan Liu
Xianpei Han
Erhong Yang
Zhifang Sui
Maosong Sun
ALMELM
226
22
0
27 Dec 2021
CABACE: Injecting Character Sequence Information and Domain Knowledge
  for Enhanced Acronym and Long-Form Extraction
CABACE: Injecting Character Sequence Information and Domain Knowledge for Enhanced Acronym and Long-Form Extraction
Nithish Kannen
Divyanshu Sheth
Abhranil Chandra
Shubhraneel Pal
144
1
0
25 Dec 2021
Few-shot Learning with Multilingual Language Models
Few-shot Learning with Multilingual Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Xi Lin
Todor Mihaylov
Mikel Artetxe
Tianlu Wang
Shuohui Chen
...
Luke Zettlemoyer
Zornitsa Kozareva
Mona T. Diab
Ves Stoyanov
Xian Li
BDLELMLRM
359
355
0
20 Dec 2021
CrossSum: Beyond English-Centric Cross-Lingual Summarization for 1,500+
  Language Pairs
CrossSum: Beyond English-Centric Cross-Lingual Summarization for 1,500+ Language Pairs
Abhik Bhattacharjee
Tahmid Hasan
Wasi Uddin Ahmad
Yuan-Fang Li
Yong-Bin Kang
Rifat Shahriyar
RALMELM
184
49
0
16 Dec 2021
Previous
123...272829303132
Next