mT5: A massively multilingual pre-trained text-to-text transformer

22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel

Papers citing "mT5: A massively multilingual pre-trained text-to-text transformer"

50 / 1,563 papers shown
Evaluation of Transfer Learning for Polish with a Text-to-Text Model
International Conference on Language Resources and Evaluation (LREC), 2022
Aleksandra Chrabrowa
Lukasz Dragan
Karol Grzegorczyk
D. Kajtoch
Mikołaj Koszowski
Robert Mroczkowski
Piotr Rybak
190
21
0
18 May 2022
OneAligner: Zero-shot Cross-lingual Transfer with One Rich-Resource Language Pair for Low-Resource Sentence Retrieval
Findings (Findings), 2022
Tong Niu
Kazuma Hashimoto
Yingbo Zhou
Caiming Xiong
VLM
129
7
0
17 May 2022
Controlling Translation Formality Using Pre-trained Multilingual Language Models
International Workshop on Spoken Language Translation (IWSLT), 2022
Elijah Matthew Rippeth
Sweta Agrawal
Marine Carpuat
AI4CE
227
20
0
13 May 2022
ViT5: Pretrained Text-to-Text Transformer for Vietnamese Language Generation
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Long Phan
H. Tran
Hieu Duy Nguyen
Trieu H. Trinh
ViT
303
86
0
13 May 2022
Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages
Kabir Ahuja
Sandipan Dandapat
Sunayana Sitaram
Monojit Choudhury
LRM
212
19
0
12 May 2022
On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Kabir Ahuja
Monojit Choudhury
Sandipan Dandapat
150
3
0
12 May 2022
UL2: Unifying Language Learning Paradigms
International Conference on Learning Representations (ICLR), 2022
Yi Tay
Mostafa Dehghani
Vinh Q. Tran
Xavier Garcia
Jason W. Wei
...
Tal Schuster
H. Zheng
Denny Zhou
N. Houlsby
Donald Metzler
AI4CE
570
359
0
10 May 2022
Enhancing Cross-lingual Transfer by Manifold Mixup
International Conference on Learning Representations (ICLR), 2022
Huiyun Yang
Huadong Chen
Hao Zhou
Lei Li
AAML
168
46
0
09 May 2022
Building Machine Translation Systems for the Next Thousand Languages
Ankur Bapna
Isaac Caswell
Julia Kreutzer
Orhan Firat
D. Esch
...
Apurva Shah
Yanping Huang
Zhiwen Chen
Yonghui Wu
Macduff Hughes
325
110
0
09 May 2022
Same Neurons, Different Languages: Probing Morphosyntax in Multilingual Pre-trained Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Karolina Stańczak
Edoardo Ponti
Lucas Torroba Hennigen
Robert Bamler
Isabelle Augenstein
MILM
LRM
462
11
0
04 May 2022
A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
David Ifeoluwa Adelani
Jesujoba Oluwadara Alabi
Angela Fan
Julia Kreutzer
Xiaoyu Shen
...
Ayodele Awokoya
Happy Buzaaba
Blessing K. Sibanda
Andiswa Bukula
Sam Manthalu
446
131
0
04 May 2022
State-of-the-art in Open-domain Conversational AI: A Survey
Tosin Adewumi
F. Liwicki
Marcus Liwicki
313
18
0
02 May 2022
Probing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2022
Ivan Vulić
Goran Glavaš
Fangyu Liu
Nigel Collier
Edoardo Ponti
Anna Korhonen
269
10
0
30 Apr 2022
How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training?
Conference of the Association for Machine Translation in the Americas (AMTA), 2022
Shiyue Zhang
Vishrav Chaudhary
Naman Goyal
James Cross
Guillaume Wenzek
Joey Tianyi Zhou
Francisco Guzman
224
22
0
29 Apr 2022
Polyglot Prompt: Multilingual Multitask PrompTraining
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Jinlan Fu
See-Kiong Ng
Pengfei Liu
188
13
0
29 Apr 2022
Por Qué Não Utiliser Alla Språk? Mixed Training with Gradient Optimization in Few-Shot Cross-Lingual Transfer
Haoran Xu
Kenton W. Murray
182
12
0
29 Apr 2022
A Comprehensive Understanding of Code-mixed Language Semantics using Hierarchical Transformer
IEEE Transactions on Computational Social Systems (IEEE TCSS), 2022
Ayan Sengupta
Tharun Suresh
Md. Shad Akhtar
Tanmoy Chakraborty
184
13
0
27 Apr 2022
WikiMulti: a Corpus for Cross-Lingual Summarization
Pavel Tikhonov
Valentin Malykh
102
4
0
23 Apr 2022
Tweets2Stance: Users stance detection exploiting Zero-Shot Learning Algorithms on Tweets
Margherita Gambini
T. Fagni
C. Senette
Maurizio Tesconi
139
3
0
22 Apr 2022
SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding
International Workshop on Semantic Evaluation (SemEval), 2022
Harish Tayyar Madabushi
Edward Gow-Smith
Marcos García
Carolina Scarton
M. Idiart
Aline Villavicencio
233
65
0
21 Apr 2022
Towards Arabic Sentence Simplification via Classification and Generative Approaches
Workshop on Arabic Natural Language Processing (WANLP), 2022
Nouran Khallaf
S. Sharoff
119
7
0
20 Apr 2022
On the Representation Collapse of Sparse Mixture of Experts
Neural Information Processing Systems (NeurIPS), 2022
Zewen Chi
Li Dong
Shaohan Huang
Damai Dai
Shuming Ma
...
Payal Bajaj
Xia Song
Xian-Ling Mao
Heyan Huang
Furu Wei
MoMe
MoE
310
136
0
20 Apr 2022
ALBETO and DistilBETO: Lightweight Spanish Language Models
International Conference on Language Resources and Evaluation (LREC), 2022
J. Canete
S. Donoso
Felipe Bravo-Marquez
Andrés Carvallo
Vladimir Araujo
200
25
0
19 Apr 2022
A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and Datasets
Wei Chen
Zhiwei Li
Hongyi Fang
Qian-Qian Yao
Cheng Zhong
Jianye Hao
Tao Gui
Xuanjing Huang
J. Peng
Zhongyu Wei
227
77
0
19 Apr 2022
Detecting Text Formality: A Study of Text Classification Approaches
Recent Advances in Natural Language Processing (RANLP), 2022
Daryna Dementieva
Ivan Trifinov
Sergey Petrakov
219
13
0
19 Apr 2022
IndicXNLI: Evaluating Multilingual Inference for Indian Languages
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Divyanshu Aggarwal
V. Gupta
Anoop Kunchukuttan
172
35
0
19 Apr 2022
MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jack G. M. FitzGerald
C. Hench
Charith Peris
Scott Mackie
Kay Rottmann
...
Laurie Crist
Misha Britan
Wouter Leeuwis
Gokhan Tur
Premkumar Natarajan
245
171
0
18 Apr 2022
GL-CLeF: A Global-Local Contrastive Learning Framework for Cross-lingual Spoken Language Understanding
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Libo Qin
Qiguang Chen
Tianbao Xie
Qixin Li
Jian-Guang Lou
Wanxiang Che
MingSung Kan
174
36
0
18 Apr 2022
AfriWOZ: Corpus for Exploiting Cross-Lingual Transferability for Generation of Dialogues in Low-Resource, African Languages
Tosin Adewumi
Mofetoluwa Adeyemi
Aremu Anuoluwapo
Bukola Peters
Happy Buzaaba
...
Phylis Ngigi
Orevaoghene Ahia
Ruqayya Nasir
F. Liwicki
Marcus Liwicki
166
2
0
17 Apr 2022
Bridging Cross-Lingual Gaps During Leveraging the Multilingual Sequence-to-Sequence Pretraining for Text Generation and Understanding
Changtong Zan
Liang Ding
Li Shen
Yu Cao
Weifeng Liu
Dacheng Tao
LRM
189
8
0
16 Apr 2022
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yizhong Wang
Swaroop Mishra
Pegah Alipoormolabashi
Yeganeh Kordi
Amirreza Mirzaei
...
Chitta Baral
Yejin Choi
Noah A. Smith
Hannaneh Hajishirzi
Daniel Khashabi
ELM
644
1,016
0
16 Apr 2022
WordAlchemy: A transformer-based Reverse Dictionary
S. Mane
Harshali B. Patil
Kanhaiya Madaswar
Pranav Sadavarte
210
6
0
16 Apr 2022
Chinese Idiom Paraphrasing
Transactions of the Association for Computational Linguistics (TACL), 2022
Jipeng Qiang
Yang Li
Chaowei Zhang
Yun Li
Yunhao Yuan
Yi Zhu
Xin Wu
165
10
0
15 Apr 2022
Summarization with Graphical Elements
Maartje ter Hoeve
Julia Kiseleva
Maarten de Rijke
263
2
0
15 Apr 2022
mGPT: Few-Shot Learners Go Multilingual
Transactions of the Association for Computational Linguistics (TACL), 2022
Oleh Shliazhko
Alena Fenogenova
Maria Tikhonova
Vladislav Mikhailov
Anastasia Kozlova
Tatiana Shavrina
364
191
0
15 Apr 2022
GPT-NeoX-20B: An Open-Source Autoregressive Language Model
Sid Black
Stella Biderman
Eric Hallahan
Quentin G. Anthony
Leo Gao
...
Shivanshu Purohit
Laria Reynolds
J. Tow
Benqi Wang
Samuel Weinbach
395
955
0
14 Apr 2022
Adapting Pre-trained Language Models to African Languages via Multilingual Adaptive Fine-Tuning
International Conference on Computational Linguistics (COLING), 2022
Jesujoba Oluwadara Alabi
David Ifeoluwa Adelani
Marius Mosbach
Dietrich Klakow
259
180
0
13 Apr 2022
End-to-End Speech Translation for Code Switched Speech
Findings (Findings), 2022
Orion Weller
Matthias Sperber
Telmo Pires
Hendra Setiawan
Christian Gollan
Dominic Telaar
Matthias Paulik
234
35
0
11 Apr 2022
Assessment of Massively Multilingual Sentiment Classifiers
Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), 2022
Krzysztof Rajda
Lukasz Augustyniak
Piotr Gramacki
Marcin Gruza
Szymon Woźniak
Tomasz Kajdanowicz
211
7
0
11 Apr 2022
A Survey on Legal Judgment Prediction: Datasets, Metrics, Models and Challenges
IEEE Access (IEEE Access), 2022
Junyun Cui
Xiaoyu Shen
Feiping Nie
Liang Luo
Jinglong Wang
Yulong Chen
AILaw
ELM
161
99
0
11 Apr 2022
MMTAfrica: Multilingual Machine Translation for African Languages
Conference on Machine Translation (WMT), 2022
Chris C. Emezue
Bonaventure F. P. Dossou
134
25
0
08 Apr 2022
MAESTRO: Matched Speech Text Representations through Modality Matching
Interspeech (Interspeech), 2022
Zhehuai Chen
Yu Zhang
Andrew Rosenberg
Bhuvana Ramabhadran
Pedro J. Moreno
Ankur Bapna
Heiga Zen
244
119
0
07 Apr 2022
ByT5 model for massively multilingual grapheme-to-phoneme conversion
Interspeech (Interspeech), 2022
Jian Zhu
Cong Zhang
David Jurgens
130
57
0
06 Apr 2022
Global Readiness of Language Technology for Healthcare: What would it Take to Combat the Next Pandemic?
International Conference on Computational Linguistics (COLING), 2022
Ishani Mondal
Kabir Ahuja
Mohit Jain
Jacki O'Neill
Kalika Bali
Monojit Choudhury
ELM
LM&MA
169
4
0
06 Apr 2022
Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Yanyang Li
Fuli Luo
Runxin Xu
Songfang Huang
Fei Huang
Liwei Wang
156
3
0
06 Apr 2022
Towards Best Practices for Training Multilingual Dense Retrieval Models
Xinyu Crystina Zhang
Kelechi Ogueji
Xueguang Ma
Jimmy J. Lin
RALM
153
42
0
05 Apr 2022
PaLM: Scaling Language Modeling with Pathways
Journal of Machine Learning Research (JMLR), 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
...
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
1.2K
7,494
0
05 Apr 2022
On Efficiently Acquiring Annotations for Multilingual Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Joel Ruben Antony Moniz
Barun Patra
Matthew R. Gormley
193
7
0
03 Apr 2022
Scaling Up Models and Data with t5x and seqio
Journal of Machine Learning Research (JMLR), 2022
Adam Roberts
Hyung Won Chung
Anselm Levskaya
Gaurav Mishra
James Bradbury
...
Brennan Saeta
Ryan Sepassi
A. Spiridonov
Joshua Newlan
Andrea Gesmundo
ALM
295
213
0
31 Mar 2022
Example-based Hypernetworks for Out-of-Distribution Generalization
Tomer Volk
Eyal Ben-David
Ohad Amosy
Gal Chechik
Roi Reichart
OOD
294
21
0
27 Mar 2022