2010.11934
mT5: A massively multilingual pre-trained text-to-text transformer

22 October 2020

Aditya Siddhant

Papers citing "mT5: A massively multilingual pre-trained text-to-text transformer"

50 / 1,563 papers shown

Evaluation of Transfer Learning for Polish with a Text-to-Text Model

International Conference on Language Resources and Evaluation (LREC), 2022

Aleksandra Chrabrowa

Karol Grzegorczyk

Mikołaj Koszowski

Robert Mroczkowski

190

21

0

18 May 2022

OneAligner: Zero-shot Cross-lingual Transfer with One Rich-Resource Language Pair for Low-Resource Sentence Retrieval

Findings, 2022

Kazuma Hashimoto

Yingbo Zhou

129

7

0

17 May 2022

Controlling Translation Formality Using Pre-trained Multilingual Language Models

International Workshop on Spoken Language Translation (IWSLT), 2022

Elijah Matthew Rippeth

227

20

0

13 May 2022

ViT5: Pretrained Text-to-Text Transformer for Vietnamese Language Generation

North American Chapter of the Association for Computational Linguistics (NAACL), 2022

Hieu Duy Nguyen

303

86

0

13 May 2022

Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages

Sandipan Dandapat

Sunayana Sitaram

Monojit Choudhury

212

19

0

12 May 2022

On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data

North American Chapter of the Association for Computational Linguistics (NAACL), 2022

Monojit Choudhury

Sandipan Dandapat

150

3

0

12 May 2022

UL2: Unifying Language Learning Paradigms

International Conference on Learning Representations (ICLR), 2022

Mostafa Dehghani

...

570

359

0

10 May 2022

Enhancing Cross-lingual Transfer by Manifold Mixup

International Conference on Learning Representations (ICLR), 2022

Lei Li

168

46

0

09 May 2022

Building Machine Translation Systems for the Next Thousand Languages

...

325

110

0

09 May 2022

Same Neurons, Different Languages: Probing Morphosyntax in Multilingual Pre-trained Models

North American Chapter of the Association for Computational Linguistics (NAACL), 2022

Karolina Stańczak

Lucas Torroba Hennigen

Isabelle Augenstein

462

11

0

04 May 2022

A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation

North American Chapter of the Association for Computational Linguistics (NAACL), 2022

David Ifeoluwa Adelani

Jesujoba Oluwadara Alabi

Angela Fan

Xiaoyu Shen

...

Ayodele Awokoya

Blessing K. Sibanda

446

131

0

04 May 2022

State-of-the-art in Open-domain Conversational AI: A Survey

313

18

0

02 May 2022

Probing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2022

269

10

0

30 Apr 2022

How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training?

Conference of the Association for Machine Translation in the Americas (AMTA), 2022

Vishrav Chaudhary

Guillaume Wenzek

Joey Tianyi Zhou

Francisco Guzman

224

22

0

29 Apr 2022

Polyglot Prompt: Multilingual Multitask PrompTraining

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

See-Kiong Ng

188

13

0

29 Apr 2022

Por Qué Não Utiliser Alla Språk? Mixed Training with Gradient Optimization in Few-Shot Cross-Lingual Transfer

Kenton W. Murray

182

12

0

29 Apr 2022

A Comprehensive Understanding of Code-mixed Language Semantics using Hierarchical Transformer

IEEE Transactions on Computational Social Systems (IEEE TCSS), 2022

Md. Shad Akhtar

Tanmoy Chakraborty

184

13

0

27 Apr 2022

WikiMulti: a Corpus for Cross-Lingual Summarization

Valentin Malykh

102

4

0

23 Apr 2022

Tweets2Stance: Users stance detection exploiting Zero-Shot Learning Algorithms on Tweets

Margherita Gambini

Maurizio Tesconi

139

3

0

22 Apr 2022

SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding

International Workshop on Semantic Evaluation (SemEval), 2022

Harish Tayyar Madabushi

Edward Gow-Smith

Carolina Scarton

Aline Villavicencio

233

65

0

21 Apr 2022

Towards Arabic Sentence Simplification via Classification and Generative Approaches

Workshop on Arabic Natural Language Processing (WANLP), 2022

119

7

0

20 Apr 2022

On the Representation Collapse of Sparse Mixture of Experts

Neural Information Processing Systems (NeurIPS), 2022

...

Xia Song

310

136

0

20 Apr 2022

ALBETO and DistilBETO: Lightweight Spanish Language Models

International Conference on Language Resources and Evaluation (LREC), 2022

Felipe Bravo-Marquez

Andrés Carvallo

Vladimir Araujo

200

25

0

19 Apr 2022

A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and Datasets

Jianye Hao

Xuanjing Huang

227

77

0

19 Apr 2022

Detecting Text Formality: A Study of Text Classification Approaches

Recent Advances in Natural Language Processing (RANLP), 2022

Daryna Dementieva

Sergey Petrakov

219

13

0

19 Apr 2022

IndicXNLI: Evaluating Multilingual Inference for Indian Languages

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Divyanshu Aggarwal

Anoop Kunchukuttan

172

35

0

19 Apr 2022

MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Jack G. M. FitzGerald

...

Premkumar Natarajan

245

171

0

18 Apr 2022

GL-CLeF: A Global-Local Contrastive Learning Framework for Cross-lingual Spoken Language Understanding

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

174

36

0

18 Apr 2022

AfriWOZ: Corpus for Exploiting Cross-Lingual Transferability for Generation of Dialogues in Low-Resource, African Languages

Mofetoluwa Adeyemi

Aremu Anuoluwapo

...

Orevaoghene Ahia

166

2

0

17 Apr 2022

Bridging Cross-Lingual Gaps During Leveraging the Multilingual Sequence-to-Sequence Pretraining for Text Generation and Understanding

Liang Ding

Li Shen

189

8

0

16 Apr 2022

Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Pegah Alipoormolabashi

Amirreza Mirzaei

...

Yejin Choi

Hannaneh Hajishirzi

Daniel Khashabi

644

1,016

0

16 Apr 2022

WordAlchemy: A transformer-based Reverse Dictionary

Harshali B. Patil

Kanhaiya Madaswar

Pranav Sadavarte

210

6

0

16 Apr 2022

Chinese Idiom Paraphrasing

Transactions of the Association for Computational Linguistics (TACL), 2022

165

10

0

15 Apr 2022

Summarization with Graphical Elements

Maartje ter Hoeve

Maarten de Rijke

263

2

0

15 Apr 2022

mGPT: Few-Shot Learners Go Multilingual

Transactions of the Association for Computational Linguistics (TACL), 2022

Alena Fenogenova

Maria Tikhonova

Vladislav Mikhailov

Anastasia Kozlova

Tatiana Shavrina

364

191

0

15 Apr 2022

GPT-NeoX-20B: An Open-Source Autoregressive Language Model

Stella Biderman

Quentin G. Anthony

...

Shivanshu Purohit

Samuel Weinbach

395

955

0

14 Apr 2022

Adapting Pre-trained Language Models to African Languages via Multilingual Adaptive Fine-Tuning

International Conference on Computational Linguistics (COLING), 2022

Jesujoba Oluwadara Alabi

David Ifeoluwa Adelani

Dietrich Klakow

259

180

0

13 Apr 2022

End-to-End Speech Translation for Code Switched Speech

Findings, 2022

Matthias Sperber

Hendra Setiawan

Christian Gollan

Matthias Paulik

234

35

0

11 Apr 2022

Assessment of Massively Multilingual Sentiment Classifiers

Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), 2022

Krzysztof Rajda

Lukasz Augustyniak

Szymon Woźniak

Tomasz Kajdanowicz

211

7

0

11 Apr 2022

A Survey on Legal Judgment Prediction: Datasets, Metrics, Models and Challenges

IEEE Access, 2022

Xiaoyu Shen

161

99

0

11 Apr 2022

MMTAfrica: Multilingual Machine Translation for African Languages

Conference on Machine Translation (WMT), 2022

Chris C. Emezue

Bonaventure F. P. Dossou

134

25

0

08 Apr 2022

MAESTRO: Matched Speech Text Representations through Modality Matching

Interspeech, 2022

Zhehuai Chen

Andrew Rosenberg

Bhuvana Ramabhadran

Pedro J. Moreno

244

119

0

07 Apr 2022

ByT5 model for massively multilingual grapheme-to-phoneme conversion

Interspeech, 2022

130

57

0

06 Apr 2022

Global Readiness of Language Technology for Healthcare: What would it Take to Combat the Next Pandemic?

International Conference on Computational Linguistics (COLING), 2022

Monojit Choudhury

169

4

0

06 Apr 2022

Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Runxin Xu

Fei Huang

156

3

0

06 Apr 2022

Towards Best Practices for Training Multilingual Dense Retrieval Models

Xinyu Crystina Zhang

153

42

0

05 Apr 2022

PaLM: Scaling Language Modeling with Pathways

Journal of Machine Learning Research (JMLR), 2022

Aakanksha Chowdhery

Sharan Narang

...

Kathy Meier-Hellstern

1.2K

7,494

0

05 Apr 2022

On Efficiently Acquiring Annotations for Multilingual Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Joel Ruben Antony Moniz

Matthew R. Gormley

193

7

0

03 Apr 2022

Scaling Up Models and Data with t5x and seqio

Journal of Machine Learning Research (JMLR), 2022

Hyung Won Chung

Anselm Levskaya

...

Andrea Gesmundo

295

213

0

31 Mar 2022

Example-based Hypernetworks for Out-of-Distribution Generalization

294

21

0

27 Mar 2022
