mT5: A massively multilingual pre-trained text-to-text transformer

22 October 2020

Papers citing "mT5: A massively multilingual pre-trained text-to-text transformer"

50 / 1,563 papers shown

One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

...

226

130

24 Mar 2022

Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation

Interspeech (Interspeech), 2022

Colin Cherry

232

24 Mar 2022

Ensembling and Knowledge Distilling of Large Sequence Taggers for Grammatical Error Correction

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

M. Tarnavskyi

Artem Chernodub

Kostiantyn Omelianchuk

3DV

147

24 Mar 2022

Probing for Labeled Dependency Trees

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Max Müller-Eberstein

Rob van der Goot

Barbara Plank

138

24 Mar 2022

Multilingual CheckList: Generation and Evaluation

311

24 Mar 2022

A Survey on Cross-Lingual Summarization

Transactions of the Association for Computational Linguistics (TACL), 2022

Zhixu Li

Jie Zhou

169

23 Mar 2022

DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

169

21 Mar 2022

AraBART: a Pretrained Arabic Sequence-to-Sequence Model for Abstractive Summarization

Workshop on Arabic Natural Language Processing (WANLP), 2022

Michalis Vazirgiannis

214

21 Mar 2022

Match the Script, Adapt if Multilingual: Analyzing the Effect of Multilingual Pretraining on Cross-lingual Transferability

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Yoshinari Fujinuma

Jordan L. Boyd-Graber

Katharina Kann

AAML

258

21 Mar 2022

On Robust Prefix-Tuning for Text Classification

International Conference on Learning Representations (ICLR), 2022

Zonghan Yang

Yang Liu

VLM

182

19 Mar 2022

Pretraining with Artificial Language: Studying Transferable Knowledge in Language Models

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Ryokan Ri

Yoshimasa Tsuruoka

226

19 Mar 2022

Meta-X$_{NLG}$: A Meta-Learning Approach Based on Language Clustering for Zero-Shot Cross-Lingual Transfer and Generation

Findings (Findings), 2022

Kaushal Kumar Maurya

M. Desarkar

231

19 Mar 2022

Challenges and Strategies in Cross-Cultural NLP

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Daniel Hershcovich

...

343

232

18 Mar 2022

Towards Lithuanian grammatical error correction

Lukas Stankevičius

Mantas Lukoševičius

3DV

134

18 Mar 2022

Coloring the Blank Slate: Pre-training Imparts a Hierarchical Inductive Bias to Sequence-to-sequence Models

Findings (Findings), 2022

225

17 Mar 2022

Pre-Trained Multilingual Sequence-to-Sequence Models: A Hope for Low-Resource Language Translation?

Findings (Findings), 2022

David Ifeoluwa Adelani

Ruisi Su

Arya D. McCarthy

VLM

354

16 Mar 2022

MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages

Findings (Findings), 2022

Graham Neubig

223

16 Mar 2022

Multilingual Generative Language Models for Zero-Shot Cross-Lingual Event Argument Extraction

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

164

15 Mar 2022

Improving Word Translation via Two-Stage Contrastive Learning

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

333

15 Mar 2022

Does Corpus Quality Really Matter for Low-Resource Languages?

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Mikel Artetxe

Itziar Aldabe

Rodrigo Agerri

Olatz Perez-de-Viñaspre

Aitor Soroa Etxabe

227

15 Mar 2022

ViWOZ: A Multi-Domain Task-Oriented Dialogue Systems Dataset For Low-resource Language

150

15 Mar 2022

Can Synthetic Translations Improve Bitext Quality?

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Eleftheria Briakou

Marine Carpuat

144

15 Mar 2022

VAST: The Valence-Assessing Semantics Test for Contextualizing Language Models

AAAI Conference on Artificial Intelligence (AAAI), 2022

Robert Wolfe

Aylin Caliskan

113

14 Mar 2022

CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual Entailment

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

218

158

14 Mar 2022

Active Evaluation: Efficient NLG Evaluation with Few Pairwise Comparisons

Annual Meeting of the Association for Computational Linguistics (ACL), 2022

Akash Kumar Mohankumar

Mitesh M. Khapra

ELM AAML

209

11 Mar 2022

IndicNLG Benchmark: Multilingual Datasets for Diverse NLG Tasks in Indic Languages

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Mitesh M. Khapra

263

10 Mar 2022

IT5: Text-to-text Pretraining for Italian Language Understanding and Generation

International Conference on Language Resources and Evaluation (LREC), 2022

Gabriele Sarti

Malvina Nissim

AILaw

255

07 Mar 2022

Mukayese: Turkish NLP Strikes Back

Findings (Findings), 2022

233

02 Mar 2022

SemSup: Semantic Supervision for Simple and Scalable Zero-shot Generalization

323

26 Feb 2022

Morphology Without Borders: Clause-Level Morphology

Transactions of the Association for Computational Linguistics (TACL), 2022

Omer Goldman

Reut Tsarfaty

AILaw

167

25 Feb 2022

Using natural language prompts for machine translation

Xavier Garcia

Orhan Firat

AI4CE

221

23 Feb 2022

A New Generation of Perspective API: Efficient Multilingual Character-level Transformers

Knowledge Discovery and Data Mining (KDD), 2022

Alyssa Lees

Vinh Q. Tran

Yi Tay

Jeffrey Scott Sorensen

Jai Gupta

Donald Metzler

Lucy Vasserman

226

255

22 Feb 2022

CALCS 2021 Shared Task: Machine Translation for Code-Switched Data

181

19 Feb 2022

ST-MoE: Designing Stable and Transferable Sparse Expert Models

422

298

17 Feb 2022

Sequence-to-Sequence Resources for Catalan

Ona de Gibert

Ksenia Kharitonova

B. Figueras

Jordi Armengol-Estapé

Maite Melero

14 Feb 2022

Integrating question answering and text-to-SQL in Portuguese

International Conference on Computational Processing of the Portuguese Language (PROPOR), 2022

M. M. José

M. A. José

Denis Deratani Mauá

Fabio Gagliardi Cozman

LMTD

171

08 Feb 2022

Cedille: A large autoregressive French language model

Martin Müller

Florian Laurent

197

07 Feb 2022

mSLAM: Massively multilingual joint pre-training for speech and text

Colin Cherry

175

122

03 Feb 2022

Examining Scaling and Transfer of Language Model Architectures for Machine Translation

International Conference on Machine Learning (ICML), 2022

277

01 Feb 2022

XAlign: Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages

The Web Conference (WWW), 2022

Vasudeva Varma

166

01 Feb 2022

Cross-Lingual Dialogue Dataset Creation via Outline-Based Generation

Transactions of the Association for Computational Linguistics (TACL), 2022

262

31 Jan 2022

Correcting diacritics and typos with a ByT5 transformer model

Applied Sciences (Appl. Sci.), 2022

Lukas Stankevičius

M. Lukoševičius

J. Kapočiūtė-Dzikienė

Monika Briediene

Tomas Krilavičius

194

31 Jan 2022

Schema-Free Dependency Parsing via Sequence Generation

Si Li

Juanzi Li

Lei Hou

138

28 Jan 2022

Towards a Cleaner Document-Oriented Multilingual Crawled Corpus

International Conference on Language Resources and Evaluation (LREC), 2022

206

193

17 Jan 2022

A Warm Start and a Clean Crawled Corpus -- A Recipe for Good Language Models

International Conference on Language Resources and Evaluation (LREC), 2022

Vésteinn Snæbjarnarson

Haukur Barri Símonarson

Pétur Orri Ragnarsson

Svanhvít Lilja Ingólfsdóttir

H. Jónsson

Vilhjálmur Þorsteinsson

H. Einarsson

273

14 Jan 2022

Pretrained Language Models for Text Generation: A Survey

ACM Computing Surveys (ACM CSUR), 2022

519

263

14 Jan 2022

CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark

Yuan Yao

Qingxiu Dong

Jian Guan

Boxi Cao

Zhengyan Zhang

...

Zhiyuan Liu

Xianpei Han

Erhong Yang

Zhifang Sui

Maosong Sun

ALM ELM

226

27 Dec 2021

CABACE: Injecting Character Sequence Information and Domain Knowledge for Enhanced Acronym and Long-Form Extraction

144

25 Dec 2021

Few-shot Learning with Multilingual Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

...

Luke Zettlemoyer

Xian Li

359

355

20 Dec 2021

CrossSum: Beyond English-Centric Cross-Lingual Summarization for 1,500+ Language Pairs

184

16 Dec 2021