2010.11934
Cited By
v3 (latest)
mT5: A massively multilingual pre-trained text-to-text transformer
22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel
Papers citing
"mT5: A massively multilingual pre-trained text-to-text transformer"
50 / 1,563 papers shown
Evaluation of Transfer Learning for Polish with a Text-to-Text Model
International Conference on Language Resources and Evaluation (LREC), 2022
Aleksandra Chrabrowa
Lukasz Dragan
Karol Grzegorczyk
D. Kajtoch
Mikołaj Koszowski
Robert Mroczkowski
Piotr Rybak
190
21
0
18 May 2022
OneAligner: Zero-shot Cross-lingual Transfer with One Rich-Resource Language Pair for Low-Resource Sentence Retrieval
Findings (Findings), 2022
Tong Niu
Kazuma Hashimoto
Yingbo Zhou
Caiming Xiong
VLM
129
7
0
17 May 2022
Controlling Translation Formality Using Pre-trained Multilingual Language Models
International Workshop on Spoken Language Translation (IWSLT), 2022
Elijah Matthew Rippeth
Sweta Agrawal
Marine Carpuat
AI4CE
227
20
0
13 May 2022
ViT5: Pretrained Text-to-Text Transformer for Vietnamese Language Generation
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Long Phan
H. Tran
Hieu Duy Nguyen
Trieu H. Trinh
ViT
303
86
0
13 May 2022
Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages
Kabir Ahuja
Sandipan Dandapat
Sunayana Sitaram
Monojit Choudhury
LRM
212
19
0
12 May 2022
On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Kabir Ahuja
Monojit Choudhury
Sandipan Dandapat
150
3
0
12 May 2022
UL2: Unifying Language Learning Paradigms
International Conference on Learning Representations (ICLR), 2022
Yi Tay
Mostafa Dehghani
Vinh Q. Tran
Xavier Garcia
Jason W. Wei
...
Tal Schuster
H. Zheng
Denny Zhou
N. Houlsby
Donald Metzler
AI4CE
570
359
0
10 May 2022
Enhancing Cross-lingual Transfer by Manifold Mixup
International Conference on Learning Representations (ICLR), 2022
Huiyun Yang
Huadong Chen
Hao Zhou
Lei Li
AAML
168
46
0
09 May 2022
Building Machine Translation Systems for the Next Thousand Languages
Ankur Bapna
Isaac Caswell
Julia Kreutzer
Orhan Firat
D. Esch
...
Apurva Shah
Yanping Huang
Zhiwen Chen
Yonghui Wu
Macduff Hughes
325
110
0
09 May 2022
Same Neurons, Different Languages: Probing Morphosyntax in Multilingual Pre-trained Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Karolina Stańczak
Edoardo Ponti
Lucas Torroba Hennigen
Robert Bamler
Isabelle Augenstein
MILM
LRM
462
11
0
04 May 2022
A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
David Ifeoluwa Adelani
Jesujoba Oluwadara Alabi
Angela Fan
Julia Kreutzer
Xiaoyu Shen
...
Ayodele Awokoya
Happy Buzaaba
Blessing K. Sibanda
Andiswa Bukula
Sam Manthalu
446
131
0
04 May 2022
State-of-the-art in Open-domain Conversational AI: A Survey
Tosin Adewumi
F. Liwicki
Marcus Liwicki
313
18
0
02 May 2022
Probing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2022
Ivan Vulić
Goran Glavaš
Fangyu Liu
Nigel Collier
Edoardo Ponti
Anna Korhonen
269
10
0
30 Apr 2022
How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training?
Conference of the Association for Machine Translation in the Americas (AMTA), 2022
Shiyue Zhang
Vishrav Chaudhary
Naman Goyal
James Cross
Guillaume Wenzek
Joey Tianyi Zhou
Francisco Guzman
224
22
0
29 Apr 2022
Polyglot Prompt: Multilingual Multitask PrompTraining
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Jinlan Fu
See-Kiong Ng
Pengfei Liu
188
13
0
29 Apr 2022
Por Qué Não Utiliser Alla Språk? Mixed Training with Gradient Optimization in Few-Shot Cross-Lingual Transfer
Haoran Xu
Kenton W. Murray
182
12
0
29 Apr 2022
A Comprehensive Understanding of Code-mixed Language Semantics using Hierarchical Transformer
IEEE Transactions on Computational Social Systems (IEEE TCSS), 2022
Ayan Sengupta
Tharun Suresh
Md. Shad Akhtar
Tanmoy Chakraborty
184
13
0
27 Apr 2022
WikiMulti: a Corpus for Cross-Lingual Summarization
Pavel Tikhonov
Valentin Malykh
102
4
0
23 Apr 2022
Tweets2Stance: Users stance detection exploiting Zero-Shot Learning Algorithms on Tweets
Margherita Gambini
T. Fagni
C. Senette
Maurizio Tesconi
139
3
0
22 Apr 2022
SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding
International Workshop on Semantic Evaluation (SemEval), 2022
Harish Tayyar Madabushi
Edward Gow-Smith
Marcos García
Carolina Scarton
M. Idiart
Aline Villavicencio
233
65
0
21 Apr 2022
Towards Arabic Sentence Simplification via Classification and Generative Approaches
Workshop on Arabic Natural Language Processing (WANLP), 2022
Nouran Khallaf
S. Sharoff
119
7
0
20 Apr 2022
On the Representation Collapse of Sparse Mixture of Experts
Neural Information Processing Systems (NeurIPS), 2022
Zewen Chi
Li Dong
Shaohan Huang
Damai Dai
Shuming Ma
...
Payal Bajaj
Xia Song
Xian-Ling Mao
Heyan Huang
Furu Wei
MoMe
MoE
310
136
0
20 Apr 2022
ALBETO and DistilBETO: Lightweight Spanish Language Models
International Conference on Language Resources and Evaluation (LREC), 2022
J. Canete
S. Donoso
Felipe Bravo-Marquez
Andrés Carvallo
Vladimir Araujo
200
25
0
19 Apr 2022
A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and Datasets
Wei Chen
Zhiwei Li
Hongyi Fang
Qian-Qian Yao
Cheng Zhong
Jianye Hao
Tao Gui
Xuanjing Huang
J. Peng
Zhongyu Wei
227
77
0
19 Apr 2022
Detecting Text Formality: A Study of Text Classification Approaches
Recent Advances in Natural Language Processing (RANLP), 2022
Daryna Dementieva
Ivan Trifinov
Sergey Petrakov
219
13
0
19 Apr 2022
IndicXNLI: Evaluating Multilingual Inference for Indian Languages
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Divyanshu Aggarwal
V. Gupta
Anoop Kunchukuttan
172
35
0
19 Apr 2022
MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jack G. M. FitzGerald
C. Hench
Charith Peris
Scott Mackie
Kay Rottmann
...
Laurie Crist
Misha Britan
Wouter Leeuwis
Gokhan Tur
Premkumar Natarajan
245
171
0
18 Apr 2022
GL-CLeF: A Global-Local Contrastive Learning Framework for Cross-lingual Spoken Language Understanding
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Libo Qin
Qiguang Chen
Tianbao Xie
Qixin Li
Jian-Guang Lou
Wanxiang Che
Min-Yen Kan
174
36
0
18 Apr 2022
AfriWOZ: Corpus for Exploiting Cross-Lingual Transferability for Generation of Dialogues in Low-Resource, African Languages
Tosin Adewumi
Mofetoluwa Adeyemi
Aremu Anuoluwapo
Bukola Peters
Happy Buzaaba
...
Phylis Ngigi
Orevaoghene Ahia
Ruqayya Nasir
F. Liwicki
Marcus Liwicki
166
2
0
17 Apr 2022
Bridging Cross-Lingual Gaps During Leveraging the Multilingual Sequence-to-Sequence Pretraining for Text Generation and Understanding
Changtong Zan
Liang Ding
Li Shen
Yu Cao
Weifeng Liu
Dacheng Tao
LRM
189
8
0
16 Apr 2022
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yizhong Wang
Swaroop Mishra
Pegah Alipoormolabashi
Yeganeh Kordi
Amirreza Mirzaei
...
Chitta Baral
Yejin Choi
Noah A. Smith
Hannaneh Hajishirzi
Daniel Khashabi
ELM
644
1,016
0
16 Apr 2022
WordAlchemy: A transformer-based Reverse Dictionary
S. Mane
Harshali B. Patil
Kanhaiya Madaswar
Pranav Sadavarte
210
6
0
16 Apr 2022
Chinese Idiom Paraphrasing
Transactions of the Association for Computational Linguistics (TACL), 2022
Jipeng Qiang
Yang Li
Chaowei Zhang
Yun Li
Yunhao Yuan
Yi Zhu
Xin Wu
165
10
0
15 Apr 2022
Summarization with Graphical Elements
Maartje ter Hoeve
Julia Kiseleva
Maarten de Rijke
263
2
0
15 Apr 2022
mGPT: Few-Shot Learners Go Multilingual
Transactions of the Association for Computational Linguistics (TACL), 2022
Oleh Shliazhko
Alena Fenogenova
Maria Tikhonova
Vladislav Mikhailov
Anastasia Kozlova
Tatiana Shavrina
364
191
0
15 Apr 2022
GPT-NeoX-20B: An Open-Source Autoregressive Language Model
Sid Black
Stella Biderman
Eric Hallahan
Quentin G. Anthony
Leo Gao
...
Shivanshu Purohit
Laria Reynolds
J. Tow
Benqi Wang
Samuel Weinbach
395
955
0
14 Apr 2022
Adapting Pre-trained Language Models to African Languages via Multilingual Adaptive Fine-Tuning
International Conference on Computational Linguistics (COLING), 2022
Jesujoba Oluwadara Alabi
David Ifeoluwa Adelani
Marius Mosbach
Dietrich Klakow
259
180
0
13 Apr 2022
End-to-End Speech Translation for Code Switched Speech
Findings (Findings), 2022
Orion Weller
Matthias Sperber
Telmo Pires
Hendra Setiawan
Christian Gollan
Dominic Telaar
Matthias Paulik
234
35
0
11 Apr 2022
Assessment of Massively Multilingual Sentiment Classifiers
Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), 2022
Krzysztof Rajda
Lukasz Augustyniak
Piotr Gramacki
Marcin Gruza
Szymon Woźniak
Tomasz Kajdanowicz
211
7
0
11 Apr 2022
A Survey on Legal Judgment Prediction: Datasets, Metrics, Models and Challenges
IEEE Access (IEEE Access), 2022
Junyun Cui
Xiaoyu Shen
Feiping Nie
Liang Luo
Jinglong Wang
Yulong Chen
AILaw
ELM
161
99
0
11 Apr 2022
MMTAfrica: Multilingual Machine Translation for African Languages
Conference on Machine Translation (WMT), 2022
Chris C. Emezue
Bonaventure F. P. Dossou
134
25
0
08 Apr 2022
MAESTRO: Matched Speech Text Representations through Modality Matching
Interspeech (Interspeech), 2022
Zhehuai Chen
Yu Zhang
Andrew Rosenberg
Bhuvana Ramabhadran
Pedro J. Moreno
Ankur Bapna
Heiga Zen
244
119
0
07 Apr 2022
ByT5 model for massively multilingual grapheme-to-phoneme conversion
Interspeech (Interspeech), 2022
Jian Zhu
Cong Zhang
David Jurgens
130
57
0
06 Apr 2022
Global Readiness of Language Technology for Healthcare: What would it Take to Combat the Next Pandemic?
International Conference on Computational Linguistics (COLING), 2022
Ishani Mondal
Kabir Ahuja
Mohit Jain
Jacki O'Neill
Kalika Bali
Monojit Choudhury
ELM
LM&MA
169
4
0
06 Apr 2022
Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Yanyang Li
Fuli Luo
Runxin Xu
Songfang Huang
Fei Huang
Liwei Wang
156
3
0
06 Apr 2022
Towards Best Practices for Training Multilingual Dense Retrieval Models
Xinyu Crystina Zhang
Kelechi Ogueji
Xueguang Ma
Jimmy J. Lin
RALM
153
42
0
05 Apr 2022
PaLM: Scaling Language Modeling with Pathways
Journal of Machine Learning Research (JMLR), 2022
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
...
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
1.2K
7,494
0
05 Apr 2022
On Efficiently Acquiring Annotations for Multilingual Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Joel Ruben Antony Moniz
Barun Patra
Matthew R. Gormley
193
7
0
03 Apr 2022
Scaling Up Models and Data with t5x and seqio
Journal of Machine Learning Research (JMLR), 2022
Adam Roberts
Hyung Won Chung
Anselm Levskaya
Gaurav Mishra
James Bradbury
...
Brennan Saeta
Ryan Sepassi
A. Spiridonov
Joshua Newlan
Andrea Gesmundo
ALM
295
213
0
31 Mar 2022
Example-based Hypernetworks for Out-of-Distribution Generalization
Tomer Volk
Eyal Ben-David
Ohad Amosy
Gal Chechik
Roi Reichart
OOD
294
21
0
27 Mar 2022