2010.11934
mT5: A massively multilingual pre-trained text-to-text transformer
22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel
Papers citing
"mT5: A massively multilingual pre-trained text-to-text transformer"
50 / 395 papers shown
Title
Weakly Supervised Data Augmentation Through Prompting for Dialogue Understanding
Maximillian Chen
Alexandros Papangelis
Chenyang Tao
Andrew Rosenbaum
Seokhwan Kim
Yang Liu
Zhou Yu
Dilek Z. Hakkani-Tür
39
32
0
25 Oct 2022
IDK-MRC: Unanswerable Questions for Indonesian Machine Reading Comprehension
Rifki Afina Putri
Alice H. Oh
28
9
0
25 Oct 2022
EUR-Lex-Sum: A Multi- and Cross-lingual Dataset for Long-form Summarization in the Legal Domain
Dennis Aumiller
Ashish Chouhan
Michael Gertz
ELM
AILaw
32
35
0
24 Oct 2022
RuCoLA: Russian Corpus of Linguistic Acceptability
Vladislav Mikhailov
T. Shamardina
Max Ryabinin
A. Pestova
I. Smurov
Ekaterina Artemova
25
28
0
23 Oct 2022
Graphemic Normalization of the Perso-Arabic Script
R. Doctor
Alexander Gutkin
Cibu Johny
Brian Roark
R. Sproat
36
4
0
21 Oct 2022
Gui at MixMT 2022 : English-Hinglish: An MT approach for translation of code mixed data
Akshat Gahoi
Jayant Duneja
Anshul Padhi
Shivam Mangale
Saransh Rajput
Tanvi Kamble
D. Sharma
Vasudeva Varma
25
3
0
21 Oct 2022
SIT at MixMT 2022: Fluent Translation Built on Giant Pre-trained Models
A. Khan
Hrishikesh Kanade
G. Budhrani
Preet Jhanglani
Jia Xu
79
2
0
21 Oct 2022
Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR
Zhehuai Chen
Ankur Bapna
Andrew Rosenberg
Yu Zhang
Bhuvana Ramabhadran
Pedro J. Moreno
Nanxin Chen
30
17
0
18 Oct 2022
Tone prediction and orthographic conversion for Basaa
I. Nikitin
Brian O'Connor
Anastasia N. Safonova
13
1
0
13 Oct 2022
Knowledge Distillation Transfer Sets and their Impact on Downstream NLU Tasks
Charith Peris
Lizhen Tan
Thomas Gueudré
Turan Gojayev
Vivi Wei
Gokmen Oz
22
4
0
10 Oct 2022
Comparing Computational Architectures for Automated Journalism
Yan Sym
João Gabriel Moura Campos
M. M. José
Fabio Gagliardi Cozman
26
0
0
08 Oct 2022
Generative Language Models for Paragraph-Level Question Generation
Asahi Ushio
Fernando Alva-Manchego
Jose Camacho-Collados
ELM
11
45
0
08 Oct 2022
Event Extraction: A Survey
Viet Dac Lai
15
9
0
07 Oct 2022
A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Aishwarya Kamath
Peter Anderson
Su Wang
Jing Yu Koh
Alexander Ku
Austin Waters
Yinfei Yang
Jason Baldridge
Zarana Parekh
LM&Ro
20
45
0
06 Oct 2022
Language Models are Multilingual Chain-of-Thought Reasoners
Freda Shi
Mirac Suzgun
Markus Freitag
Xuezhi Wang
Suraj Srivats
...
Yi Tay
Sebastian Ruder
Denny Zhou
Dipanjan Das
Jason W. Wei
ReLM
LRM
170
325
0
06 Oct 2022
XDoc: Unified Pre-training for Cross-Format Document Understanding
Jingye Chen
Tengchao Lv
Lei Cui
Changrong Zhang
Furu Wei
48
13
0
06 Oct 2022
Honest Students from Untrusted Teachers: Learning an Interpretable Question-Answering Pipeline from a Pretrained Language Model
Jacob Eisenstein
D. Andor
Bernd Bohnet
Michael Collins
David M. Mimno
LRM
189
24
0
05 Oct 2022
GROOT: Corrective Reward Optimization for Generative Sequential Labeling
Kazuma Hashimoto
K. Raman
VLM
11
1
0
29 Sep 2022
COMPILING: A Benchmark Dataset for Chinese Complexity Controllable Definition Generation
Jiaxin Yuan
Cunliang Kong
Chenhui Xie
Liner Yang
Erhong Yang
27
4
0
29 Sep 2022
Bidirectional Language Models Are Also Few-shot Learners
Ajay Patel
Bryan Li
Mohammad Sadegh Rasooli
Noah Constant
Colin Raffel
Chris Callison-Burch
LRM
62
45
0
29 Sep 2022
An Empirical Study on Cross-X Transfer for Legal Judgment Prediction
Joel Niklaus
Matthias Stürmer
Ilias Chalkidis
ELM
AILaw
32
18
0
25 Sep 2022
MonoByte: A Pool of Monolingual Byte-level Language Models
Hugo Queiroz Abonizio
Leandro Rodrigues de Souza
R. Lotufo
Rodrigo Nogueira
23
1
0
22 Sep 2022
A Benchmark for Understanding and Generating Dialogue between Characters in Stories
Jianzhu Yao
Ziqi Liu
Jian-Yu Guan
Minlie Huang
21
1
0
18 Sep 2022
Parameter-Efficient Finetuning for Robust Continual Multilingual Learning
Kartikeya Badola
Shachi Dave
Partha P. Talukdar
CLL
KELM
39
7
0
14 Sep 2022
MATra: A Multilingual Attentive Transliteration System for Indian Scripts
Yash Raj
Bhavesh Laddagiri
19
4
0
23 Aug 2022
A Comprehensive Survey of Natural Language Generation Advances from the Perspective of Digital Deception
Keenan I. Jones
Enes Altuncu
V. N. Franqueira
Yi-Chia Wang
Shujun Li
DeLMO
34
3
0
11 Aug 2022
AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model
Saleh Soltan
Shankar Ananthakrishnan
Jack G. M. FitzGerald
Rahul Gupta
Wael Hamza
...
Mukund Sridhar
Fabian Triefenbach
Apurv Verma
Gökhan Tür
Premkumar Natarajan
39
82
0
02 Aug 2022
Language Modelling with Pixels
Phillip Rust
Jonas F. Lotz
Emanuele Bugliarello
Elizabeth Salesky
Miryam de Lhoneux
Desmond Elliott
VLM
30
46
0
14 Jul 2022
BERTIN: Efficient Pre-Training of a Spanish Language Model using Perplexity Sampling
Javier de la Rosa
E. G. Ponferrada
Paulo Villegas
Pablo González de Prado Salas
Manu Romero
María Grandury
30
95
0
14 Jul 2022
esCorpius: A Massive Spanish Crawling Corpus
Asier Gutiérrez-Fandiño
David Pérez-Fernández
Jordi Armengol-Estapé
D. Griol
Z. Callejas
38
2
0
30 Jun 2022
Deformable CNN and Imbalance-Aware Feature Learning for Singing Technique Classification
Yuya Yamamoto
Juhan Nam
Hiroko Terasawa
16
8
0
24 Jun 2022
Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation
Shengyao Zhuang
Houxing Ren
Linjun Shou
Jian Pei
Ming Gong
Guido Zuccon
Daxin Jiang
32
64
0
21 Jun 2022
Automatic Summarization of Russian Texts: Comparison of Extractive and Abstractive Methods
Valeriya Goloviznina
Evgeny Kotelnikov
11
3
0
18 Jun 2022
A Memory System of a Robot Cognitive Architecture and its Implementation in ArmarX
Fabian Peller-Konrad
Rainer Kartmann
Christian R. G. Dreher
Andre Meixner
Fabian Reister
Markus Grotz
Tamim Asfour
16
17
0
05 Jun 2022
hmBERT: Historical Multilingual Language Models for Named Entity Recognition
Stefan Schweter
Luisa März
Katharina Schmid
Erion Çano
33
18
0
31 May 2022
Crossmodal-3600: A Massively Multilingual Multimodal Evaluation Dataset
Ashish V. Thapliyal
Jordi Pont-Tuset
Xi Chen
Radu Soricut
VGen
73
72
0
25 May 2022
When does Parameter-Efficient Transfer Learning Work for Machine Translation?
A. Üstün
Asa Cooper Stickland
35
7
0
23 May 2022
The Geometry of Multilingual Language Model Representations
Tyler A. Chang
Z. Tu
Benjamin Bergen
16
56
0
22 May 2022
Descartes: Generating Short Descriptions of Wikipedia Articles
Marija Sakota
Maxime Peyrard
Robert West
VLM
18
2
0
20 May 2022
UL2: Unifying Language Learning Paradigms
Yi Tay
Mostafa Dehghani
Vinh Q. Tran
Xavier Garcia
Jason W. Wei
...
Tal Schuster
H. Zheng
Denny Zhou
N. Houlsby
Donald Metzler
AI4CE
57
294
0
10 May 2022
State-of-the-art in Open-domain Conversational AI: A Survey
Tosin P. Adewumi
F. Liwicki
Marcus Liwicki
24
15
0
02 May 2022
Probing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders
Ivan Vulić
Goran Glavaš
Fangyu Liu
Nigel Collier
E. Ponti
Anna Korhonen
17
8
0
30 Apr 2022
How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training?
Shiyue Zhang
Vishrav Chaudhary
Naman Goyal
James Cross
Guillaume Wenzek
Mohit Bansal
Francisco Guzmán
31
16
0
29 Apr 2022
Polyglot Prompt: Multilingual Multitask PrompTraining
Jinlan Fu
See-Kiong Ng
Pengfei Liu
17
7
0
29 Apr 2022
A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and Datasets
Wei Chen
Zhiwei Li
Hongyi Fang
Qian-Qian Yao
Cheng Zhong
Jianye Hao
Qi Zhang
Xuanjing Huang
J. Peng
Zhongyu Wei
19
57
0
19 Apr 2022
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Yizhong Wang
Swaroop Mishra
Pegah Alipoormolabashi
Yeganeh Kordi
Amirreza Mirzaei
...
Chitta Baral
Yejin Choi
Noah A. Smith
Hannaneh Hajishirzi
Daniel Khashabi
ELM
27
779
0
16 Apr 2022
mGPT: Few-Shot Learners Go Multilingual
Oleh Shliazhko
Alena Fenogenova
Maria Tikhonova
Vladislav Mikhailov
Anastasia Kozlova
Tatiana Shavrina
38
148
0
15 Apr 2022
GPT-NeoX-20B: An Open-Source Autoregressive Language Model
Sid Black
Stella Biderman
Eric Hallahan
Quentin G. Anthony
Leo Gao
...
Shivanshu Purohit
Laria Reynolds
J. Tow
Benqi Wang
Samuel Weinbach
61
800
0
14 Apr 2022
MMTAfrica: Multilingual Machine Translation for African Languages
Chris C. Emezue
Bonaventure F. P. Dossou
19
24
0
08 Apr 2022
ByT5 model for massively multilingual grapheme-to-phoneme conversion
Jian Zhu
Cong Zhang
David Jurgens
11
36
0
06 Apr 2022