mT5: A massively multilingual pre-trained text-to-text transformer

22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel

Papers citing "mT5: A massively multilingual pre-trained text-to-text transformer"

50 / 358 papers shown
Sāmayik: A Benchmark and Dataset for English-Sanskrit Translation
Ayush Maheshwari
Ashim Gupta
Amrith Krishna
Atul Kumar Singh
Ganesh Ramakrishnan
G. Anil Kumar
Jitin Singla
25
0
0
23 May 2023
mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models
Peiqin Lin
Chengzhi Hu
Zheyu Zhang
André F. T. Martins
Hinrich Schütze
27
1
0
23 May 2023
Extrapolating Multilingual Understanding Models as Multilingual Generators
Bohong Wu
Fei Yuan
Hai Zhao
Lei Li
Jingjing Xu
AI4CE
25
2
0
22 May 2023
Rethinking Semi-supervised Learning with Language Models
Zhengxiang Shi
Francesco Tonolini
Nikolaos Aletras
Emine Yilmaz
G. Kazai
Yunlong Jiao
27
17
0
22 May 2023
Bidirectional Transformer Reranker for Grammatical Error Correction
Ying Zhang
Hidetaka Kamigaito
Manabu Okumura
6
2
0
22 May 2023
GPT-SW3: An Autoregressive Language Model for the Nordic Languages
Ariel Ekgren
Amaru Cuba Gyllensten
Felix Stollenwerk
Joey Öhman
T. Isbister
Evangelia Gogoulou
F. Carlsson
Alice Heiman
Judit Casademont
Magnus Sahlgren
21
13
0
22 May 2023
Mitigating Data Imbalance and Representation Degeneration in Multilingual Machine Translation
Wen Lai
Alexandra Chronopoulou
Alexander M. Fraser
30
4
0
22 May 2023
Multilingual Simplification of Medical Texts
Sebastian Antony Joseph
Kathryn Kazanas
Keziah Reina
Vishnesh J. Ramanathan
Wei-ping Xu
Byron C. Wallace
Junyi Jessy Li
30
12
0
21 May 2023
SHINE: Syntax-augmented Hierarchical Interactive Encoder for Zero-shot Cross-lingual Information Extraction
Jun-Yu Ma
Jia-Chen Gu
Zhen-Hua Ling
Quan Liu
Cong Liu
Guoping Hu
47
1
0
21 May 2023
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages
Ayyoob Imani
Peiqin Lin
Amir Hossein Kargaran
Silvia Severini
Masoud Jalili Sabet
...
Chunlan Ma
Helmut Schmid
André F. T. Martins
François Yvon
Hinrich Schütze
ALM
LRM
29
95
0
20 May 2023
mLongT5: A Multilingual and Efficient Text-To-Text Transformer for Longer Sequences
David C. Uthus
Santiago Ontañón
Joshua Ainslie
Mandy Guo
VLM
25
10
0
18 May 2023
Visual Question Answering: A Survey on Techniques and Common Trends in Recent Literature
Ana Claudia Akemi Matsuki de Faria
Felype de Castro Bastos
Jose Victor Nogueira Alves da Silva
Vitor Lopes Fabris
Valeska Uchôa
Décio Gonçalves de Aguiar Neto
C. F. G. Santos
30
22
0
18 May 2023
Generalized Multiple Intent Conditioned Slot Filling
Harshil Shah
Arthur Wilcke
Marius Cobzarenco
Cristian C Cobzarenco
Edward Challis
David Barber
11
0
0
18 May 2023
PaLM 2 Technical Report
Rohan Anil
Andrew M. Dai
Orhan Firat
Melvin Johnson
Dmitry Lepikhin
...
Ce Zheng
Wei Zhou
Denny Zhou
Slav Petrov
Yonghui Wu
ReLM
LRM
62
1,138
0
17 May 2023
Language Model Tokenizers Introduce Unfairness Between Languages
Aleksandar Petrov
Emanuele La Malfa
Philip H. S. Torr
Adel Bibi
16
96
0
17 May 2023
Unsupervised Sentence Representation Learning with Frequency-induced Adversarial Tuning and Incomplete Sentence Filtering
Bing Wang
Ximing Li
Zhiyao Yang
Yuanyuan Guan
Jiayin Li
Sheng-sheng Wang
27
6
0
15 May 2023
Vārta: A Large-Scale Headline-Generation Dataset for Indic Languages
Rahul Aralikatte
Ziling Cheng
Sumanth Doddapaneni
Jackie C.K. Cheung
27
8
0
10 May 2023
An Exploration of Encoder-Decoder Approaches to Multi-Label Classification for Legal and Biomedical Text
Yova Kementchedjhieva
Ilias Chalkidis
13
21
0
09 May 2023
CSED: A Chinese Semantic Error Diagnosis Corpus
Bo Sun
Baoxin Wang
Yixuan Wang
Wanxiang Che
Dayong Wu
Shijin Wang
Ting Liu
35
4
0
09 May 2023
Transfer to a Low-Resource Language via Close Relatives: The Case Study on Faroese
Vésteinn Snæbjarnarson
A. Simonsen
Goran Glavaš
Ivan Vulić
27
19
0
18 Apr 2023
Computational modeling of semantic change
Nina Tahmasebi
Haim Dubossarsky
26
6
0
13 Apr 2023
Measuring Normative and Descriptive Biases in Language Models Using Census Data
Samia Touileb
Lilja Øvrelid
Erik Velldal
25
4
0
12 Apr 2023
Exploring the Use of Foundation Models for Named Entity Recognition and Lemmatization Tasks in Slavic Languages
Gabriela Pałka
Artur Nowakowski
24
2
0
11 Apr 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
24
39
0
07 Apr 2023
Rethinking the Role of Token Retrieval in Multi-Vector Retrieval
Jinhyuk Lee
Zhuyun Dai
Sai Meher Karthik Duddu
Tao Lei
Iftekhar Naim
Ming-Wei Chang
Vincent Zhao
17
15
0
04 Apr 2023
Resources and Few-shot Learners for In-context Learning in Slavic Languages
Michal Štefánik
Marek Kadlčík
Piotr Gramacki
Petr Sojka
22
3
0
04 Apr 2023
SimCSum: Joint Learning of Simplification and Cross-lingual Summarization for Cross-lingual Science Journalism
Mehwish Fatima
Tim Kolber
K. Markert
Michael Strube
21
0
0
04 Apr 2023
Summarizing Indian Languages using Multilingual Transformers based Models
Dhaval Taunk
Vasudeva Varma
VLM
22
9
0
29 Mar 2023
Sigmoid Loss for Language Image Pre-Training
Xiaohua Zhai
Basil Mustafa
Alexander Kolesnikov
Lucas Beyer
CLIP
VLM
19
931
0
27 Mar 2023
Fine-Tashkeel: Finetuning Byte-Level Models for Accurate Arabic Text Diacritization
Bashar Al-Rfooh
Gheith A. Abandah
Rami Al-Rfou
24
4
0
25 Mar 2023
XWikiGen: Cross-lingual Summarization for Encyclopedic Text Generation in Low Resource Languages
Dhaval Taunk
Shivprasad Sagare
Anupam Patil
Shivansh Subramanian
Manish Gupta
Vasudeva Varma
17
3
0
22 Mar 2023
DiTTO: A Feature Representation Imitation Approach for Improving Cross-Lingual Transfer
Shanu Kumar
Abbaraju Soujanya
Sandipan Dandapat
Sunayana Sitaram
Monojit Choudhury
VLM
25
1
0
04 Mar 2023
Cross-Lingual Question Answering over Knowledge Base as Reading Comprehension
Chen Zhang
Yuxuan Lai
Yansong Feng
Xingyu Shen
Haowei Du
Dongyan Zhao
13
3
0
26 Feb 2023
Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions?
Yang Chen
Hexiang Hu
Yi Luan
Haitian Sun
Soravit Changpinyo
Alan Ritter
Ming-Wei Chang
37
80
0
23 Feb 2023
Connecting Vision and Language with Video Localized Narratives
P. Voigtlaender
Soravit Changpinyo
Jordi Pont-Tuset
Radu Soricut
V. Ferrari
VGen
36
21
0
22 Feb 2023
Zero and Few-Shot Localization of Task-Oriented Dialogue Agents with a Distilled Representation
M. Moradshahi
Sina J. Semnani
M. Lam
21
7
0
18 Feb 2023
Distillation of encoder-decoder transformers for sequence labelling
M. Farina
D. Pappadopulo
Anant Gupta
Leslie Huang
Ozan Irsoy
Thamar Solorio
VLM
85
3
0
10 Feb 2023
The unreasonable effectiveness of few-shot learning for machine translation
Xavier Garcia
Yamini Bansal
Colin Cherry
George F. Foster
M. Krikun
Fan Feng
Melvin Johnson
Orhan Firat
27
102
0
02 Feb 2023
idT5: Indonesian Version of Multilingual T5 Transformer
Mukhlish Fuadi
A. Wibawa
S. Sumpeno
11
6
0
02 Feb 2023
Bipol: Multi-axes Evaluation of Bias with Explainability in Benchmark Datasets
Tosin P. Adewumi
Isabella Sodergren
Lama Alkhaled
Sana Sabah Sabry
F. Liwicki
Marcus Liwicki
25
4
0
28 Jan 2023
One Model for All Domains: Collaborative Domain-Prefix Tuning for Cross-Domain NER
Xiang Chen
Lei Li
Q. Fei
Ningyu Zhang
Chuanqi Tan
Yong-jia Jiang
Fei Huang
Huajun Chen
19
23
0
25 Jan 2023
Truveta Mapper: A Zero-shot Ontology Alignment Framework
Mariyam Amir
Murchana Baruah
Mahsa Eslamialishah
Sina Ehsani
Alireza Bahramali
Sadra Naddaf-sh
Saman Zarandioon
25
7
0
24 Jan 2023
Language Embeddings Sometimes Contain Typological Generalizations
Robert Östling
Murathan Kurfali
NAI
24
9
0
19 Jan 2023
Curriculum Script Distillation for Multilingual Visual Question Answering
Khyathi Raghavi Chandu
A. Geramifard
19
0
0
17 Jan 2023
On the State of German (Abstractive) Text Summarization
Dennis Aumiller
Jing Fan
Michael Gertz
21
1
0
17 Jan 2023
GAE-ISumm: Unsupervised Graph-Based Summarization of Indian Languages
Lakshmi Sireesha Vakada
Anudeep Ch
Mounika Marreddy
S. Oota
R. Mamidi
16
1
0
25 Dec 2022
OpineSum: Entailment-based self-training for abstractive opinion summarization
Annie Louis
Joshua Maynez
21
7
0
21 Dec 2022
How Does Beam Search improve Span-Level Confidence Estimation in Generative Sequence Labeling?
Kazuma Hashimoto
Iftekhar Naim
K. Raman
UQLM
27
2
0
21 Dec 2022
ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models
Jonas Belouadi
Steffen Eger
44
24
0
20 Dec 2022
MULTI3NLU++: A Multilingual, Multi-Intent, Multi-Domain Dataset for Natural Language Understanding in Task-Oriented Dialogue
Nikita Moghe
E. Razumovskaia
Liane Guillou
Ivan Vulić
Anna Korhonen
Alexandra Birch
19
13
0
20 Dec 2022