mT5: A massively multilingual pre-trained text-to-text transformer

22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel

Papers citing "mT5: A massively multilingual pre-trained text-to-text transformer"

50 / 1,561 papers shown
Title
Meta CLIP 2: A Worldwide Scaling Recipe
Yung-Sung Chuang
Yang Li
Dong Wang
Ching-Feng Yeh
Kehan Lyu
...
Zhuang Liu
Saining Xie
Anuj Kumar
Shang-Wen Li
Hu Xu
CLIP, VLM
344
13
0
29 Jul 2025
On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey
Meishan Zhang
Xin Zhang
X. Zhao
Shouzheng Huang
Baotian Hu
Min Zhang
245
3
0
28 Jul 2025
When Scale Meets Diversity: Evaluating Language Models on Fine-Grained Multilingual Claim Verification
Hanna Shcharbakova
Tatiana Anikina
N. Skachkova
Josef van Genabith
HILM, ALM, LRM
146
1
0
28 Jul 2025
CONCAP: Seeing Beyond English with Concepts Retrieval-Augmented Captioning
George Ibrahim
R. Ramos
Yova Kementchedjhieva
VLM
110
1
0
27 Jul 2025
Deep Learning Approaches for Multimodal Intent Recognition: A Survey
Jingwei Zhao
Yuhua Wen
Qifei Li
Minchi Hu
Yingying Zhou
...
Junyang Wu
Yingming Gao
Zhengqi Wen
Jianhua Tao
Ya Li
ViT
172
1
0
24 Jul 2025
Mind the Language Gap in Digital Humanities: LLM-Aided Translation of SKOS Thesauri
Felix Kraus
Nicolas Blumenröhr
Danah Tonne
Achim Streit
146
0
0
22 Jul 2025
Mangosteen: An Open Thai Corpus for Language Model Pretraining
Wannaphong Phatthiyaphaibun
Can Udomcharoenchaikit
Pakpoom Singkorapoom
Kunat Pipatanakul
Ekapol Chuangsuwanich
Peerat Limkonchotiwat
Sarana Nutanong
169
1
0
19 Jul 2025
Are Knowledge and Reference in Multilingual Language Models Cross-Lingually Consistent?
Xi Ai
Mahardika Krisna Ihsani
Min-Yen Kan
HILM
189
1
0
17 Jul 2025
Learn Globally, Speak Locally: Bridging the Gaps in Multilingual Reasoning
Jaedong Hwang
Kumar Tanmay
Seok-Jin Lee
Ayush Agrawal
Hamid Palangi
Kumar Ayush
Ila R Fiete
Paul Pu Liang
LRM
227
4
0
07 Jul 2025
QU-NLP at CheckThat! 2025: Multilingual Subjectivity in News Articles Detection using Feature-Augmented Transformer Models with Sequential Cross-Lingual Fine-Tuning
Mohammad AL-Smadi
94
0
0
01 Jul 2025
The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure
Niyati Bafna
Tianjian Li
Kenton W. Murray
David R. Mortensen
David Yarowsky
Hale Sirin
Daniel Khashabi
LRM
150
6
0
28 Jun 2025
Measuring (a Sufficient) World Model in LLMs: A Variance Decomposition Framework
Nadav Kunievsky
James A. Evans
187
0
0
19 Jun 2025
Just Go Parallel: Improving the Multilingual Capabilities of Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Muhammad Reza Qorib
Junyi Li
Hwee Tou Ng
LRM
228
4
0
16 Jun 2025
Assessing the Role of Data Quality in Training Bilingual Language Models
Skyler Seto
Maartje ter Hoeve
Maureen de Seyssel
David Grangier
151
0
0
15 Jun 2025
Refract ICL: Rethinking Example Selection in the Era of Million-Token Models
Arjun R. Akula
Kazuma Hashimoto
Krishna Srinivasan
Aditi Chaudhary
K. Raman
Michael Bendersky
212
0
0
14 Jun 2025
Training-free LLM Merging for Multi-task Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Zichuan Fu
Xian Wu
Y. X. R. Wang
Wanyu Wang
Shanshan Ye
Hongzhi Yin
Yi-Ju Chang
Yefeng Zheng
Xiangyu Zhao
MoMe
180
1
0
14 Jun 2025
Hatevolution: What Static Benchmarks Don't Tell Us
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Chiara Di Bonaventura
Barbara McGillivray
Yulan He
Albert Meroño-Peñuela
183
0
0
13 Jun 2025
One Tokenizer To Rule Them All: Emergent Language Plasticity via Multilingual Tokenizers
Diana Abagyan
Alejandro Salamanca
Andres Felipe Cruz-Salinas
Kris Cao
Hangyu Lin
Acyr Locatelli
Marzieh Fadaee
Ahmet Üstün
Sara Hooker
CLL
361
3
0
12 Jun 2025
Token Constraint Decoding Improves Robustness on Question Answering for Large Language Models
Jui-Ming Yao
Hao-Yuan Chen
Zi-Xian Tang
Bing-Jia Tan
Sheng-Wei Peng
Bing-Cheng Xie
Shun-Feng Su
AAML
219
0
0
11 Jun 2025
mSTEB: Massively Multilingual Evaluation of LLMs on Speech and Text Tasks
Luel Hagos Beyene
Vivek Verma
Min Ma
Jesujoba Oluwadara Alabi
Fabian David Schmidt
Joyce Nakatumba-Nabende
David Ifeoluwa Adelani
319
2
0
10 Jun 2025
GradEscape: A Gradient-Based Evader Against AI-Generated Text Detectors
Wenlong Meng
Shuguo Fan
Chengkun Wei
Min Chen
Yuwei Li
Yuanchao Zhang
Zhikun Zhang
Wenzhi Chen
193
0
0
09 Jun 2025
GLOS: Sign Language Generation with Temporally Aligned Gloss-Level Conditioning
T. Lee
Hyeongjin Nam
Gyeongsik Moon
Kyoung Mu Lee
SLR
159
0
0
09 Jun 2025
Cost-Optimal Active AI Model Evaluation
Anastasios Nikolas Angelopoulos
Jacob Eisenstein
Jonathan Berant
Alekh Agarwal
Adam Fisch
189
2
0
09 Jun 2025
OneSug: The Unified End-to-End Generative Framework for E-commerce Query Suggestion
Xian Guo
Ben Chen
Siyuan Wang
Ying Yang
Chenyi Lei
Yuqing Ding
Han Li
206
6
0
07 Jun 2025
A Culturally-Rich Romanian NLP Dataset from "Who Wants to Be a Millionaire?" Videos
Alexandru-Gabriel Ganea
Antonia-Adelina Popovici
Adrian-Marius Dumitran
163
0
0
06 Jun 2025
Elementary Math Word Problem Generation using Large Language Models
Nimesh Ariyarathne
Harshani Bandara
Yasith Heshan
Omega Gamage
Surangika Ranathunga
...
Gayathri Lihinikaduarachchi
Tharoosha Vihidun
Meenambika Chandirakumar
Sanujen Premakumar
Sanjula Gathsara
AI4Ed
223
0
0
06 Jun 2025
A Systematic Review of Poisoning Attacks Against Large Language Models
Neil Fendley
Edward W. Staley
Joshua Carney
William Redman
Marie Chau
Nathan G. Drenkow
AAML, PILM
215
5
0
06 Jun 2025
LLM-based phoneme-to-grapheme for phoneme-based speech recognition
Te Ma
Min Bi
Saierdaer Yusuyin
Hao Huang
Zhijian Ou
284
2
0
05 Jun 2025
MELABenchv1: Benchmarking Large Language Models against Smaller Fine-Tuned Models for Low-Resource Maltese NLP
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Kurt Micallef
Claudia Borg
261
1
0
04 Jun 2025
Culture Matters in Toxic Language Detection in Persian
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Zahra Bokaei
Walid Magdy
Bonnie Webber
125
0
0
03 Jun 2025
IndoSafety: Culturally Grounded Safety for LLMs in Indonesian Languages
Muhammad Falensi Azmi
Muhammad Dehan Al Kautsar
Alfan Farizki Wicaksono
Fajri Koto
234
2
0
03 Jun 2025
Echoes of BERT: Do Modern Language Models Rediscover the Classical NLP Pipeline?
Michael Li
Nishant Subramani
KELM
224
1
0
02 Jun 2025
Multilingual Definition Modeling
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Edison Marrese-Taylor
Erica K. Shimomoto
Alfredo Solano
Enrique Reid
208
0
0
02 Jun 2025
Entity Image and Mixed-Modal Image Retrieval Datasets
Cristian-Ioan Blaga
Paul Suganthan
Sahil Dua
Krishna Srinivasan
Enrique Alfonseca
Peter Dornbach
Tom Duerig
I. Zitouni
Zhe Dong
VLM
196
0
0
02 Jun 2025
The State of Large Language Models for African Languages: Progress and Challenges
Kedir Yassin Hussen
W. Sewunetie
Abinew Ali Ayele
Sukairaj Hafiz Imam
Shamsuddeen Hassan Muhammad
Seid Muhie Yimam
307
3
0
02 Jun 2025
IndicRAGSuite: Large-Scale Datasets and a Benchmark for Indian Language RAG Systems
Pasunuti Prasanjith
Prathmesh B More
Anoop Kunchukuttan
Mary Dabre
RALM
250
0
0
02 Jun 2025
Pi-SQL: Enhancing Text-to-SQL with Fine-Grained Guidance from Pivot Programming Languages
Yongdong Chi
Hanqing Wang
Zonghan Yang
Jian Yang
Xiao Yan
Yun-Nung Chen
Guanhua Chen
224
0
0
01 Jun 2025
LEMONADE: A Large Multilingual Expert-Annotated Abstractive Event Dataset for the Real World
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Sina J. Semnani
Pingyue Zhang
Wanyue Zhai
Haozhuo Li
Ryan Beauchamp
Trey Billing
Katayoun Kishi
Pengfei Yu
Monica S. Lam
285
2
0
01 Jun 2025
Massively Multilingual Adaptation of Large Language Models Using Bilingual Translation Data
Shaoxiong Ji
Zihao Li
Jaakko Paavola
Indraneil Paul
Hengyu Luo
CLL
463
3
0
31 May 2025
Towards Multi-dimensional Evaluation of LLM Summarization across Domains and Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Hyangsuk Min
Yuho Lee
Minjeong Ban
Jiaqi Deng
Nicole Hee-Yeon Kim
Taewon Yun
Hang Su
Jason (Jinglun) Cai
Hwanjun Song
ELM
238
3
0
31 May 2025
Geo-Sign: Hyperbolic Contrastive Regularisation for Geometrically Aware Sign Language Translation
Edward Fish
Richard Bowden
SLR
561
4
0
30 May 2025
Improving Language and Modality Transfer in Translation by Character-level Modeling
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Ioannis Tsiamas
David Dale
Marta R. Costa-jussá
128
3
0
30 May 2025
Disentangling Language and Culture for Evaluating Multilingual Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jiahao Ying
Wei Tang
Yiran Zhao
Yixin Cao
Yu Rong
Wenxuan Zhang
ELM
194
2
0
30 May 2025
Synthetic Document Question Answering in Hungarian
Jonathan Li
Zoltan Csaki
Nidhi Hiremath
Etash Guha
Fenglu Hong
Edward Ma
Urmish Thakker
200
0
0
29 May 2025
LegalSearchLM: Rethinking Legal Case Retrieval as Legal Elements Generation
Chaeeun Kim
Jinu Lee
Wonseok Hwang
AILaw, RALM, ELM
321
1
0
28 May 2025
Comprehensive Evaluation on Lexical Normalization: Boundary-Aware Approaches for Unsegmented Languages
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
S. Higashiyama
Masao Utiyama
135
1
0
28 May 2025
Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance
Shintaro Ozaki
Tatsuya Hiraoka
Hiroto Otake
Hiroki Ouchi
Masaru Isonuma
...
Kentaro Inui
Taro Watanabe
Yusuke Miyao
Yohei Oseki
Yu Takagi
LRM
174
0
0
27 May 2025
Trans-EnV: A Framework for Evaluating the Linguistic Robustness of LLMs Against English Varieties
Jiyoung Lee
Seungho Kim
Jieun Han
Jun-Min Lee
Kitaek Kim
Alice Oh
E. Choi
279
2
0
27 May 2025
Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead
Jesujoba Oluwadara Alabi
Michael A. Hedderich
David Ifeoluwa Adelani
Dietrich Klakow
465
4
0
27 May 2025
Multilingual Pretraining for Pixel Language Models
Ilker Kesen
Jonas F. Lotz
Ingo Ziegler
Phillip Rust
Desmond Elliott
MLLM, VLM
327
1
0
27 May 2025