mT5: A massively multilingual pre-trained text-to-text transformer

22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel

Papers citing "mT5: A massively multilingual pre-trained text-to-text transformer"

50 / 1,561 papers shown
Bridging Dialects: Translating Standard Bangla to Regional Variants Using Neural Models
Md. Arafat Alam Khandaker
Ziyan Shirin Raha
Bidyarthi Paul
Tashreef Muhammad
106
2
0
10 Jan 2025
AFRIDOC-MT: Document-level MT Corpus for African Languages
Jesujoba Oluwadara Alabi
Israel Abebe Azime
Miaoran Zhang
C. España-Bonet
Rachel Bawden
...
Shamsuddeen Hassan Muhammad
Neo Putini
David O. Ademuyiwa
Andrew Caines
Dietrich Klakow
358
2
0
10 Jan 2025
BabyLMs for isiXhosa: Data-Efficient Language Modelling in a Low-Resource Context
Alexis Matzopoulos
Charl Hendriks
Hishaam Mahomed
Francois Meyer
277
0
0
08 Jan 2025
Empowering Bengali Education with AI: Solving Bengali Math Word Problems through Transformer Models
Jalisha Jashim Era
Bidyarthi Paul
Tahmid Sattar Aothoi
Mirazur Rahman Zim
Faisal Muhammad Shah
145
5
0
07 Jan 2025
CLIX: Cross-Lingual Explanations of Idiomatic Expressions
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Aaron Gluck
Katharina von der Wense
Maria Pacheco
270
1
0
06 Jan 2025
Swift Cross-Dataset Pruning: Enhancing Fine-Tuning Efficiency in Natural Language Understanding
International Conference on Computational Linguistics (COLING), 2025
Binh-Nguyen Nguyen
Yang He
239
2
0
05 Jan 2025
BeliN: A Novel Corpus for Bengali Religious News Headline Generation using Contextual Feature Fusion
Natural Language Processing Journal (JNLP), 2025
Md Osama
Ashim Dey
Kawsar Ahmed
Muhammad Ashad Kabir
342
0
0
03 Jan 2025
Sinhala Transliteration: A Comparative Analysis Between Rule-based and Seq2Seq Approaches
Yomal De Mel
Kasun Wickramasinghe
Nisansa de Silva
Surangika Ranathunga
240
4
0
03 Jan 2025
AfriHG: News headline generation for African Languages
Toyib Ogunremi
Serah Akojenu
Anthony Soronnadi
Olubayo Adekanmbi
David Ifeoluwa Adelani
140
1
0
31 Dec 2024
LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs
LLM-jp
Akiko Aizawa
Eiji Aramaki
Bowen Chen
Fei Cheng
...
Yuya Yamamoto
Yusuke Yamauchi
Hitomi Yanaka
Rio Yokota
Koichiro Yoshino
238
24
0
31 Dec 2024
YAD: Leveraging T5 for Improved Automatic Diacritization of Yorùbá Text
Akindele Michael Olawole
Jesujoba Oluwadara Alabi
Aderonke Busayo Sakpere
David Ifeoluwa Adelani
171
3
0
31 Dec 2024
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Yue Zhang
Ziqiao Ma
Jialu Li
Yanyuan Qiao
Zun Wang
J. Chai
Qi Wu
Joey Tianyi Zhou
Parisa Kordjamshidi
LRM
367
62
0
31 Dec 2024
Overview of the First Workshop on Language Models for Low-Resource Languages (LoResLM 2025)
Hansi Hettiarachchi
Tharindu Ranasinghe
Paul Rayson
R. Mitkov
M. Gaber
Damith Premasiri
Fiona Anting Tan
Lasitha Uyangodage
AI4CE
286
1
0
20 Dec 2024
The First Multilingual Model For The Detection of Suicide Texts
Rodolfo Zevallos
Annika Schoene
J. Ortega
144
1
0
20 Dec 2024
Curriculum Learning for Cross-Lingual Data-to-Text Generation With Noisy Data
Kancharla Aditya Hari
Manish Gupta
Vasudeva Varma
271
0
0
18 Dec 2024
Deploying Foundation Model Powered Agent Services: A Survey
Wenchao Xu
Jinyu Chen
Peirong Zheng
Xiaoquan Yi
Tianyi Tian
...
Quan Wan
Yining Qi
Yunfeng Fan
Qinliang Su
Xuemin Shen
AI4CE
447
5
0
18 Dec 2024
Cross-Lingual Transfer of Debiasing and Detoxification in Multilingual LLMs: An Extensive Investigation
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Vera Neplenbroek
Arianna Bisazza
Raquel Fernández
588
3
0
18 Dec 2024
LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Reasoning
Hongbin Zhang
Kai Chen
Xuefeng Bai
Yang Xiang
Min Zhang
337
0
0
17 Dec 2024
LLMs are Also Effective Embedding Models: An In-depth Overview
Chongyang Tao
Tao Shen
Shen Gao
Junshuo Zhang
Zhen Li
Kai Hua
Wenpeng Hu
Zhengwei Tao
Shuai Ma
367
27
0
17 Dec 2024
Enabling Low-Resource Language Retrieval: Establishing Baselines for Urdu MS MARCO
European Conference on Information Retrieval (ECIR), 2024
Umer Butt
Stalin Veranasi
Günter Neumann
324
0
0
17 Dec 2024
RoundTripOCR: A Data Generation Technique for Enhancing Post-OCR Error Correction in Low-Resource Devanagari Languages
ICON (ICON), 2024
Harshvivek Kashid
Pushpak Bhattacharyya
314
2
0
14 Dec 2024
Text Generation Models for Luxembourgish with Limited Data: A Balanced Multilingual Strategy
Alistair Plum
Tharindu Ranasinghe
Christoph Purschke
363
5
0
12 Dec 2024
Neural Text Normalization for Luxembourgish using Real-Life Variation Data
Anne-Marie Lutgen
Alistair Plum
Christoph Purschke
Barbara Plank
258
2
0
12 Dec 2024
The Codec Language Model-based Zero-Shot Spontaneous Style TTS System for CoVoC Challenge 2024
International Symposium on Chinese Spoken Language Processing (ISCSLP), 2024
Shuoyi Zhou
Yixuan Zhou
Weiqing Li
Jun Chen
Runchuan Ye
Weihao Wu
Zijian Lin
Shun Lei
Zhiyong Wu
429
1
0
02 Dec 2024
MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost
Sen Xing
Muyan Zhong
Zeqiang Lai
Liangchen Li
Jing Liu
Yaohui Wang
Jifeng Dai
Wenhai Wang
450
6
0
02 Dec 2024
Empowering the Deaf and Hard of Hearing Community: Enhancing Video Captions Using Large Language Models
Nadeen Fathallah
Monika Bhole
Steffen Staab
328
2
0
30 Nov 2024
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
International Conference on Learning Representations (ICLR), 2024
Angelika Romanou
Negar Foroutan
Anna Sotnikova
Zeming Chen
Sree Harsha Nelaturu
...
Mike Zhang
Imanol Schlag
Marzieh Fadaee
Sara Hooker
Antoine Bosselut
ELM
360
30
0
29 Nov 2024
Towards Santali Linguistic Inclusion: Building the First Santali-to-English Translation Model using mT5 Transformer and Data Augmentation
Syed Mohammed Mostaque Billah
Ateya Ahmed Subarna
Sudipta Nandi Sarna
Ahmad Shawkat Wasit
Anika Fariha
Asif Sushmit
Arig Yousuf Sadeque
181
1
0
29 Nov 2024
How far can bias go? -- Tracing bias from pretraining data to alignment
Marion Thaler
Abdullatif Köksal
Alina Leidinger
Anna Korhonen
Hinrich Schutze
386
4
0
28 Nov 2024
Open-Sora Plan: Open-Source Large Video Generation Model
Bin Lin
Yunyang Ge
Xinhua Cheng
Zongjian Li
Bin Zhu
...
Zhang Pan
Xing Zhou
Shaoling Dong
Yonghong Tian
Li-xin Yuan
VLM
VGen
415
188
0
28 Nov 2024
Leveraging Large Language Models and Topic Modeling for Toxicity Classification
International Conference on Computing, Networking and Communications (ICNC), 2024
Haniyeh Ehsani Oskouie
Christina Chance
Claire Huang
Margaret Capetz
Elizabeth Eyeson
Majid Sarrafzadeh
177
5
0
26 Nov 2024
DiffSLT: Enhancing Diversity in Sign Language Translation via Diffusion Model
Pattern Recognition Letters (PR), 2024
JiHwan Moon
Jihoon Park
Jungeun Kim
Jongseong Bae
Hyeongwoo Jeon
Ha Young Kim
284
2
0
26 Nov 2024
Transforming NLU with Babylon: A Case Study in Development of Real-time, Edge-Efficient, Multi-Intent Translation System for Automated Drive-Thru Ordering
Mostafa Varzaneh
Pooja Voladoddi
Tanmay Bakshi
Uma Gunturi
220
0
0
22 Nov 2024
UnifiedCrawl: Aggregated Common Crawl for Affordable Adaptation of LLMs on Low-Resource Languages
Bethel Melesse Tessema
Akhil Kedia
Tae-Sun Chung
186
0
0
21 Nov 2024
Training Bilingual LMs with Data Constraints in the Targeted Language
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Skyler Seto
Maartje ter Hoeve
He Bai
Natalie Schluter
David Grangier
367
2
0
20 Nov 2024
Heuristic-Free Multi-Teacher Learning
Huy Thong Nguyen
En-Hung Chu
Lenord Melvix
Jazon Jiao
Chunglin Wen
Benjamin Louie
321
0
0
19 Nov 2024
LLäMmlein: Transparent, Compact and Competitive German-Only Language Models from Scratch
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Jan Pfister
Julia Wunderle
Andreas Hotho
526
2
0
17 Nov 2024
BanglaDialecto: An End-to-End AI-Powered Regional Speech Standardization
BigData Congress [Services Society] (BSS), 2024
Md. Nazmus Sadat Samin
Jawad Ibn Ahad
Tanjila Ahmed Medha
Fuad Rahman
M. R. Amin
Nabeel Mohammed
Shafin Rahman
204
1
0
16 Nov 2024
Xmodel-1.5: An 1B-scale Multilingual LLM
Wang Qun
Liu Yang
Lin Qingquan
Jiang Ling
LRM
325
0
0
15 Nov 2024
TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models
Jonathan Fhima
Elad Ben Avraham
Oren Nuriel
Yair Kittenplon
Roy Ganz
Aviad Aberdam
Ron Litman
VLM
232
1
0
07 Nov 2024
The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities
International Conference on Learning Representations (ICLR), 2024
Zhaofeng Wu
Xinyan Velocity Yu
Dani Yogatama
Jiasen Lu
Yoon Kim
AIFin
457
36
0
07 Nov 2024
No Culture Left Behind: ArtELingo-28, a Benchmark of WikiArt with Captions in 28 Languages
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Youssef Mohamed
Runjia Li
Ibrahim Said Ahmad
Kilichbek Haydarov
Juil Sock
Kenneth Church
Mohamed Elhoseiny
VLM
187
15
0
06 Nov 2024
Deploying Multi-task Online Server with Large Language Model
International Conference on Computational Linguistics (COLING), 2024
Yincen Qu
Chao Ma
Xiangying Dai
Hui Zhou
Yiting Wu
Hengyue Liu
201
0
0
06 Nov 2024
The Future of Intelligent Healthcare: A Systematic Analysis and Discussion on the Integration and Impact of Robots Using Large Language Models for Healthcare
Souren Pashangpour
Goldie Nejat
LM&MA
217
12
0
05 Nov 2024
Leveraging LLM Tutoring Systems for Non-Native English Speakers in Introductory CS Courses
Ismael Villegas Molina
Audria Montalvo
Benjamin Ochoa
Paul Denny
Leo Porter
AI4Ed
186
13
0
05 Nov 2024
GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages
Neural Information Processing Systems (NeurIPS), 2024
Amir Hossein Kargaran
François Yvon
Hinrich Schutze
VLM
278
11
0
31 Oct 2024
Not All Languages are Equal: Insights into Multilingual Retrieval-Augmented Generation
Suhang Wu
Jialong Tang
Baosong Yang
Ante Wang
Kaidi Jia
Jiawei Yu
Junfeng Yao
Jinsong Su
163
3
0
29 Oct 2024
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
International Conference on Learning Representations (ICLR), 2024
Julie Kallini
Shikhar Murty
Christopher D. Manning
Christopher Potts
Róbert Csordás
391
14
0
28 Oct 2024
LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization
International Conference on Learning Representations (ICLR), 2024
Jui-Nan Yen
Si Si
Zhao Meng
Felix X. Yu
Sai Surya Duvvuri
Inderjit Dhillon
Cho-Jui Hsieh
Sanjiv Kumar
234
14
0
27 Oct 2024
Think Carefully and Check Again! Meta-Generation Unlocking LLMs for Low-Resource Cross-Lingual Summarization
Zhecheng Li
Yijiao Wang
Bryan Hooi
Yujun Cai
Naifan Cheung
Nanyun Peng
Kai-Wei Chang
367
1
0
26 Oct 2024