mT5: A massively multilingual pre-trained text-to-text transformer

22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel

Papers citing "mT5: A massively multilingual pre-trained text-to-text transformer"

50 / 1,561 papers shown
Beyond Specialization: Benchmarking LLMs for Transliteration of Indian Languages
Gulfarogh Azam
Mohd Sadique
Saif Ali
Mohammad Nadeem
Erik Cambria
S. Sohail
M. Alam
123
0
0
26 May 2025
Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model
Khalil Hennara
Muhammad Hreden
Mohamed Motaism Hamed
Zeina Aldallal
Sara Chrouf
Safwan AlModhayan
255
1
0
23 May 2025
Breaking mBad! Supervised Fine-tuning for Cross-Lingual Detoxification
Himanshu Beniwal
Y. Kim
Maarten Sap
Soham Dan
Thomas Hartvigsen
CLL
310
0
0
22 May 2025
Semantic Pivots Enable Cross-Lingual Transfer in Large Language Models
Kaiyu He
Tong Zhou
Yubo Chen
Delai Qiu
Shengping Liu
Kang Liu
Jun Zhao
LRM
213
0
0
22 May 2025
SELF: Self-Extend the Context Length With Logistic Growth Function
Phat Thanh Dang
Saahil Thoppay
Wang Yang
Qifan Wang
Vipin Chaudhary
Xiaotian Han
247
0
0
22 May 2025
Shared Path: Unraveling Memorization in Multilingual LLMs through Language Similarities
Xiaoyu Luo
Yiyi Chen
Johannes Bjerva
Qiongxiu Li
248
1
0
21 May 2025
LAGO: Few-shot Crosslingual Embedding Inversion Attacks via Language Similarity-Aware Graph Optimization
Wenrui Yu
Yiyi Chen
Johannes Bjerva
Sokol Kosta
Qiongxiu Li
240
0
0
21 May 2025
JNLP at SemEval-2025 Task 11: Cross-Lingual Multi-Label Emotion Detection Using Generative Models
Jieying Xue
Phuong Minh Nguyen
Minh Le Nguyen
Xin Liu
181
0
0
19 May 2025
Video-GPT via Next Clip Diffusion
Shaobin Zhuang
Zhipeng Huang
Ying Zhang
Fangyikang Wang
Canmiao Fu
Binxin Yang
Chong Sun
Chen Li
Yali Wang
DiffM VGen
597
5
0
18 May 2025
MergeBench: A Benchmark for Merging Domain-Specialized LLMs
Yifei He
Siqi Zeng
Yuzheng Hu
Rui Yang
Tong Zhang
Han Zhao
MoMe ALM
665
6
0
16 May 2025
Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline
Hrishit Madhavi
Jacob Cherian
Yuvraj Khamkar
Dhananjay Bhagat
VLM
251
1
0
16 May 2025
Designing and Contextualising Probes for African Languages
Wisdom Aduah
Francois Meyer
307
0
0
15 May 2025
Aquarius: A Family of Industry-Level Video Generation Models for Marketing Scenarios
Huafeng Shi
Jianzhong Liang
Rongchang Xie
Xian Wu
Cheng Chen
Chang Liu
VGen
358
0
0
14 May 2025
Lost in Transliteration: Bridging the Script Gap in Neural IR
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Andreas Chari
Iadh Ounis
Sean MacAvaney
208
2
0
13 May 2025
Scalable Multi-Stage Influence Function for Large Language Models via Eigenvalue-Corrected Kronecker-Factored Parameterization
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Yuntai Bao
Xuhong Zhang
Tianyu Du
Xinkui Zhao
Jiang Zong
Hao Peng
Yuxiang Cai
TDI
353
0
0
08 May 2025
Overcoming Data Scarcity in Generative Language Modelling for Low-Resource Languages: A Systematic Review
Josh McGiff
Nikola S. Nikolov
339
4
0
07 May 2025
Token-free Models for Sarcasm Detection
Maitreya Sonawane
Maitreya Sonawane
Kanika Agarwal
Nishanth Sanjeev
235
0
0
02 May 2025
Investigating Task Arithmetic for Zero-Shot Information Retrieval
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Marco Braga
Pranav Kasela
Alessandro Raganato
G. Pasi
RALM
334
1
0
01 May 2025
Improving Informally Romanized Language Identification
Adrian Benton
Alexander Gutkin
Christo Kirov
Brian Roark
360
0
0
30 Apr 2025
Robust Misinformation Detection by Visiting Potential Commonsense Conflict
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Bing Wang
Ximing Li
Chaofan Li
Bingrui Zhao
Bo Fu
Renchu Guan
Shengsheng Wang
236
1
0
30 Apr 2025
A Generative-AI-Driven Claim Retrieval System Capable of Detecting and Retrieving Claims from Social Media Platforms in Multiple Languages
Ivan Vykopal
Martin Hyben
Robert Moro
Michal Gregor
Jakub Simko
373
1
0
29 Apr 2025
Enhancing Non-Core Language Instruction-Following in Speech LLMs via Semi-Implicit Cross-Lingual CoT Reasoning
Hongfei Xue
Yufeng Tang
Hexin Liu
Jun Zhang
Xuelong Geng
Lei Xie
LRM
222
1
0
29 Apr 2025
RepText: Rendering Visual Text via Replicating
Haobo Wang
Yongjun Xu
Yongqian Li
Jiajun Li
Chaowei Zhang
Jingchao Wang
Kejia Yang
Z. Chen
VLM
278
2
0
28 Apr 2025
Low-Resource Neural Machine Translation Using Recurrent Neural Networks and Transfer Learning: A Case Study on English-to-Igbo
Ocheme Anthony Ekle
Biswarup Das
166
1
0
24 Apr 2025
Trillion 7B Technical Report
Sungjun Han
Juyoung Suk
Suyeong An
Hyungguk Kim
Kyuseok Kim
Wonsuk Yang
Seungtaek Choi
Jamin Shin
857
4
0
21 Apr 2025
Kuwain 1.5B: An Arabic SLM via Language Injection
Khalil Hennara
Sara Chrouf
Mohamed Motaism Hamed
Zeina Aldallal
Omar Hadid
Safwan AlModhayan
262
3
0
21 Apr 2025
Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling
Shaomu Tan
Christof Monz
280
1
0
18 Apr 2025
MorphTok: Morphologically Grounded Tokenization for Indian Languages
Maharaj Brahma
Ayush Maheshwari
A. Singh
D. Adiga
Smruti Bhate
Ganesh Ramakrishnan
Rohit Saluja
Maunendra Sankar Desarkar
308
1
0
14 Apr 2025
Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data
Shuai Zhao
Linchao Zhu
Yi Yang
Yi Yang
405
4
0
14 Apr 2025
Myanmar XNLI: Building a Dataset and Exploring Low-resource Approaches to Natural Language Inference with Myanmar
Language Resources and Evaluation (LRE), 2025
Aung Kyaw Htet
Mark Dras
126
4
0
13 Apr 2025
Lugha-Llama: Adapting Large Language Models for African Languages
Happy Buzaaba
Alexander Wettig
David Ifeoluwa Adelani
Christiane Fellbaum
243
5
0
09 Apr 2025
NNN: Next-Generation Neural Networks for Marketing Measurement
Thomas Mulc
Mike Anderson
Paul Cubre
Huikun Zhang
Ivy Liu
Saket Kumar
961
0
0
08 Apr 2025
Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi
Monojit Choudhury
Shivam Chauhan
Rocktim Jyoti Das
Dhruv Sahnan
Xudong Han
...
Rituraj Joshi
Gurpreet Gosal
Avraham Sheinin
Natalia Vassilieva
Preslav Nakov
245
5
0
08 Apr 2025
Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation
Biao Zhang
Fedor Moiseev
Joshua Ainslie
Paul Suganthan
Min Ma
Surya Bhupatiraju
Fede Lebron
Orhan Firat
Armand Joulin
Zhe Dong
AI4CE
186
11
0
08 Apr 2025
Regional Tiny Stories: Using Small Models to Compare Language Learning and Tokenizer Performance
Nirvan Patil
Malhar Abhay Inamdar
Agnivo Gosai
Guruprasad Pathak
Anish Joshi
Aryan Sagavekar
Anish Joshirao
Raj Abhijit Dandekar
Rajat Dandekar
Sreedath Panat
337
1
0
07 Apr 2025
On the Connection Between Diffusion Models and Molecular Dynamics
Liam Harcombe
Timothy T. Duignan
DiffM
297
1
0
04 Apr 2025
A$^\text{T}$A: Adaptive Transformation Agent for Text-Guided Subject-Position Variable Background Inpainting
Computer Vision and Pattern Recognition (CVPR), 2025
Yizhe Tang
Zhimin Sun
Yuzhen Du
Ran Yi
Guangben Lu
T. Hu
Luying Li
Lizhuang Ma
Fangyuan Zou
DiffM
204
3
0
02 Apr 2025
Language Models at the Syntax-Semantics Interface: A Case Study of the Long-Distance Binding of Chinese Reflexive ziji
International Conference on Computational Linguistics (COLING), 2025
Xiulin Yang
321
2
0
02 Apr 2025
Catch Me if You Search: When Contextual Web Search Results Affect the Detection of Hallucinations
Computers in Human Behavior (CHB), 2025
Mahjabin Nahar
Eun-Ju Lee
Jin Won Park
Dongwon Lee
HILM
517
0
0
01 Apr 2025
VNJPTranslate: A comprehensive pipeline for Vietnamese-Japanese translation
Hoang Hai Phan
Nguyen Duc Minh Vu
Nam Dang Phuong
205
0
0
01 Apr 2025
The Challenge of Achieving Attributability in Multilingual Table-to-Text Generation with Question-Answer Blueprints
International Journal of Undergraduate Research and Creative Activities (IJURCA), 2025
Aden Haussmann
LMTD
321
0
0
29 Mar 2025
Improving Low-Resource Retrieval Effectiveness using Zero-Shot Linguistic Similarity Transfer
European Conference on Information Retrieval (ECIR), 2025
Andreas Chari
Sean MacAvaney
Iadh Ounis
204
0
0
28 Mar 2025
Beyond Vanilla Fine-Tuning: Leveraging Multistage, Multilingual, and Domain-Specific Methods for Low-Resource Machine Translation
Sarubi Thillainathan
Songchen Yuan
E. Lee
Sanath Jayasena
Surangika Ranathunga
265
1
0
28 Mar 2025
Low-resource Information Extraction with the European Clinical Case Corpus
Soumitra Ghosh
Begona Altuna
Saeed Farzi
Pietro Ferrazzi
A. Lavelli
Giulia Mezzanotte
Manuela Speranza
Bernardo Magnini
227
1
0
26 Mar 2025
PAD: Towards Efficient Data Generation for Transfer Learning Using Phrase Alignment
Jong Myoung Kim
Young-Jun Lee
Ho-Jin Choi
Sangkeun Jung
240
0
0
24 Mar 2025
PM4Bench: Benchmarking Large Vision-Language Models with Parallel Multilingual Multi-Modal Multi-task Corpus
Junyuan Gao
Jiahe Song
Jinlin Wu
Runchuan Zhu
Guanlin Shen
...
Weijia Li
Bin Wang
Dahua Lin
Lijun Wu
Conghui He
331
5
0
24 Mar 2025
LANGALIGN: Enhancing Non-English Language Models via Cross-Lingual Embedding Alignment
Jong Myoung Kim
Young-Jun Lee
Ho-Jin Choi
Sangkeun Jung
319
0
0
24 Mar 2025
Language-specific Neurons Do Not Facilitate Cross-Lingual Transfer
Soumen Kumar Mondal
Sayambhu Sen
Abhishek Singhania
Preethi Jyothi
275
7
0
21 Mar 2025
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning
Zhaowei Liu
X. Guo
Fangqi Lou
Lingfeng Zeng
Jinyi Niu
...
Xueqian Zhao
Chao Li
Sheng Xu
Dezhi Chen
Yun Chen
ReLM AIFin OffRL AI4TS LRM
300
48
0
20 Mar 2025
A Review on Large Language Models for Visual Analytics
Navya Sonal Agarwal
Sanjay Kumar Sonbhadra
354
7
0
19 Mar 2025