Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2010.11934
Cited By
v1
v2
v3 (latest)
mT5: A massively multilingual pre-trained text-to-text transformer
22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (4 upvotes)
Papers citing
"mT5: A massively multilingual pre-trained text-to-text transformer"
50 / 1,561 papers shown
Title
Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale
Hasan Hammoud
Mohammad Zbeeb
Bernard Ghanem
112
2
0
17 Sep 2025
DeDisCo at the DISRPT 2025 Shared Task: A System for Discourse Relation Classification
Zhuoxuan Ju
Jingni Wu
Abhishek Purushothama
Amir Zeldes
170
1
0
15 Sep 2025
SignMouth: Leveraging Mouthing Cues for Sign Language Translation by Multimodal Contrastive Fusion
Wenfang Wu
Tingting Yuan
Yupeng Li
Daling Wang
Xiaoming Fu
SLR
318
0
0
12 Sep 2025
PolyTruth: Multilingual Disinformation Detection using Transformer-Based Language Models
Zaur Gouliev
Jennifer Waters
Chengqian Wang
79
1
0
12 Sep 2025
MultimodalHugs: Enabling Sign Language Processing in Hugging Face
Gerard Sant
Zifan Jiang
Carlos Escolano
Amit Moryossef
Mathias Müller
Rico Sennrich
Sarah Ebling
SLR
219
0
0
10 Sep 2025
Building High-Quality Datasets for Portuguese LLMs: From Common Crawl Snapshots to Industrial-Grade Corpora
Thales Sales Almeida
Rodrigo Nogueira
Hélio Pedrini
144
4
0
10 Sep 2025
Small Open Models Achieve Near Parity with Large Models in Low Resource Literary Translation at a Fraction of the Cost
Mihai Nadas
Laura Diosan
Andreea Tomescu
Andrei Piscoran
144
0
0
09 Sep 2025
mmBERT: A Modern Multilingual Encoder with Annealed Language Learning
Marc Marone
Orion Weller
William Fleshman
Eugene Yang
Dawn J Lawrie
Benjamin Van Durme
184
8
0
08 Sep 2025
Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining
Deniz Bayazit
Aaron Mueller
Antoine Bosselut
140
0
0
05 Sep 2025
Entropy2Vec: Crosslingual Language Modeling Entropy as End-to-End Learnable Language Representations
Patrick Amadeus Irawan
Ryandito Diandaru
Belati Jagad Bintang Syuhada
Randy Zakya Suchrady
Alham Fikri Aji
Genta Indra Winata
Fajri Koto
Samuel Cahyawijaya
126
1
0
05 Sep 2025
OneSearch: A Preliminary Exploration of the Unified End-to-End Generative Framework for E-commerce Search
Ben Chen
X. Guo
Siyuan Wang
Zihan Liang
Yue Lv
...
Jing Chen
Chenyi Lei
Wenwu Ou
Han Li
Kun Gai
197
6
0
03 Sep 2025
MixedG2P-T5: G2P-free Speech Synthesis for Mixed-script texts using Speech Self-Supervised Learning and Language Model
Joonyong Park
Daisuke Saito
Nobuaki Minematsu
78
0
0
01 Sep 2025
Zero-shot Cross-lingual NER via Mitigating Language Difference: An Entity-aligned Translation Perspective
Zhihao Zhang
Sophia Yat Mei Lee
Dong Zhang
Shoushan Li
Guodong Zhou
113
0
0
01 Sep 2025
AdaptCache: KV Cache Native Storage Hierarchy for Low-Delay and High-Quality Language Model Serving
Shaoting Feng
Hanchen Li
Kuntai Du
Zhuohan Gu
Yuhan Liu
...
Siddhant Ray
Samuel Shen
Yihua Cheng
Ganesh Ananthanarayanan
Junchen Jiang
160
1
0
28 Aug 2025
Debiasing Multilingual LLMs in Cross-lingual Latent Space
Qiwei Peng
Guimin Hu
Yekun Chai
Anders Søgaard
132
1
0
25 Aug 2025
Speculating LLMs' Chinese Training Data Pollution from Their Tokens
Qingjie Zhang
Di Wang
Haoting Qian
Liu Yan
Tianwei Zhang
Ke Xu
Qi Li
Minlie Huang
Hewu Li
Han Qiu
94
1
0
25 Aug 2025
Evaluating the Impact of Verbal Multiword Expressions on Machine Translation
Linfeng Liu
Saptarshi Ghosh
Tianyu Jiang
84
0
0
24 Aug 2025
Quantifying Language Disparities in Multilingual Large Language Models
Songbo Hu
Ivan Vulić
Anna Korhonen
116
2
0
23 Aug 2025
OpenWHO: A Document-Level Parallel Corpus for Health Translation in Low-Resource Languages
Raphael Merx
Hanna Suominen
Trevor Cohn
Ekaterina Vylomova
267
0
0
22 Aug 2025
Long Chain-of-Thought Reasoning Across Languages
Josh Barua
Seun Eisape
Kayo Yin
Alane Suhr
LRM
140
1
0
20 Aug 2025
In2x at WMT25 Translation Task
Lei Pang
Hanyi Mao
Quanjia Xiao
HaiXiao Liu
Xiangyi Li
112
0
0
20 Aug 2025
When Alignment Hurts: Decoupling Representational Spaces in Multilingual Models
Ahmed Elshabrawy
Hour Kaing
Israfel Salazar
Alham Fikri Aji
Hideki Tanaka
Masao Utiyama
Mary Dabre
96
0
0
18 Aug 2025
Is GPT-OSS Good? A Comprehensive Evaluation of OpenAI's Latest Open Source Models
Ziqian Bi
Keyu Chen
Chiung-Yi Tseng
Danyang Zhang
Pohsun Feng
...
Junming Huang
Jibin Guan
Junfeng Hao
Junhao Song
Junhao Song
ELM
194
3
0
17 Aug 2025
Advancing Cross-lingual Aspect-Based Sentiment Analysis with LLMs and Constrained Decoding for Sequence-to-Sequence Models
International Conference on Agents and Artificial Intelligence (ICAART), 2025
Jakub Šmíd
P. Pribán
Pavel Král
113
6
0
14 Aug 2025
Cross-Prompt Encoder for Low-Performing Languages
Beso Mikaberidze
Teimuraz Saghinadze
Simon Ostermann
Philipp Müller
88
0
0
14 Aug 2025
Improving Generative Cross-lingual Aspect-Based Sentiment Analysis with Constrained Decoding
Jakub Šmíd
P. Pribán
Pavel Král
AI4CE
89
0
0
14 Aug 2025
Evaluating LLMs on Chinese Idiom Translation
Cai Yang
Yao Dou
David Heineman
Xiaofeng Wu
Wei Xu
150
0
0
14 Aug 2025
Large Language Models for Summarizing Czech Historical Documents and Beyond
International Conference on Agents and Artificial Intelligence (ICAART), 2025
Václav Tran
Jakub Šmíd
J. Martínek
Ladislav Lenc
Pavel Král
116
1
0
14 Aug 2025
Cross-lingual Aspect-Based Sentiment Analysis: A Survey on Tasks, Approaches, and Challenges
Information Fusion (Inf. Fusion), 2025
Jakub Šmíd
Pavel Král
124
7
0
13 Aug 2025
LACA: Improving Cross-lingual Aspect-Based Sentiment Analysis with LLM Data Augmentation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jakub Šmíd
P. Pribán
Pavel Král
76
0
0
13 Aug 2025
UWB at WASSA-2024 Shared Task 2: Cross-lingual Emotion Detection
Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), 2025
Jakub Šmíd
P. Pribán
Pavel Král
82
2
0
12 Aug 2025
Utilizing Multilingual Encoders to Improve Large Language Models for Low-Resource Languages
Moratuwa Engineering Research Conference (MERCon), 2025
Imalsha Puranegedara
Themira Chathumina
Nisal Ranathunga
Nisansa de Silva
Surangika Ranathunga
Mokanarangan Thayaparan
219
0
0
12 Aug 2025
TopXGen: Topic-Diverse Parallel Data Generation for Low-Resource Machine Translation
A. Zebaze
Benoît Sagot
Rachel Bawden
100
1
0
12 Aug 2025
Prompt-Based Approach for Czech Sentiment Analysis
Recent Advances in Natural Language Processing (RANLP), 2025
Jakub Šmíd
P. Pribán
76
5
0
12 Aug 2025
Czech Dataset for Complex Aspect-Based Sentiment Analysis Tasks
International Conference on Language Resources and Evaluation (LREC), 2025
Jakub Šmíd
P. Pribán
O. Pražák
Pavel Král
CoGe
144
5
0
11 Aug 2025
Few-shot Cross-lingual Aspect-Based Sentiment Analysis with Sequence-to-Sequence Models
International Conference on Text, Speech and Dialogue (TSD), 2025
Jakub Šmíd
Pavel Přibáň
Pavel Král
112
0
0
11 Aug 2025
Multi-task Adversarial Attacks against Black-box Model with Few-shot Queries
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Wenqiang Wang
Yan Xiao
Hao Lin
Yangshijie Zhang
Xiaochun Cao
AAML
132
1
0
10 Aug 2025
Do Biased Models Have Biased Thoughts?
Swati Rajwal
Shivank Garg
Reem Abdel-Salam
Abdelrahman Zayed
LRM
172
0
0
08 Aug 2025
H-Net++: Hierarchical Dynamic Chunking for Tokenizer-Free Language Modelling in Morphologically-Rich Languages
Mehrdad Zakershahrak
Samira Ghodratnama
VLM
68
0
0
07 Aug 2025
Semantic Bridge: Universal Multi-Hop Question Generation via AMR-Driven Graph Synthesis
Linqing Chen
Hanmeng Zhong
Wentao Wu
Weilei Wang
LRM
116
1
0
06 Aug 2025
Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization
Negar Foroutan
Clara Meister
Debjit Paul
Joel Niklaus
Sina Ahmadi
Antoine Bosselut
Rico Sennrich
208
2
0
06 Aug 2025
TIBSTC-CoT: A Multi-Domain Instruction Dataset for Chain-of-Thought Reasoning in Language Models
Fan Gao
Cheng Huang
Nyima Tashi
Yutong Liu
Xiangxiang Wang
...
Rinchen Dongrub
Dorje Tashi
Xiao Feng
Hao Wang
Yongbin Yu
LRM
274
2
0
04 Aug 2025
Dynaword: From One-shot to Continuously Developed Datasets
Kenneth Enevoldsen
Kristian Nørgaard Jensen
Jan Kostkan
Balázs Szabó
Márton Kardos
...
Per Møldrup Dalum
Desmond Elliott
Lukas Galke
Peter Schneider-Kamp
Kristoffer Nielbo
166
0
0
04 Aug 2025
SHAMI-MT: A Syrian Arabic Dialect to Modern Standard Arabic Bidirectional Machine Translation System
Serry Sibaee
Omer Nacar
Yasser Habashi
Adel Ammar
W. Boulila
78
1
0
04 Aug 2025
Quantum-RAG and PunGPT2: Advancing Low-Resource Language Generation and Retrieval for the Punjabi Language
Jaskaranjeet Singh
Rakesh Thakur
170
0
0
03 Aug 2025
The Art of Breaking Words: Rethinking Multilingual Tokenizer Design
Aamod Thakur
Ajay Nagpal
Atharva Savarkar
Kundeshwar Pundalik
Siddhesh Dosi
Piyush Sawarkar
Viraj Thakur
Rohit Saluja
Maunendra Sankar Desarkar
Ganesh Ramakrishnan
100
1
0
03 Aug 2025
Multi-TW: Benchmarking Multimodal Models on Traditional Chinese Question Answering in Taiwan
Jui-Ming Yao
Bing-Cheng Xie
Sheng-Wei Peng
Hao-Yuan Chen
He-Rong Zheng
Bing-Jia Tan
Peter Shaojui Wang
Shun-Feng Su
84
0
0
02 Aug 2025
UrBLiMP: A Benchmark for Evaluating the Linguistic Competence of Large Language Models in Urdu
Farah Adeeba
Brian Dillon
Hassan Sajjad
Rajesh Bhatt
ELM
68
0
0
01 Aug 2025
Beyond Gloss: A Hand-Centric Framework for Gloss-Free Sign Language Translation
Sobhan Asasi
Mohamed Ilyas Lakhal
Ozge Mercanoglu Sincan
Richard Bowden
SLR
194
1
0
31 Jul 2025
Is neural semantic parsing good at ellipsis resolution, or isn't it?
Xiao Zhang
Johan Bos
217
0
0
31 Jul 2025
Previous
1
2
3
4
5
...
30
31
32
Next