Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2010.11934
Cited By
v1
v2
v3 (latest)
mT5: A massively multilingual pre-trained text-to-text transformer
22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (4 upvotes)
Papers citing
"mT5: A massively multilingual pre-trained text-to-text transformer"
50 / 1,562 papers shown
Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale
Hasan Hammoud
Mohammad Zbeeb
Bernard Ghanem
154
2
0
17 Sep 2025
DeDisCo at the DISRPT 2025 Shared Task: A System for Discourse Relation Classification
Zhuoxuan Ju
Jingni Wu
Abhishek Purushothama
Amir Zeldes
183
1
0
15 Sep 2025
SignMouth: Leveraging Mouthing Cues for Sign Language Translation by Multimodal Contrastive Fusion
Wenfang Wu
Tingting Yuan
Yupeng Li
Daling Wang
Xiaoming Fu
SLR
328
0
0
12 Sep 2025
PolyTruth: Multilingual Disinformation Detection using Transformer-Based Language Models
Zaur Gouliev
Jennifer Waters
Chengqian Wang
99
1
0
12 Sep 2025
Building High-Quality Datasets for Portuguese LLMs: From Common Crawl Snapshots to Industrial-Grade Corpora
Thales Sales Almeida
Rodrigo Nogueira
Hélio Pedrini
157
4
0
10 Sep 2025
MultimodalHugs: Enabling Sign Language Processing in Hugging Face
Gerard Sant
Zifan Jiang
Carlos Escolano
Amit Moryossef
Mathias Müller
Rico Sennrich
Sarah Ebling
SLR
223
0
0
10 Sep 2025
Building Large-Scale English-Romanian Literary Translation Resources with Open Models
Mihai Nadas
Laura Diosan
Andreea Tomescu
Andrei Piscoran
153
0
0
09 Sep 2025
mmBERT: A Modern Multilingual Encoder with Annealed Language Learning
Marc Marone
Orion Weller
William Fleshman
Eugene Yang
Dawn J Lawrie
Benjamin Van Durme
200
10
0
08 Sep 2025
Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining
Deniz Bayazit
Aaron Mueller
Antoine Bosselut
141
0
0
05 Sep 2025
Entropy2Vec: Crosslingual Language Modeling Entropy as End-to-End Learnable Language Representations
Patrick Amadeus Irawan
Ryandito Diandaru
Belati Jagad Bintang Syuhada
Randy Zakya Suchrady
Alham Fikri Aji
Genta Indra Winata
Fajri Koto
Samuel Cahyawijaya
134
1
0
05 Sep 2025
OneSearch: A Preliminary Exploration of the Unified End-to-End Generative Framework for E-commerce Search
Ben Chen
X. Guo
Siyuan Wang
Zihan Liang
Yue Lv
...
Jing Chen
Chenyi Lei
Wenwu Ou
Han Li
Kun Gai
232
6
0
03 Sep 2025
Zero-shot Cross-lingual NER via Mitigating Language Difference: An Entity-aligned Translation Perspective
Zhihao Zhang
Sophia Yat Mei Lee
Dong Zhang
Shoushan Li
Guodong Zhou
121
0
0
01 Sep 2025
MixedG2P-T5: G2P-free Speech Synthesis for Mixed-script texts using Speech Self-Supervised Learning and Language Model
Joonyong Park
Daisuke Saito
Nobuaki Minematsu
86
0
0
01 Sep 2025
AdaptCache: KV Cache Native Storage Hierarchy for Low-Delay and High-Quality Language Model Serving
Shaoting Feng
Hanchen Li
Kuntai Du
Zhuohan Gu
Yuhan Liu
...
Siddhant Ray
Samuel Shen
Yihua Cheng
Ganesh Ananthanarayanan
Junchen Jiang
184
1
0
28 Aug 2025
Debiasing Multilingual LLMs in Cross-lingual Latent Space
Qiwei Peng
Guimin Hu
Yekun Chai
Anders Søgaard
148
1
0
25 Aug 2025
Speculating LLMs' Chinese Training Data Pollution from Their Tokens
Qingjie Zhang
Di Wang
Haoting Qian
Liu Yan
Tianwei Zhang
Ke Xu
Qi Li
Minlie Huang
Hewu Li
Han Qiu
96
1
0
25 Aug 2025
Evaluating the Impact of Verbal Multiword Expressions on Machine Translation
Linfeng Liu
Saptarshi Ghosh
Tianyu Jiang
86
0
0
24 Aug 2025
Quantifying Language Disparities in Multilingual Large Language Models
Songbo Hu
Ivan Vulić
Anna Korhonen
122
3
0
23 Aug 2025
OpenWHO: A Document-Level Parallel Corpus for Health Translation in Low-Resource Languages
Raphael Merx
Hanna Suominen
Trevor Cohn
Ekaterina Vylomova
287
0
0
22 Aug 2025
Long Chain-of-Thought Reasoning Across Languages
Josh Barua
Seun Eisape
Kayo Yin
Alane Suhr
LRM
157
1
0
20 Aug 2025
In2x at WMT25 Translation Task
Lei Pang
Hanyi Mao
Quanjia Xiao
HaiXiao Liu
Xiangyi Li
116
0
0
20 Aug 2025
When Alignment Hurts: Decoupling Representational Spaces in Multilingual Models
Ahmed Elshabrawy
Hour Kaing
Israfel Salazar
Alham Fikri Aji
Hideki Tanaka
Masao Utiyama
Mary Dabre
98
0
0
18 Aug 2025
Is GPT-OSS Good? A Comprehensive Evaluation of OpenAI's Latest Open Source Models
Ziqian Bi
Keyu Chen
Chiung-Yi Tseng
Danyang Zhang
Pohsun Feng
...
Junming Huang
Jibin Guan
Junfeng Hao
Junhao Song
Junhao Song
ELM
217
4
0
17 Aug 2025
Large Language Models for Summarizing Czech Historical Documents and Beyond
International Conference on Agents and Artificial Intelligence (ICAART), 2025
Václav Tran
Jakub Šmíd
J. Martínek
Ladislav Lenc
Pavel Král
130
1
0
14 Aug 2025
Evaluating LLMs on Chinese Idiom Translation
Cai Yang
Yao Dou
David Heineman
Xiaofeng Wu
Wei Xu
155
0
0
14 Aug 2025
Improving Generative Cross-lingual Aspect-Based Sentiment Analysis with Constrained Decoding
Jakub Šmíd
P. Pribán
Pavel Král
AI4CE
129
0
0
14 Aug 2025
Cross-Prompt Encoder for Low-Performing Languages
Beso Mikaberidze
Teimuraz Saghinadze
Simon Ostermann
Philipp Müller
107
0
0
14 Aug 2025
Advancing Cross-lingual Aspect-Based Sentiment Analysis with LLMs and Constrained Decoding for Sequence-to-Sequence Models
International Conference on Agents and Artificial Intelligence (ICAART), 2025
Jakub Šmíd
P. Pribán
Pavel Král
121
6
0
14 Aug 2025
LACA: Improving Cross-lingual Aspect-Based Sentiment Analysis with LLM Data Augmentation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jakub Šmíd
P. Pribán
Pavel Král
87
0
0
13 Aug 2025
Cross-lingual Aspect-Based Sentiment Analysis: A Survey on Tasks, Approaches, and Challenges
Information Fusion (Inf. Fusion), 2025
Jakub Šmíd
Pavel Král
137
8
0
13 Aug 2025
TopXGen: Topic-Diverse Parallel Data Generation for Low-Resource Machine Translation
A. Zebaze
Benoît Sagot
Rachel Bawden
104
1
0
12 Aug 2025
UWB at WASSA-2024 Shared Task 2: Cross-lingual Emotion Detection
Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), 2025
Jakub Šmíd
P. Pribán
Pavel Král
90
2
0
12 Aug 2025
Prompt-Based Approach for Czech Sentiment Analysis
Recent Advances in Natural Language Processing (RANLP), 2025
Jakub Šmíd
P. Pribán
115
5
0
12 Aug 2025
Utilizing Multilingual Encoders to Improve Large Language Models for Low-Resource Languages
Moratuwa Engineering Research Conference (MERCon), 2025
Imalsha Puranegedara
Themira Chathumina
Nisal Ranathunga
Nisansa de Silva
Surangika Ranathunga
Mokanarangan Thayaparan
219
0
0
12 Aug 2025
Czech Dataset for Complex Aspect-Based Sentiment Analysis Tasks
International Conference on Language Resources and Evaluation (LREC), 2025
Jakub Šmíd
P. Pribán
O. Pražák
Pavel Král
CoGe
156
5
0
11 Aug 2025
Few-shot Cross-lingual Aspect-Based Sentiment Analysis with Sequence-to-Sequence Models
International Conference on Text, Speech and Dialogue (TSD), 2025
Jakub Šmíd
Pavel Přibáň
Pavel Král
124
0
0
11 Aug 2025
Multi-task Adversarial Attacks against Black-box Model with Few-shot Queries
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Wenqiang Wang
Yan Xiao
Hao Lin
Yangshijie Zhang
Xiaochun Cao
AAML
133
1
0
10 Aug 2025
Do Biased Models Have Biased Thoughts?
Swati Rajwal
Shivank Garg
Reem Abdel-Salam
Abdelrahman Zayed
LRM
179
0
0
08 Aug 2025
H-Net++: Hierarchical Dynamic Chunking for Tokenizer-Free Language Modelling in Morphologically-Rich Languages
Mehrdad Zakershahrak
Samira Ghodratnama
VLM
75
0
0
07 Aug 2025
Semantic Bridge: Universal Multi-Hop Question Generation via AMR-Driven Graph Synthesis
Linqing Chen
Hanmeng Zhong
Wentao Wu
Weilei Wang
LRM
120
1
0
06 Aug 2025
Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization
Negar Foroutan
Clara Meister
Debjit Paul
Joel Niklaus
Sina Ahmadi
Antoine Bosselut
Rico Sennrich
211
3
0
06 Aug 2025
Dynaword: From One-shot to Continuously Developed Datasets
Kenneth Enevoldsen
Kristian Nørgaard Jensen
Jan Kostkan
Balázs Szabó
Márton Kardos
...
Per Møldrup Dalum
Desmond Elliott
Lukas Galke
Peter Schneider-Kamp
Kristoffer Nielbo
172
0
0
04 Aug 2025
SHAMI-MT: A Syrian Arabic Dialect to Modern Standard Arabic Bidirectional Machine Translation System
Serry Sibaee
Omer Nacar
Yasser Habashi
Adel Ammar
W. Boulila
95
1
0
04 Aug 2025
TIBSTC-CoT: A Multi-Domain Instruction Dataset for Chain-of-Thought Reasoning in Language Models
Fan Gao
Cheng Huang
Nyima Tashi
Yutong Liu
Xiangxiang Wang
...
Rinchen Dongrub
Dorje Tashi
Xiao Feng
Hao Wang
Yongbin Yu
LRM
287
2
0
04 Aug 2025
The Art of Breaking Words: Rethinking Multilingual Tokenizer Design
Aamod Thakur
Ajay Nagpal
Atharva Savarkar
Kundeshwar Pundalik
Siddhesh Dosi
Piyush Sawarkar
Viraj Thakur
Rohit Saluja
Maunendra Sankar Desarkar
Ganesh Ramakrishnan
104
1
0
03 Aug 2025
Quantum-RAG and PunGPT2: Advancing Low-Resource Language Generation and Retrieval for the Punjabi Language
Jaskaranjeet Singh
Rakesh Thakur
176
0
0
03 Aug 2025
Multi-TW: Benchmarking Multimodal Models on Traditional Chinese Question Answering in Taiwan
Jui-Ming Yao
Bing-Cheng Xie
Sheng-Wei Peng
Hao-Yuan Chen
He-Rong Zheng
Bing-Jia Tan
Peter Shaojui Wang
Shun-Feng Su
89
0
0
02 Aug 2025
UrBLiMP: A Benchmark for Evaluating the Linguistic Competence of Large Language Models in Urdu
Farah Adeeba
Brian Dillon
Hassan Sajjad
Rajesh Bhatt
ELM
77
0
0
01 Aug 2025
Is neural semantic parsing good at ellipsis resolution, or isn't it?
Xiao Zhang
Johan Bos
229
0
0
31 Jul 2025
Beyond Gloss: A Hand-Centric Framework for Gloss-Free Sign Language Translation
Sobhan Asasi
Mohamed Ilyas Lakhal
Ozge Mercanoglu Sincan
Richard Bowden
SLR
202
1
0
31 Jul 2025
Previous
1
2
3
4
5
...
30
31
32
Next