Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2010.11934
Cited By
v1
v2
v3 (latest)
mT5: A massively multilingual pre-trained text-to-text transformer
22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (4 upvotes)
Papers citing
"mT5: A massively multilingual pre-trained text-to-text transformer"
50 / 1,561 papers shown
Title
Uncertainty Distillation: Teaching Language Models to Express Semantic Confidence
Sophia Hager
David Mueller
Kevin Duh
Nicholas Andrews
427
4
0
18 Mar 2025
Pensez: Less Data, Better Reasoning -- Rethinking French LLM
Huy Hoang Ha
ReLM
LRM
241
4
0
17 Mar 2025
LAG-MMLU: Benchmarking Frontier LLM Understanding in Latvian and Giriama
Naome A. Etori
Kevin Lu
Randu Karisa
Arturs Kanepajs
LRM
ELM
963
2
0
14 Mar 2025
Annotating Scientific Uncertainty: A comprehensive model using linguistic patterns and comparison with existing approaches
Panggih Kusuma Ningrum
Philipp Mayr
N. Smirnova
Iana Atanassova
UQLM
277
1
0
14 Mar 2025
A Hybrid Architecture with Efficient Fine Tuning for Abstractive Patent Document Summarization
International Conference on Soft Computing and Software Engineering (ICSCSE), 2025
Nevidu Jayatilleke
Ruvan Weerasinghe
AILaw
604
1
0
13 Mar 2025
An Expanded Massive Multilingual Dataset for High-Performance Language Technologies (HPLT)
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Laurie Burchell
Ona de Gibert
Nikolay Arefyev
Mikko Aulamo
Marta Bañón
...
Pavel Stepachev
and Jörg Tiedemann
Dušan Variš
Tereza Vojtěchová
Jaume Zaragoza-Bernabeu
460
11
0
13 Mar 2025
NAMI: Efficient Image Generation via Bridged Progressive Rectified Flow Transformers
Yuhang Ma
Bo Cheng
Shanyuan Liu
Ao Ma
Xiaoyu Wu
Xiaoyu Wu
Dawei Leng
Yuhui Yin
326
0
0
12 Mar 2025
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yingfeng Luo
Tong Zheng
Yongyu Mu
Yangqiu Song
Qinghong Zhang
...
Ziqiang Xu
Peinan Feng
Xiaoqian Liu
Tong Xiao
Jingbo Zhu
AI4CE
1.1K
9
0
09 Mar 2025
Coreference as an indicator of context scope in multimodal narrative
Nikolai Ilinykh
Shalom Lappin
A. Sayeed
Sharid Loáiciga
170
0
0
07 Mar 2025
Compositional Translation: A Novel LLM-based Approach for Low-resource Machine Translation
A. Zebaze
Benoît Sagot
Rachel Bawden
251
9
0
06 Mar 2025
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Kristian Kuznetsov
Laida Kushnareva
Polina Druzhinina
Anton Razzhigaev
Anastasia Voznyuk
Irina Piontkovskaya
Evgeny Burnaev
Serguei Barannikov
238
5
0
05 Mar 2025
Enhancing Vietnamese VQA through Curriculum Learning on Raw and Augmented Text Representations
Khoi Anh Nguyen
Linh Yen Vu
Thang Dinh Duong
Thuan Nguyen Duong
Huy Thanh Nguyen
V. Q. Dinh
232
5
0
05 Mar 2025
Wikipedia in the Era of LLMs: Evolution and Risks
Siming Huang
Yuliang Xu
Mingmeng Geng
Yao Wan
Benlin Liu
KELM
357
4
0
04 Mar 2025
In-context Learning vs. Instruction Tuning: The Case of Small and Multilingual Language Models
David Ponce
Thierry Etchegoyhen
322
2
0
03 Mar 2025
Sherkala-Chat: Building a State-of-the-Art LLM for Kazakh in a Moderately Resourced Setting
Fajri Koto
Rituraj Joshi
Nurdaulet Mukhituly
Yanjie Wang
Zhuohan Xie
...
Sarath Chandran
Avraham Sheinin
Natalia Vassilieva
Neha Sengupta
Larry Murray
ALM
KELM
389
3
0
03 Mar 2025
Test-Time Alignment for Large Language Models via Textual Model Predictive Control
Kuang-Da Wang
Teng-Ruei Chen
Yu-Heng Hung
Shuoyang Ding
Yueh-Hua Wu
Yu-Chun Wang
Chao-Han Huck Yang
Chao-Han Huck Yang
Wen-Chih Peng
Ping-Chun Hsieh
352
0
0
28 Feb 2025
PolyPrompt: Automating Knowledge Extraction from Multilingual Language Models with Dynamic Prompt Generation
Nathan Roll
278
1
0
27 Feb 2025
HuAMR: A Hungarian AMR Parser and Dataset
Botond Barta
Endre Hamerlik
Milán Konor Nyist
Judit Ács
190
0
0
27 Feb 2025
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think
L. Chen
S. Bai
Wenhao Chai
Weichu Xie
Haozhe Zhao
Leon Vinci
Junyang Lin
Baobao Chang
DiffM
297
15
0
27 Feb 2025
Few-Shot Multilingual Open-Domain QA from 5 Examples
Fan Jiang
Tom Drummond
Trevor Cohn
316
0
0
27 Feb 2025
Compressing Language Models for Specialized Domains
Miles Williams
G. Chrysostomou
Vitor Jeronymo
Nikolaos Aletras
MQ
284
1
0
25 Feb 2025
Language Models' Factuality Depends on the Language of Inquiry
Tushar Aggarwal
Kumar Tanmay
Ayush Agrawal
Kumar Ayush
Hamid Palangi
Paul Pu Liang
HILM
KELM
293
8
0
25 Feb 2025
What are Foundation Models Cooking in the Post-Soviet World?
Anton Lavrouk
Tarek Naous
Alan Ritter
Wei Xu
456
2
0
25 Feb 2025
Do Multilingual LLMs Think In English?
Lisa Schut
Y. Gal
Sebastian Farquhar
288
44
0
24 Feb 2025
Encryption-Friendly LLM Architecture
International Conference on Learning Representations (ICLR), 2024
Donghwan Rho
Taeseong Kim
Minje Park
Jung Woo Kim
Hyunsik Chae
Jung Hee Cheon
Ernest K. Ryu
478
17
0
24 Feb 2025
Comprehensive Analysis of Transparency and Accessibility of ChatGPT, DeepSeek, And other SoTA Large Language Models
Ranjan Sapkota
Shaina Raza
Manoj Karkee
245
15
0
21 Feb 2025
Multilingual Non-Factoid Question Answering with Answer Paragraph Selection
Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2024
Ritwik Mishra
Sreeram Vennam
R. Shah
Ponnurangam Kumaraguru
324
0
0
20 Feb 2025
Multilingual Language Model Pretraining using Machine-translated Data
Jiayi Wang
Yao Lu
Maurice Weber
Max Ryabinin
David Ifeoluwa Adelani
Yihong Chen
Raphael Tang
Pontus Stenetorp
LRM
352
7
0
20 Feb 2025
KazMMLU: Evaluating Language Models on Kazakh, Russian, and Regional Knowledge of Kazakhstan
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Mukhammed Togmanov
Nurdaulet Mukhituly
Diana Turmakhan
Jonibek Mansurov
Maiya Goloburda
...
Nurkhan Laiyk
Alham Fikri Aji
Ekaterina Kochmar
Preslav Nakov
Fajri Koto
ELM
214
8
0
18 Feb 2025
M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis
Chengyan Wu
Bolei Ma
Wenshu Fan
Zheyu Zhang
Ningyuan Deng
Yongqian Li
Baolan Chen
Yi Zhang
Yun Xue
Yun Xue
442
3
0
17 Feb 2025
Enhancing Multilingual LLM Pretraining with Model-Based Data Selection
Bettina Messmer
Vinko Sabolčec
Martin Jaggi
173
8
0
17 Feb 2025
Generating Text from Uniform Meaning Representation
Emma Markle
Reihaneh Iranmanesh
Shira Wein
167
1
0
17 Feb 2025
LayAlign: Enhancing Multilingual Reasoning in Large Language Models via Layer-Wise Adaptive Fusion and Alignment Strategy
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Zhiwen Ruan
Yixia Li
He Zhu
Longyue Wang
Weihua Luo
Kaifu Zhang
Yuxiao Chen
Guanhua Chen
281
5
0
17 Feb 2025
Balanced Multi-Factor In-Context Learning for Multilingual Large Language Models
Masahiro Kaneko
Alham Fikri Aji
Timothy Baldwin
305
0
0
17 Feb 2025
ALGEN: Few-shot Inversion Attacks on Textual Embeddings using Alignment and Generation
Yiyi Chen
Qiongkai Xu
Johannes Bjerva
377
4
0
16 Feb 2025
The underlying structures of self-attention: symmetry, directionality, and emergent dynamics in Transformer training
Matteo Saponati
Pascal Sager
Pau Vilimelis Aceituno
Thilo Stadelmann
Benjamin Grewe
175
4
0
15 Feb 2025
Matina: A Large-Scale 73B Token Persian Text Corpus
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Sara Bourbour Hosseinbeigi
Fatemeh Taherinezhad
Heshaam Faili
Hamed Baghbani
Fatemeh Nadi
Mostafa Amiri
233
1
0
13 Feb 2025
A Large-Scale Benchmark for Vietnamese Sentence Paraphrases
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Sang Quang Nguyen
Kiet Van Nguyen
347
0
0
11 Feb 2025
Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile
Hangliang Ding
Dacheng Li
Runlong Su
Peiyuan Zhang
Zhijie Deng
Eric Liang
Hao Zhang
VGen
353
16
0
10 Feb 2025
Towards the Development of Balanced Synthetic Data for Correcting Grammatical Errors in Arabic: An Approach Based on Error Tagging Model and Synthetic Data Generating Model
Ahlam Alrehili
Areej Alhothali
441
1
0
07 Feb 2025
Multilingual State Space Models for Structured Question Answering in Indic Languages
A. Vats
Rahul Raja
Mrinal Mathur
Vinija Jain
Vasu Sharma
487
3
0
01 Feb 2025
A linguistically-motivated evaluation methodology for unraveling model's abilities in reading comprehension tasks
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Elie Antoine
Frédéric Béchet
Géraldine Damnati
Philippe Langlais
368
1
0
29 Jan 2025
Data Duplication: A Novel Multi-Purpose Attack Paradigm in Machine Unlearning
Dayong Ye
Tainqing Zhu
Junlong Li
Kun Gao
B. Liu
Guang Dai
Wanlei Zhou
Yanmei Zhang
AAML
MU
352
5
0
28 Jan 2025
Commute Your Domains: Trajectory Optimality Criterion for Multi-Domain Learning
Alexey Rukhovich
Alexander Podolskiy
Irina Piontkovskaya
269
2
0
28 Jan 2025
Faster Machine Translation Ensembling with Reinforcement Learning and Competitive Correction
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Kritarth Prasad
Mohammadi Zaki
Pratik Rakesh Singh
Pankaj Wasnik
218
3
0
28 Jan 2025
Test-Time Code-Switching for Cross-lingual Aspect Sentiment Triplet Extraction
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Dongming Sheng
Kexin Han
Hao Li
Yan Zhang
Yucheng Huang
Jun Lang
Wenqiang Liu
178
1
0
24 Jan 2025
Can MLLMs Generalize to Multi-Party dialog? Exploring Multilingual Response Generation in Complex Scenarios
Zhongtian Hu
Yiwen Cui
Ronghan Li
Meng Zhao
Lifang Wang
182
0
0
20 Jan 2025
ViBidirectionMT-Eval: Machine Translation for Vietnamese-Chinese and Vietnamese-Lao language pair
Journal of Computer Science and Cybernetics (JCSC), 2025
Hong-Viet Tran
Minh-Quy Nguyen
Van-Vinh Nguyen
MoE
98
0
0
15 Jan 2025
Exploring Robustness of Multilingual LLMs on Real-World Noisy Data
Amirhossein Aliakbarzadeh
Lucie Flek
Akbar Karimi
216
3
0
14 Jan 2025
Language Fusion for Parameter-Efficient Cross-lingual Transfer
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Philipp Borchert
Ivan Vulić
Marie-Francine Moens
Jochen De Weerdt
354
2
0
12 Jan 2025
Previous
1
2
3
4
5
6
...
30
31
32
Next