Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2010.11934
Cited By
v1
v2
v3 (latest)
mT5: A massively multilingual pre-trained text-to-text transformer
22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (4 upvotes)
Papers citing
"mT5: A massively multilingual pre-trained text-to-text transformer"
50 / 1,560 papers shown
Title
Bridge-Coder: Unlocking LLMs' Potential to Overcome Language Gaps in Low-Resource Code
Jipeng Zhang
Jianshu Zhang
Yuanzhe Li
Renjie Pi
Boyao Wang
Runtao Liu
Ziqiang Zheng
Tong Zhang
142
2
0
24 Oct 2024
SPEED++: A Multilingual Event Extraction Framework for Epidemic Prediction and Preparedness
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Tanmay Parekh
Jeffrey Kwan
Jiarui Yu
Sparsh Johri
Hyosang Ahn
Sreya Muppalla
Kai-Wei Chang
Wei Wang
Nanyun Peng
322
7
0
24 Oct 2024
Link, Synthesize, Retrieve: Universal Document Linking for Zero-Shot Information Retrieval
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Dae Yon Hwang
Bilal Taha
Harshit Pande
Yaroslav Nechaev
SyDa
278
0
0
24 Oct 2024
Deep Insights into Cognitive Decline: A Survey of Leveraging Non-Intrusive Modalities with Deep Learning Techniques
Applied Soft Computing (Appl. Soft Comput.), 2024
David Ortiz-Perez
Manuel Benavent-Lledo
José García Rodríguez
David Tomás
M. Flores Vizcaya-Moreno
211
3
0
24 Oct 2024
Key Algorithms for Keyphrase Generation: Instruction-Based LLMs for Russian Scientific Keyphrases
International Joint Conference on the Analysis of Images, Social Networks and Texts (AISNT), 2024
Anna Glazkova
Dmitry A. Morozov
Timur Garipov
207
0
0
23 Oct 2024
Cross-lingual Transfer of Reward Models in Multilingual Alignment
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Jiwoo Hong
Noah Lee
Rodrigo Martínez-Castaño
César Rodríguez
James Thorne
392
15
0
23 Oct 2024
Responsible Multilingual Large Language Models: A Survey of Development, Applications, and Societal Impact
Junhua Liu
Bin Fu
LRM
146
3
0
23 Oct 2024
MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models
Seunghyeok Hong
Dongkeun Yoon
Juyoung Suk
Javier Aula-Blasco
Mano Aslan
Vu Trong Kim
Shayekh Bin Islam
Jaume Prats-Cristià
Lucía Tormo-Bañuelos
Seungone Kim
ELM
LRM
236
0
0
23 Oct 2024
Multi-head Sequence Tagging Model for Grammatical Error Correction
Engineering applications of artificial intelligence (EAAI), 2024
Kamal Al-Sabahi
Kang Yang
Wangwang Liu
Guanyu Jiang
Xian Li
Ming Yang
166
3
0
21 Oct 2024
Findings of the Third Shared Task on Multilingual Coreference Resolution
Michal Novák
Barbora Dohnalová
Miloslav Konopík
A. Nedoluzhko
Martin Popel
O. Pražák
Jakub Sido
Milan Straka
Zdeněk Žabokrtský
Daniel Zeman
254
9
0
21 Oct 2024
Generative AI Agents in Autonomous Machines: A Safety Perspective
International Conference on Computer Aided Design (ICCAD), 2024
Jason J. Jabbour
Vijay Janapa Reddi
AI4CE
323
11
0
20 Oct 2024
Allegro: Open the Black Box of Commercial-Level Video Generation Model
Yuan Zhou
Qiuyue Wang
Yuxuan Cai
Huan Yang
VGen
VLM
318
51
0
20 Oct 2024
Grammatical Error Correction for Low-Resource Languages: The Case of Zarma
Mamadou K. Keita
Christopher Homan
Sofiane Abdoulaye Hamani
Adwoa Bremang
Marcos Zampieri
Habibatou Abdoulaye Alfari
Elysabhete Amadou Ibrahim
334
5
0
20 Oct 2024
A survey of neural-network-based methods utilising comparable data for finding translation equivalents
Michaela Denisová
Pavel Rychlý
222
0
0
19 Oct 2024
CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Shangda Wu
Yashan Wang
Ruibin Yuan
Zhancheng Guo
Xu Tan
...
Yuanliang Dong
Jiafeng Liu
Xiaobing Li
Feng Yu
Maosong Sun
393
13
0
17 Oct 2024
Large Language Models are Easily Confused: A Quantitative Metric, Security Implications and Typological Analysis
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Yiyi Chen
Qiongxiu Li
Russa Biswas
Johannes Bjerva
287
7
0
17 Oct 2024
From Citations to Criticality: Predicting Legal Decision Influence in the Multilingual Swiss Jurisprudence
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Ronja Stern
Ken Kawamura
Matthias Sturmer
Ilias Chalkidis
Joel Niklaus
AILaw
ELM
231
1
0
17 Oct 2024
Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland
Luca Rolshoven
Vishvaksenan Rasiah
Srinanda Brügger Bose
Sarah Hostettler
Lara Burkhalter
Matthias Sturmer
Joel Niklaus
ELM
AILaw
259
4
0
17 Oct 2024
Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Weixuan Wang
Minghao Wu
Barry Haddow
Alexandra Birch
LRM
194
11
0
16 Oct 2024
Evaluating Morphological Compositional Generalization in Large Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Mete Ismayilzada
Yuan Chiang
Jonne Sälevä
Hale Sirin
Abdullatif Köksal
Bhuwan Dhingra
Antoine Bosselut
Lonneke van der Plas
Duygu Ataman
363
14
0
16 Oct 2024
Tokenization and Morphology in Multilingual Language Models: A Comparative Analysis of mT5 and ByT5
Thao Anh Dang
Limor Raviv
Lukas Galke
261
9
0
15 Oct 2024
MARS: Multilingual Aspect-centric Review Summarisation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Sandeep Sricharan Mukku
Abinesh Kanagarajan
Chetan Aggarwal
Promod Yenigalla
172
1
0
13 Oct 2024
A Mixed-Language Multi-Document News Summarization Dataset and a Graphs-Based Extract-Generate Model
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Shengxiang Gao
Fang nan
Yongbing Zhang
Yuxin Huang
Kaiwen Tan
Z. Yu
178
3
0
13 Oct 2024
Data Processing for the OpenGPT-X Model Family
Nicolo' Brandizzi
Hammam Abdelwahab
Anirban Bhowmick
Lennard Helmer
Benny Jörg Stein
...
Georg Rehm
Dennis Wegener
Nicolas Flores-Herr
Joachim Kohler
Johannes Leveling
VLM
418
2
0
11 Oct 2024
Linguistically-Informed Multilingual Instruction Tuning: Is There an Optimal Set of Languages to Tune?
Gürkan Soykan
Gözde Gül Şahin
206
1
0
10 Oct 2024
Bahasa Harmony: A Comprehensive Dataset for Bahasa Text-to-Speech Synthesis with Discrete Codec Modeling of EnGen-TTS
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Onkar Kishor Susladkar
Vishesh Tripathi
Biddwan Ahmed
95
0
0
09 Oct 2024
Signal Watermark on Large Language Models
Zhenyu Xu
Victor S. Sheng
WaLM
76
2
0
09 Oct 2024
HumVI: A Multilingual Dataset for Detecting Violent Incidents Impacting Humanitarian Aid
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Hemank Lamba
Anton Abilov
Ke Zhang
Elizabeth M. Olson
Henry k. Dambanemuya
...
David S. Batista
Christina Wille
A. Cahill
Joel R. Tetreault
Alex Jaimes
172
1
0
08 Oct 2024
Beyond Correlation: Interpretable Evaluation of Machine Translation Metrics
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Stefano Perrella
Lorenzo Proietti
Pere-Lluís Huguet Cabot
Edoardo Barba
Roberto Navigli
239
7
0
07 Oct 2024
DEPT: Decoupled Embeddings for Pre-training Language Models
International Conference on Learning Representations (ICLR), 2024
Alex Iacob
Lorenzo Sani
Meghdad Kurmanji
William F. Shen
Xinchi Qiu
Dongqi Cai
Yan Gao
Nicholas D. Lane
VLM
1.3K
2
0
07 Oct 2024
Passage Retrieval of Polish Texts Using OKAPI BM25 and an Ensemble of Cross Encoders
Conference on Computer Science and Information Systems (FedCSIS), 2023
Jakub Pokrywka
131
1
0
06 Oct 2024
Inner-Probe: Discovering Copyright-related Data Generation in LLM Architecture
Qichao Ma
Rui-Jie Zhu
Peiye Liu
Renye Yan
Fahong Zhang
...
Meng Li
Zhaofei Yu
Zongwei Wang
Yimao Cai
Tiejun Huang
169
1
0
06 Oct 2024
CiMaTe: Citation Count Prediction Effectively Leveraging the Main Text
Jun Hirako
Ryohei Sasano
Koichi Takeda
306
4
0
06 Oct 2024
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Jinhao Li
Jiaming Xu
Shan Huang
Yonghua Chen
Wen Li
...
Jiayi Pan
Li Ding
Hao Zhou
Yu Wang
Guohao Dai
540
43
0
06 Oct 2024
Upsample or Upweight? Balanced Training on Heavily Imbalanced Datasets
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Tianjian Li
Haoran Xu
Weiting Tan
Kenton Murray
Daniel Khashabi
489
3
0
06 Oct 2024
Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on Wikipedia
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Farhan Samir
Chan Young Park
Anjalie Field
Vered Shwartz
Yulia Tsvetkov
156
7
0
05 Oct 2024
Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Tomás Feith
Akhil Arora
Martin Gerlach
Debjit Paul
Robert West
KELM
174
7
0
05 Oct 2024
MetricX-24: The Google Submission to the WMT 2024 Metrics Shared Task
Conference on Machine Translation (WMT), 2024
Juraj Juraska
Daniel Deutsch
Mara Finkelstein
Markus Freitag
219
79
0
04 Oct 2024
Cross-lingual Transfer for Automatic Question Generation by Learning Interrogative Structures in Target Languages
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Seonjeong Hwang
Yunsu Kim
Gary Geunbae Lee
192
3
0
04 Oct 2024
MetaOOD: Automatic Selection of OOD Detection Models
International Conference on Learning Representations (ICLR), 2024
Yuehan Qin
Yichi Zhang
Yi Nian
Xueying Ding
Yue Zhao
OODD
245
1
0
04 Oct 2024
X-ALMA: Plug & Play Modules and Adaptive Rejection for Quality Translation at Scale
International Conference on Learning Representations (ICLR), 2024
Haoran Xu
Kenton W. Murray
Philipp Koehn
Hieu T. Hoang
Akiko Eriguchi
Huda Khayrallah
295
27
0
04 Oct 2024
CorPipe at CRAC 2024: Predicting Zero Mentions from Raw Text
Milan Straka
LRM
153
6
0
03 Oct 2024
IndicSentEval: How Effectively do Multilingual Transformer Models encode Linguistic Properties for Indic Languages?
Akhilesh Aravapalli
Mounika Marreddy
R. Mamidi
R. Mamidi
Subba Reddy Oota
241
2
0
03 Oct 2024
GADFA: Generator-Assisted Decision-Focused Approach for Opinion Expressing Timing Identification
International Conference on Computational Linguistics (COLING), 2024
Chung-Chi Chen
Hiroya Takamura
Ichiro Kobayashi
Yusuke Miyao
164
0
0
02 Oct 2024
Distilling Analysis from Generative Models for Investment Decisions
Chung-Chi Chen
Hiroya Takamura
Ichiro Kobayashi
Yusuke Miyao
56
0
0
02 Oct 2024
Cross-lingual Back-Parsing: Utterance Synthesis from Meaning Representation for Zero-Resource Semantic Parsing
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Deokhyung Kang
Seonjeong Hwang
Yunsu Kim
Gary Geunbae Lee
261
0
0
01 Oct 2024
Multi-Target Cross-Lingual Summarization: a novel task and a language-neutral approach
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Diogo Pernes
Gonçalo M. Correia
Afonso Mendes
279
3
0
01 Oct 2024
Language Resources in Spanish for Automatic Text Simplification across Domains
Antonio Moreno-Sandoval
Leonardo Campillos-Llanos
Ana García-Serrano
28
1
0
30 Sep 2024
SSR: Alignment-Aware Modality Connector for Speech Language Models
International Workshop on Spoken Language Translation (IWSLT), 2024
Weiting Tan
Hirofumi Inaguma
Ning Dong
Paden Tomasello
Xutai Ma
370
9
0
30 Sep 2024
Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs
Mehdi Ali
Michael Fromm
Klaudia Thellmann
Jan Ebert
Alexander Arno Weber
...
René Jäkel
Georg Rehm
Stefan Kesselheim
Joachim Kohler
Nicolas Flores-Herr
273
13
0
30 Sep 2024
Previous
1
2
3
...
6
7
8
...
30
31
32
Next