v1v2v3 (latest)

mT5: A massively multilingual pre-trained text-to-text transformer

22 October 2020

ArXiv (abs)PDF HTML HuggingFace (4 upvotes)

Papers citing "mT5: A massively multilingual pre-trained text-to-text transformer"

50 / 1,560 papers shown

Title
Bridge-Coder: Unlocking LLMs' Potential to Overcome Language Gaps in Low-Resource Code Jipeng Zhang Jianshu Zhang Yuanzhe Li Renjie Pi Boyao Wang Runtao Liu Ziqiang Zheng Tong Zhang 142 2 0 24 Oct 2024
SPEED++: A Multilingual Event Extraction Framework for Epidemic Prediction and PreparednessConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 Tanmay Parekh Jeffrey Kwan Jiarui Yu Sparsh Johri Hyosang Ahn Sreya Muppalla Kai-Wei Chang Wei Wang Nanyun Peng 322 7 0 24 Oct 2024
Link, Synthesize, Retrieve: Universal Document Linking for Zero-Shot Information RetrievalConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 Dae Yon Hwang Bilal Taha Harshit Pande Yaroslav Nechaev SyDa 278 0 0 24 Oct 2024
Deep Insights into Cognitive Decline: A Survey of Leveraging Non-Intrusive Modalities with Deep Learning TechniquesApplied Soft Computing (Appl. Soft Comput.), 2024 David Ortiz-Perez Manuel Benavent-Lledo José García Rodríguez David Tomás M. Flores Vizcaya-Moreno 211 3 0 24 Oct 2024
Key Algorithms for Keyphrase Generation: Instruction-Based LLMs for Russian Scientific KeyphrasesInternational Joint Conference on the Analysis of Images, Social Networks and Texts (AISNT), 2024 Anna Glazkova Dmitry A. Morozov Timur Garipov 207 0 0 23 Oct 2024
Cross-lingual Transfer of Reward Models in Multilingual AlignmentNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 Jiwoo Hong Noah Lee Rodrigo Martínez-Castaño César Rodríguez James Thorne 392 15 0 23 Oct 2024
Responsible Multilingual Large Language Models: A Survey of Development, Applications, and Societal Impact Junhua Liu Bin Fu LRM 146 3 0 23 Oct 2024
MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models Seunghyeok Hong Dongkeun Yoon Juyoung Suk Javier Aula-Blasco Mano Aslan Vu Trong Kim Shayekh Bin Islam Jaume Prats-Cristià Lucía Tormo-Bañuelos Seungone Kim ELM LRM 236 0 0 23 Oct 2024
Multi-head Sequence Tagging Model for Grammatical Error CorrectionEngineering applications of artificial intelligence (EAAI), 2024 Kamal Al-Sabahi Kang Yang Wangwang Liu Guanyu Jiang Xian Li Ming Yang 166 3 0 21 Oct 2024
Findings of the Third Shared Task on Multilingual Coreference Resolution Michal Novák Barbora Dohnalová Miloslav Konopík A. Nedoluzhko Martin Popel O. Pražák Jakub Sido Milan Straka Zdeněk Žabokrtský Daniel Zeman 254 9 0 21 Oct 2024
Generative AI Agents in Autonomous Machines: A Safety PerspectiveInternational Conference on Computer Aided Design (ICCAD), 2024 Jason J. Jabbour Vijay Janapa Reddi AI4CE 323 11 0 20 Oct 2024
Allegro: Open the Black Box of Commercial-Level Video Generation Model Yuan Zhou Qiuyue Wang Yuxuan Cai Huan Yang VGen VLM 318 51 0 20 Oct 2024
Grammatical Error Correction for Low-Resource Languages: The Case of Zarma Mamadou K. Keita Christopher Homan Sofiane Abdoulaye Hamani Adwoa Bremang Marcos Zampieri Habibatou Abdoulaye Alfari Elysabhete Amadou Ibrahim 334 5 0 20 Oct 2024
A survey of neural-network-based methods utilising comparable data for finding translation equivalents Michaela Denisová Pavel Rychlý 222 0 0 19 Oct 2024
CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 Shangda Wu Yashan Wang Ruibin Yuan Zhancheng Guo Xu Tan ... Yuanliang Dong Jiafeng Liu Xiaobing Li Feng Yu Maosong Sun 393 13 0 17 Oct 2024
Large Language Models are Easily Confused: A Quantitative Metric, Security Implications and Typological AnalysisNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 Yiyi Chen Qiongxiu Li Russa Biswas Johannes Bjerva 287 7 0 17 Oct 2024
From Citations to Criticality: Predicting Legal Decision Influence in the Multilingual Swiss JurisprudenceAnnual Meeting of the Association for Computational Linguistics (ACL), 2024 Ronja Stern Ken Kawamura Matthias Sturmer Ilias Chalkidis Joel Niklaus AILaw ELM 231 1 0 17 Oct 2024
Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland Luca Rolshoven Vishvaksenan Rasiah Srinanda Brügger Bose Sarah Hostettler Lara Burkhalter Matthias Sturmer Joel Niklaus ELM AILaw 259 4 0 17 Oct 2024
Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual InterventionAnnual Meeting of the Association for Computational Linguistics (ACL), 2024 Weixuan Wang Minghao Wu Barry Haddow Alexandra Birch LRM 194 11 0 16 Oct 2024
Evaluating Morphological Compositional Generalization in Large Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 Mete Ismayilzada Yuan Chiang Jonne Sälevä Hale Sirin Abdullatif Köksal Bhuwan Dhingra Antoine Bosselut Lonneke van der Plas Duygu Ataman 363 14 0 16 Oct 2024
Tokenization and Morphology in Multilingual Language Models: A Comparative Analysis of mT5 and ByT5 Thao Anh Dang Limor Raviv Lukas Galke 261 9 0 15 Oct 2024
MARS: Multilingual Aspect-centric Review SummarisationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 Sandeep Sricharan Mukku Abinesh Kanagarajan Chetan Aggarwal Promod Yenigalla 172 1 0 13 Oct 2024
A Mixed-Language Multi-Document News Summarization Dataset and a Graphs-Based Extract-Generate ModelNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 Shengxiang Gao Fang nan Yongbing Zhang Yuxin Huang Kaiwen Tan Z. Yu 178 3 0 13 Oct 2024
Data Processing for the OpenGPT-X Model Family Nicolo' Brandizzi Hammam Abdelwahab Anirban Bhowmick Lennard Helmer Benny Jörg Stein ... Georg Rehm Dennis Wegener Nicolas Flores-Herr Joachim Kohler Johannes Leveling VLM 418 2 0 11 Oct 2024
Linguistically-Informed Multilingual Instruction Tuning: Is There an Optimal Set of Languages to Tune? Gürkan Soykan Gözde Gül Şahin 206 1 0 10 Oct 2024
Bahasa Harmony: A Comprehensive Dataset for Bahasa Text-to-Speech Synthesis with Discrete Codec Modeling of EnGen-TTSConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 Onkar Kishor Susladkar Vishesh Tripathi Biddwan Ahmed 95 0 0 09 Oct 2024
Signal Watermark on Large Language Models Zhenyu Xu Victor S. Sheng WaLM 76 2 0 09 Oct 2024
HumVI: A Multilingual Dataset for Detecting Violent Incidents Impacting Humanitarian AidConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 Hemank Lamba Anton Abilov Ke Zhang Elizabeth M. Olson Henry k. Dambanemuya ... David S. Batista Christina Wille A. Cahill Joel R. Tetreault Alex Jaimes 172 1 0 08 Oct 2024
Beyond Correlation: Interpretable Evaluation of Machine Translation MetricsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 Stefano Perrella Lorenzo Proietti Pere-Lluís Huguet Cabot Edoardo Barba Roberto Navigli 239 7 0 07 Oct 2024
DEPT: Decoupled Embeddings for Pre-training Language ModelsInternational Conference on Learning Representations (ICLR), 2024 Alex Iacob Lorenzo Sani Meghdad Kurmanji William F. Shen Xinchi Qiu Dongqi Cai Yan Gao Nicholas D. Lane VLM 1.3K 2 0 07 Oct 2024
Passage Retrieval of Polish Texts Using OKAPI BM25 and an Ensemble of Cross EncodersConference on Computer Science and Information Systems (FedCSIS), 2023 Jakub Pokrywka 131 1 0 06 Oct 2024
Inner-Probe: Discovering Copyright-related Data Generation in LLM Architecture Qichao Ma Rui-Jie Zhu Peiye Liu Renye Yan Fahong Zhang ... Meng Li Zhaofei Yu Zongwei Wang Yimao Cai Tiejun Huang 169 1 0 06 Oct 2024
CiMaTe: Citation Count Prediction Effectively Leveraging the Main Text Jun Hirako Ryohei Sasano Koichi Takeda 306 4 0 06 Oct 2024
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective Jinhao Li Jiaming Xu Shan Huang Yonghua Chen Wen Li ... Jiayi Pan Li Ding Hao Zhou Yu Wang Guohao Dai 540 43 0 06 Oct 2024
Upsample or Upweight? Balanced Training on Heavily Imbalanced DatasetsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 Tianjian Li Haoran Xu Weiting Tan Kenton Murray Daniel Khashabi 489 3 0 06 Oct 2024
Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on WikipediaConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 Farhan Samir Chan Young Park Anjalie Field Vered Shwartz Yulia Tsvetkov 156 7 0 05 Oct 2024
Entity Insertion in Multilingual Linked Corpora: The Case of WikipediaConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 Tomás Feith Akhil Arora Martin Gerlach Debjit Paul Robert West KELM 174 7 0 05 Oct 2024
MetricX-24: The Google Submission to the WMT 2024 Metrics Shared TaskConference on Machine Translation (WMT), 2024 Juraj Juraska Daniel Deutsch Mara Finkelstein Markus Freitag 219 79 0 04 Oct 2024
Cross-lingual Transfer for Automatic Question Generation by Learning Interrogative Structures in Target LanguagesConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 Seonjeong Hwang Yunsu Kim Gary Geunbae Lee 192 3 0 04 Oct 2024
MetaOOD: Automatic Selection of OOD Detection ModelsInternational Conference on Learning Representations (ICLR), 2024 Yuehan Qin Yichi Zhang Yi Nian Xueying Ding Yue Zhao OODD 245 1 0 04 Oct 2024
X-ALMA: Plug & Play Modules and Adaptive Rejection for Quality Translation at ScaleInternational Conference on Learning Representations (ICLR), 2024 Haoran Xu Kenton W. Murray Philipp Koehn Hieu T. Hoang Akiko Eriguchi Huda Khayrallah 295 27 0 04 Oct 2024
CorPipe at CRAC 2024: Predicting Zero Mentions from Raw Text Milan Straka LRM 153 6 0 03 Oct 2024
IndicSentEval: How Effectively do Multilingual Transformer Models encode Linguistic Properties for Indic Languages? Akhilesh Aravapalli Mounika Marreddy R. Mamidi R. Mamidi Subba Reddy Oota 241 2 0 03 Oct 2024
GADFA: Generator-Assisted Decision-Focused Approach for Opinion Expressing Timing IdentificationInternational Conference on Computational Linguistics (COLING), 2024 Chung-Chi Chen Hiroya Takamura Ichiro Kobayashi Yusuke Miyao 164 0 0 02 Oct 2024
Distilling Analysis from Generative Models for Investment Decisions Chung-Chi Chen Hiroya Takamura Ichiro Kobayashi Yusuke Miyao 56 0 0 02 Oct 2024
Cross-lingual Back-Parsing: Utterance Synthesis from Meaning Representation for Zero-Resource Semantic ParsingConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 Deokhyung Kang Seonjeong Hwang Yunsu Kim Gary Geunbae Lee 261 0 0 01 Oct 2024
Multi-Target Cross-Lingual Summarization: a novel task and a language-neutral approachConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 Diogo Pernes Gonçalo M. Correia Afonso Mendes 279 3 0 01 Oct 2024
Language Resources in Spanish for Automatic Text Simplification across Domains Antonio Moreno-Sandoval Leonardo Campillos-Llanos Ana García-Serrano 28 1 0 30 Sep 2024
SSR: Alignment-Aware Modality Connector for Speech Language ModelsInternational Workshop on Spoken Language Translation (IWSLT), 2024 Weiting Tan Hirofumi Inaguma Ning Dong Paden Tomasello Xutai Ma 370 9 0 30 Sep 2024
Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs Mehdi Ali Michael Fromm Klaudia Thellmann Jan Ebert Alexander Arno Weber ... René Jäkel Georg Rehm Stefan Kesselheim Joachim Kohler Nicolas Flores-Herr 273 13 0 30 Sep 2024

All Papers

mT5: A massively multilingual pre-trained text-to-text transformer

Papers citing "mT5: A massively multilingual pre-trained text-to-text transformer"