ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.11934
  4. Cited By
mT5: A massively multilingual pre-trained text-to-text transformer

mT5: A massively multilingual pre-trained text-to-text transformer

22 October 2020
Linting Xue
Noah Constant
Adam Roberts
Mihir Kale
Rami Al-Rfou
Aditya Siddhant
Aditya Barua
Colin Raffel
ArXivPDFHTML

Papers citing "mT5: A massively multilingual pre-trained text-to-text transformer"

50 / 358 papers shown
Title
CroissantLLM: A Truly Bilingual French-English Language Model
CroissantLLM: A Truly Bilingual French-English Language Model
Manuel Faysse
Patrick Fernandes
Nuno M. Guerreiro
António Loison
Duarte M. Alves
...
François Yvon
André F.T. Martins
Gautier Viaud
C´eline Hudelot
Pierre Colombo
43
32
0
01 Feb 2024
A Comparative Analysis of Noise Reduction Methods in Sentiment Analysis
  on Noisy Bangla Texts
A Comparative Analysis of Noise Reduction Methods in Sentiment Analysis on Noisy Bangla Texts
Kazi Toufique Elahi
Tasnuva Binte Rahman
Shakil Shahriar
Samir Sarker
Md. Tanvir Rouf Shawon
G. M. Shahariar
18
1
0
25 Jan 2024
Contrastive Preference Optimization: Pushing the Boundaries of LLM
  Performance in Machine Translation
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation
Haoran Xu
Amr Sharaf
Yunmo Chen
Weiting Tan
Lingfeng Shen
Benjamin Van Durme
Kenton W. Murray
Young Jin Kim
ALM
36
197
0
16 Jan 2024
PIXAR: Auto-Regressive Language Modeling in Pixel Space
PIXAR: Auto-Regressive Language Modeling in Pixel Space
Yintao Tai
Xiyang Liao
Alessandro Suglia
Antonio Vergari
MLLM
21
7
0
06 Jan 2024
Multilingual large language models leak human stereotypes across
  language boundaries
Multilingual large language models leak human stereotypes across language boundaries
Yang Trista Cao
Anna Sotnikova
Jieyu Zhao
Linda X. Zou
Rachel Rudinger
Hal Daumé
PILM
18
10
0
12 Dec 2023
Leveraging Domain Adaptation and Data Augmentation to Improve Quránic
  IR in English and Arabic
Leveraging Domain Adaptation and Data Augmentation to Improve Quránic IR in English and Arabic
Vera Pavlova
21
2
0
05 Dec 2023
Towards A Foundation Model For Trajectory Intelligence
Towards A Foundation Model For Trajectory Intelligence
Alameen Najjar
7
2
0
30 Nov 2023
Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings
Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings
Andrea W Wen-Yi
David Mimno
25
14
0
29 Nov 2023
RETSim: Resilient and Efficient Text Similarity
RETSim: Resilient and Efficient Text Similarity
Marina Zhang
Owen Vallis
Aysegul Bumin
Tanay Vakharia
Elie Bursztein
23
1
0
28 Nov 2023
DP-NMT: Scalable Differentially-Private Machine Translation
DP-NMT: Scalable Differentially-Private Machine Translation
Timour Igamberdiev
Doan Nam Long Vu
Felix Künnecke
Zhuo Yu
Jannik Holmer
Ivan Habernal
27
7
0
24 Nov 2023
Prompt Pool based Class-Incremental Continual Learning for Dialog State
  Tracking
Prompt Pool based Class-Incremental Continual Learning for Dialog State Tracking
Hong Liu
Yucheng Cai
Yuan Zhou
Zhijian Ou
Yi Huang
Junlan Feng
CLL
19
2
0
17 Nov 2023
Take One Step at a Time to Know Incremental Utility of Demonstration: An
  Analysis on Reranking for Few-Shot In-Context Learning
Take One Step at a Time to Know Incremental Utility of Demonstration: An Analysis on Reranking for Few-Shot In-Context Learning
Kazuma Hashimoto
K. Raman
Michael Bendersky
35
2
0
16 Nov 2023
Language and Task Arithmetic with Parameter-Efficient Layers for
  Zero-Shot Summarization
Language and Task Arithmetic with Parameter-Efficient Layers for Zero-Shot Summarization
Alexandra Chronopoulou
Jonas Pfeiffer
Joshua Maynez
Xinyi Wang
Sebastian Ruder
Priyanka Agrawal
MoMe
24
14
0
15 Nov 2023
Structural Priming Demonstrates Abstract Grammatical Representations in
  Multilingual Language Models
Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language Models
J. Michaelov
Catherine Arnett
Tyler A. Chang
Benjamin Bergen
34
12
0
15 Nov 2023
How Vocabulary Sharing Facilitates Multilingualism in LLaMA?
How Vocabulary Sharing Facilitates Multilingualism in LLaMA?
Fei Yuan
Shuai Yuan
Zhiyong Wu
Lei Li
20
10
0
15 Nov 2023
MELA: Multilingual Evaluation of Linguistic Acceptability
MELA: Multilingual Evaluation of Linguistic Acceptability
Ziyin Zhang
Yikang Liu
Wei Huang
Junyu Mao
Rui Wang
Hai Hu
22
3
0
15 Nov 2023
Leveraging LLMs for Synthesizing Training Data Across Many Languages in
  Multilingual Dense Retrieval
Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval
Nandan Thakur
Jianmo Ni
Gustavo Hernández Ábrego
John Wieting
Jimmy J. Lin
Daniel Matthew Cer
RALM
29
12
0
10 Nov 2023
There's no Data Like Better Data: Using QE Metrics for MT Data Filtering
There's no Data Like Better Data: Using QE Metrics for MT Data Filtering
Jan-Thorsten Peter
David Vilar
Daniel Deutsch
Mara Finkelstein
Juraj Juraska
Markus Freitag
9
16
0
09 Nov 2023
Cultural Adaptation of Recipes
Cultural Adaptation of Recipes
Yong Cao
Yova Kementchedjhieva
Ruixiang Cui
Antonia Karamolegkou
Li Zhou
Megan Dare
Lucia Donatelli
Daniel Hershcovich
18
5
0
26 Oct 2023
The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64
  Languages
The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64 Languages
Chiyu Zhang
Khai Duy Doan
Qisheng Liao
Muhammad Abdul-Mageed
34
6
0
23 Oct 2023
Improving Cross-Lingual Transfer through Subtree-Aware Word Reordering
Improving Cross-Lingual Transfer through Subtree-Aware Word Reordering
Ofir Arviv
Dmitry Nikolaev
Taelin Karidi
Omri Abend
LRM
30
3
0
20 Oct 2023
A Systematic Study of Performance Disparities in Multilingual
  Task-Oriented Dialogue Systems
A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems
Songbo Hu
Han Zhou
Moy Yuan
Milan Gritta
Guchun Zhang
Ignacio Iacobacci
Anna Korhonen
Ivan Vulić
26
3
0
19 Oct 2023
CAW-coref: Conjunction-Aware Word-level Coreference Resolution
CAW-coref: Conjunction-Aware Word-level Coreference Resolution
Karel DÓosterlinck
Semere Kiros Bitew
Brandon Papineau
Christopher Potts
Thomas Demeester
Chris Develder
24
8
0
09 Oct 2023
A Benchmark for Learning to Translate a New Language from One Grammar
  Book
A Benchmark for Learning to Translate a New Language from One Grammar Book
Garrett Tanzer
Mirac Suzgun
Chenguang Xi
Dan Jurafsky
Luke Melas-Kyriazi
24
51
0
28 Sep 2023
GECTurk: Grammatical Error Correction and Detection Dataset for Turkish
GECTurk: Grammatical Error Correction and Detection Dataset for Turkish
Atakan Kara
Farrin Marouf Sofian
Andrew Bond
Gözde Gül Sahin
16
4
0
20 Sep 2023
DictaBERT: A State-of-the-Art BERT Suite for Modern Hebrew
DictaBERT: A State-of-the-Art BERT Suite for Modern Hebrew
Shaltiel Shmidman
Avi Shmidman
Moshe Koppel
17
7
0
31 Aug 2023
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Jiasheng Ye
Zaixiang Zheng
Yu Bao
Lihua Qian
Quanquan Gu
DiffM
52
14
0
23 Aug 2023
RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder
  Language Models
RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models
Jie Huang
Wei Ping
Peng-Tao Xu
M. Shoeybi
Kevin Chen-Chuan Chang
Bryan Catanzaro
RALM
27
33
0
15 Aug 2023
NewsDialogues: Towards Proactive News Grounded Conversation
NewsDialogues: Towards Proactive News Grounded Conversation
Siheng Li
Yichun Yin
Cheng Yang
Wangjie Jiang
Yiwei Li
Ze-Long Cheng
Lifeng Shang
Xin Jiang
Qun Liu
Yujiu Yang
21
5
0
12 Aug 2023
Okapi: Instruction-tuned Large Language Models in Multiple Languages
  with Reinforcement Learning from Human Feedback
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
Viet Dac Lai
Chien Van Nguyen
Nghia Trung Ngo
Thuat Nguyen
Franck Dernoncourt
Ryan A. Rossi
Thien Huu Nguyen
ALM
38
127
0
29 Jul 2023
Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for
  Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems
Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems
Songbo Hu
Han Zhou
Mete Hergul
Milan Gritta
Guchun Zhang
Ignacio Iacobacci
Ivan Vulić
Anna Korhonen
24
10
0
26 Jul 2023
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Foundational Models Defining a New Era in Vision: A Survey and Outlook
Muhammad Awais
Muzammal Naseer
Salman Khan
Rao Muhammad Anwer
Hisham Cholakkal
M. Shah
Ming Yang
F. Khan
VLM
22
117
0
25 Jul 2023
Empowering Cross-lingual Behavioral Testing of NLP Models with
  Typological Features
Empowering Cross-lingual Behavioral Testing of NLP Models with Typological Features
Ester Hlavnova
Sebastian Ruder
30
5
0
11 Jul 2023
Low-Resource Cross-Lingual Adaptive Training for Nigerian Pidgin
Low-Resource Cross-Lingual Adaptive Training for Nigerian Pidgin
Pin-Jie Lin
Muhammed Saeed
Ernie Chang
Merel C. J. Scholman
32
5
0
01 Jul 2023
On Evaluating Multilingual Compositional Generalization with Translated
  Datasets
On Evaluating Multilingual Compositional Generalization with Translated Datasets
Zi Wang
Daniel Hershcovich
18
7
0
20 Jun 2023
DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning
DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning
Hengli Li
Songchun Zhu
Zilong Zheng
11
8
0
15 Jun 2023
Large-scale Language Model Rescoring on Long-form Data
Large-scale Language Model Rescoring on Long-form Data
Tongzhou Chen
Cyril Allauzen
Yinghui Huang
Daniel S. Park
David Rybach
...
Rodrigo Cabrera
Kartik Audhkhasi
Bhuvana Ramabhadran
Pedro J. Moreno
Michael Riley
22
14
0
13 Jun 2023
Can current NLI systems handle German word order? Investigating language
  model performance on a new German challenge set of minimal pairs
Can current NLI systems handle German word order? Investigating language model performance on a new German challenge set of minimal pairs
Ines Reinig
K. Markert
16
0
0
07 Jun 2023
Unsupervised Paraphrasing of Multiword Expressions
Unsupervised Paraphrasing of Multiword Expressions
Takashi Wada
Yuji Matsumoto
Timothy Baldwin
Jey Han Lau
24
0
0
02 Jun 2023
PanoGen: Text-Conditioned Panoramic Environment Generation for
  Vision-and-Language Navigation
PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation
Jialu Li
Mohit Bansal
DiffM
27
49
0
30 May 2023
PaLI-X: On Scaling up a Multilingual Vision and Language Model
PaLI-X: On Scaling up a Multilingual Vision and Language Model
Xi Chen
Josip Djolonga
Piotr Padlewski
Basil Mustafa
Soravit Changpinyo
...
Mojtaba Seyedhosseini
A. Angelova
Xiaohua Zhai
N. Houlsby
Radu Soricut
VLM
44
187
0
29 May 2023
Byte-Level Grammatical Error Correction Using Synthetic and Curated
  Corpora
Byte-Level Grammatical Error Correction Using Synthetic and Curated Corpora
Svanhvít Lilja Ingólfsdóttir
Pétur Orri Ragnarsson
H. Jónsson
Haukur Barri Símonarson
Vilhjálmur Þorsteinsson
Vésteinn Snæbjarnarson
SyDa
30
9
0
29 May 2023
A Practical Toolkit for Multilingual Question and Answer Generation
A Practical Toolkit for Multilingual Question and Answer Generation
Asahi Ushio
Fernando Alva-Manchego
Jose Camacho-Collados
SyDa
24
13
0
27 May 2023
Revisiting non-English Text Simplification: A Unified Multilingual
  Benchmark
Revisiting non-English Text Simplification: A Unified Multilingual Benchmark
Michael Joseph Ryan
Tarek Naous
Wei-ping Xu
24
24
0
25 May 2023
MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal
  Image Generation
MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation
Marco Bellagente
Manuel Brack
H. Teufel
Felix Friedrich
Bjorn Deiseroth
...
Koen Oostermeijer
Andres Felipe Cruz Salinas
P. Schramowski
Kristian Kersting
Samuel Weinbach
36
15
0
24 May 2023
An Efficient Multilingual Language Model Compression through Vocabulary
  Trimming
An Efficient Multilingual Language Model Compression through Vocabulary Trimming
Asahi Ushio
Yi Zhou
Jose Camacho-Collados
39
7
0
24 May 2023
Dolphin: A Challenging and Diverse Benchmark for Arabic NLG
Dolphin: A Challenging and Diverse Benchmark for Arabic NLG
El Moatez Billah Nagoudi
AbdelRahim Elmadany
Ahmed Oumar El-Shangiti
Muhammad Abdul-Mageed
LM&MA
30
17
0
24 May 2023
BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual
  Transfer
BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer
Akari Asai
Sneha Kudugunta
Xinyan Velocity Yu
Terra Blevins
Hila Gonen
Machel Reid
Yulia Tsvetkov
Sebastian Ruder
Hannaneh Hajishirzi
31
54
0
24 May 2023
Having Beer after Prayer? Measuring Cultural Bias in Large Language
  Models
Having Beer after Prayer? Measuring Cultural Bias in Large Language Models
Tarek Naous
Michael Joseph Ryan
Alan Ritter
Wei-ping Xu
24
85
0
23 May 2023
LLM-powered Data Augmentation for Enhanced Cross-lingual Performance
LLM-powered Data Augmentation for Enhanced Cross-lingual Performance
Chenxi Whitehouse
Monojit Choudhury
Alham Fikri Aji
SyDa
LRM
30
68
0
23 May 2023
Previous
12345678
Next