Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2106.03193
Cited By
The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation
Transactions of the Association for Computational Linguistics (TACL), 2021
6 June 2021
Naman Goyal
Cynthia Gao
Vishrav Chaudhary
Peng-Jen Chen
Guillaume Wenzek
Da Ju
Sanjan Krishnan
MarcÁurelio Ranzato
Francisco Guzman
Angela Fan
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation"
46 / 246 papers shown
Adam Mickiewicz University at WMT 2022: NER-Assisted and Quality-Aware Neural Machine Translation
Conference on Machine Translation (WMT), 2022
Artur Nowakowski
Gabriela Pałka
Kamil Guttmann
Miko Pokrywka
206
6
0
07 Sep 2022
Multilingual Bidirectional Unsupervised Translation Through Multilingual Finetuning and Back-Translation
Bryan Li
Mohammad Sadegh Rasooli
Ajay Patel
Chris Callison-Burch
207
4
0
06 Sep 2022
Domain-Specific Text Generation for Machine Translation
Conference of the Association for Machine Translation in the Americas (AMTA), 2022
Yasmin Moslem
Rejwanul Haque
John D. Kelleher
Andy Way
199
24
0
11 Aug 2022
Language Tokens: A Frustratingly Simple Approach Improves Zero-Shot Performance of Multilingual Translation
Conference of the Association for Machine Translation in the Americas (AMTA), 2022
Muhammad N. ElNokrashy
Amr Hendy
Mohamed Maher
Mohamed Afify
Hany Awadalla
164
2
0
11 Aug 2022
AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model
Saleh Soltan
Shankar Ananthakrishnan
Jack G. M. FitzGerald
Rahul Gupta
Wael Hamza
...
Mukund Sridhar
Fabian Triefenbach
Apurv Verma
Gokhan Tur
Premkumar Natarajan
367
89
0
02 Aug 2022
Bitext Mining Using Distilled Sentence Representations for Low-Resource Languages
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Kevin Heffernan
Onur cCelebi
Holger Schwenk
276
65
0
25 May 2022
Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Tu Vu
Aditya Barua
Brian Lester
Daniel Cer
Mohit Iyyer
Noah Constant
CLL
338
72
0
25 May 2022
FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech
Spoken Language Technology Workshop (SLT), 2022
Alexis Conneau
Min Ma
Simran Khanuja
Yu Zhang
Vera Axelrod
Siddharth Dalmia
Jason Riesa
Clara E. Rivera
Ankur Bapna
VLM
505
476
0
25 May 2022
T-Modules: Translation Modules for Zero-Shot Cross-Modal Machine Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Paul-Ambroise Duquenne
Hongyu Gong
Benoît Sagot
Holger Schwenk
226
21
0
24 May 2022
Towards Opening the Black Box of Neural Machine Translation: Source and Target Interpretations of the Transformer
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Javier Ferrando
Gerard I. Gállego
Belen Alastruey
Carlos Escolano
Marta R. Costa-jussá
341
53
0
23 May 2022
Local Byte Fusion for Neural Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Makesh Narsimhan Sreedhar
Xiangpeng Wan
Yu-Jie Cheng
Junjie Hu
484
7
0
23 May 2022
Multilingual Machine Translation with Hyper-Adapters
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Christos Baziotis
Mikel Artetxe
James Cross
Shruti Bhosale
266
27
0
22 May 2022
What Do Compressed Multilingual Machine Translation Models Forget?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Alireza Mohammadshahi
Vassilina Nikoulina
Alexandre Berard
Caroline Brun
James Henderson
Laurent Besacier
AI4CE
400
12
0
22 May 2022
Building Machine Translation Systems for the Next Thousand Languages
Ankur Bapna
Isaac Caswell
Julia Kreutzer
Orhan Firat
D. Esch
...
Apurva Shah
Yanping Huang
Zhiwen Chen
Yonghui Wu
Macduff Hughes
325
110
0
09 May 2022
A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
David Ifeoluwa Adelani
Jesujoba Oluwadara Alabi
Angela Fan
Julia Kreutzer
Xiaoyu Shen
...
Ayodele Awokoya
Happy Buzaaba
Blessing K. Sibanda
Andiswa Bukula
Sam Manthalu
446
131
0
04 May 2022
Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation
International Conference on Language Resources and Evaluation (LREC), 2022
Idris Abdulmumin
S. Dash
Musa Abdullahi Dawud
Shantipriya Parida
Shamsuddeen Hassan Muhammad
Ibrahim Said Ahmad
Subhadarshi Panda
Ondrej Bojar
B. Galadanci
Bello Shehu Bello
289
21
0
02 May 2022
How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training?
Conference of the Association for Machine Translation in the Americas (AMTA), 2022
Shiyue Zhang
Vishrav Chaudhary
Naman Goyal
James Cross
Guillaume Wenzek
Joey Tianyi Zhou
Francisco Guzman
224
22
0
29 Apr 2022
MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jack G. M. FitzGerald
C. Hench
Charith Peris
Scott Mackie
Kay Rottmann
...
Laurie Crist
Misha Britan
Wouter Leeuwis
Gokhan Tur
Premkumar Natarajan
245
171
0
18 Apr 2022
MuCoT: Multilingual Contrastive Training for Question-Answering in Low-resource Languages
Gokul Karthik Kumar
Abhishek Singh Gehlot
Sahal Shaji Mullappilly
Karthik Nandakumar
154
14
0
12 Apr 2022
MMTAfrica: Multilingual Machine Translation for African Languages
Conference on Machine Translation (WMT), 2022
Chris C. Emezue
Bonaventure F. P. Dossou
134
25
0
08 Apr 2022
XTREME-S: Evaluating Cross-lingual Speech Representations
Interspeech (Interspeech), 2022
Alexis Conneau
Ankur Bapna
Yu Zhang
Min Ma
Patrick von Platen
...
Orhan Firat
Michael Auli
Sebastian Ruder
Jason Riesa
Melvin Johnson
VLM
AILaw
ELM
272
23
0
21 Mar 2022
MCoNaLa: A Benchmark for Code Generation from Multiple Natural Languages
Findings (Findings), 2022
Zhiruo Wang
Grace Cuenca
Shuyan Zhou
Frank F. Xu
Graham Neubig
224
60
0
16 Mar 2022
Multilingual Mix: Example Interpolation Improves Multilingual Neural Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Yong Cheng
Ankur Bapna
Orhan Firat
Yuan Cao
Pidong Wang
Wolfgang Macherey
155
15
0
15 Mar 2022
DeepNet: Scaling Transformers to 1,000 Layers
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Hongyu Wang
Shuming Ma
Li Dong
Shaohan Huang
Dongdong Zhang
Furu Wei
MoE
AI4CE
319
205
0
01 Mar 2022
OCR Improves Machine Translation for Low-Resource Languages
Findings (Findings), 2022
Oana Ignat
Jean Maillard
Vishrav Chaudhary
Francisco Guzmán
197
13
0
27 Feb 2022
Sequence-to-Sequence Resources for Catalan
Ona de Gibert
Ksenia Kharitonova
B. Figueras
Jordi Armengol-Estapé
Maite Melero
68
0
0
14 Feb 2022
mSLAM: Massively multilingual joint pre-training for speech and text
Ankur Bapna
Colin Cherry
Yu Zhang
Ye Jia
Melvin Johnson
Yong Cheng
Simran Khanuja
Jason Riesa
Alexis Conneau
VLM
178
122
0
03 Feb 2022
Does Transliteration Help Multilingual Language Modeling?
Findings (Findings), 2022
Ibraheem Muhammad Moosa
Mahmud Elahi Akhter
Ashfia Binte Habib
318
14
0
29 Jan 2022
Few-shot Learning with Multilingual Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Xi Lin
Todor Mihaylov
Mikel Artetxe
Tianlu Wang
Shuohui Chen
...
Luke Zettlemoyer
Zornitsa Kozareva
Mona T. Diab
Ves Stoyanov
Xian Li
BDL
ELM
LRM
359
355
0
20 Dec 2021
Data Processing Matters: SRPH-Konvergen AI's Machine Translation System for WMT'21
Conference on Machine Translation (WMT), 2021
Lintang Sutawika
Jan Christian Blaise Cruz
108
3
0
20 Nov 2021
Multilingual Machine Translation Systems from Microsoft for WMT21 Shared Task
Conference on Machine Translation (WMT), 2021
Jian Yang
Shuming Ma
Haoyang Huang
Dongdong Zhang
Li Dong
...
Alexandre Muzio
Saksham Singhal
Hany Awadalla
Xia Song
Furu Wei
130
46
0
03 Nov 2021
Empirical Analysis of Korean Public AI Hub Parallel Corpora and in-depth Analysis using LIWC
Chanjun Park
Midan Shim
Sugyeong Eo
Seolhwa Lee
Jaehyung Seo
Hyeonseok Moon
Heuiseok Lim
115
8
0
28 Oct 2021
Continual Learning in Multilingual NMT via Language-Specific Embeddings
Conference on Machine Translation (WMT), 2021
Alexandre Berard
CLL
169
23
0
20 Oct 2021
Towards Making the Most of Multilingual Pretraining for Zero-Shot Neural Machine Translation
Guanhua Chen
Shuming Ma
Yun-Nung Chen
Dongdong Zhang
Jia Pan
Wenping Wang
Furu Wei
LRM
182
17
0
16 Oct 2021
Alternative Input Signals Ease Transfer in Multilingual Machine Translation
Simeng Sun
Angela Fan
James Cross
Vishrav Chaudhary
C. Tran
Philipp Koehn
Francisco Guzman
134
17
0
15 Oct 2021
Cross-Lingual Open-Domain Question Answering with Answer Sentence Generation
Benjamin Muller
Luca Soldaini
Rik Koncel-Kedziorski
Eric Lind
Alessandro Moschitti
LRM
203
10
0
14 Oct 2021
The Low-Resource Double Bind: An Empirical Study of Pruning for Low-Resource Machine Translation
Orevaoghene Ahia
Julia Kreutzer
Sara Hooker
307
58
0
06 Oct 2021
How BPE Affects Memorization in Transformers
Eugene Kharitonov
Marco Baroni
Dieuwke Hupkes
444
37
0
06 Oct 2021
Efficient Inference for Multilingual Neural Machine Translation
Alexandre Berard
Dain Lee
Stéphane Clinchant
K. Jung
Vassilina Nikoulina
319
12
0
14 Sep 2021
Evaluating Multiway Multilingual NMT in the Turkic Languages
Jamshidbek Mirzakhalov
A. Babu
Aigiz Kunafin
Ahsan Wahab
Behzodbek Moydinboyev
...
Julia Kreutzer
Francis M. Tyers
Orhan Firat
John Licato
Sriram Chellappan
ELM
185
14
0
13 Sep 2021
IndicBART: A Pre-trained Model for Indic Natural Language Generation
Findings (Findings), 2021
Mary Dabre
Himani Shrotriya
Anoop Kunchukuttan
Ratish Puduppully
Mitesh M. Khapra
Pratyush Kumar
247
87
0
07 Sep 2021
Survey of Low-Resource Machine Translation
Computational Linguistics (CL), 2021
Barry Haddow
Rachel Bawden
Antonio Valerio Miceli Barone
Jindvrich Helcl
Alexandra Birch
AIMat
512
200
0
01 Sep 2021
The paradox of the compositionality of natural language: a neural machine translation case study
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Verna Dankers
Elia Bruni
Dieuwke Hupkes
CoGe
376
86
0
12 Aug 2021
Signal Transformer: Complex-valued Attention and Meta-Learning for Signal Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yihong Dong
Ying Peng
Muqiao Yang
Songtao Lu
Qingjiang Shi
411
12
0
05 Jun 2021
Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages
Transactions of the Association for Computational Linguistics (TACL), 2021
Gowtham Ramesh
Sumanth Doddapaneni
Aravinth Bheemaraj
Mayank Jobanputra
AK Raghavan
...
K. Deepak
Vivek Raghavan
Anoop Kunchukuttan
Pratyush Kumar
Mitesh Khapra
LRM
373
268
0
12 Apr 2021
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
Transactions of the Association for Computational Linguistics (TACL), 2021
Julia Kreutzer
Isaac Caswell
Lisa Wang
Ahsan Wahab
D. Esch
...
Duygu Ataman
Orevaoghene Ahia
Oghenefego Ahia
Sweta Agrawal
Mofetoluwa Adeyemi
441
311
0
22 Mar 2021
Previous
1
2
3
4
5
Page 5 of 5