Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2008.00401
Cited By
Multilingual Translation with Extensible Multilingual Pretraining and Finetuning
2 August 2020
Y. Tang
C. Tran
Xian Li
Peng-Jen Chen
Naman Goyal
Vishrav Chaudhary
Jiatao Gu
Angela Fan
CLL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Multilingual Translation with Extensible Multilingual Pretraining and Finetuning"
50 / 232 papers shown
Title
DUB: Discrete Unit Back-translation for Speech Translation
Dong Zhang
Rong Ye
Tom Ko
Mingxuan Wang
Yaqian Zhou
11
23
0
19 May 2023
Accelerating Transformer Inference for Translation via Parallel Decoding
Andrea Santilli
Silvio Severino
Emilian Postolache
Valentino Maiorca
Michele Mancusi
R. Marin
Emanuele Rodolà
14
77
0
17 May 2023
Language Model Tokenizers Introduce Unfairness Between Languages
Aleksandar Petrov
Emanuele La Malfa
Philip H. S. Torr
Adel Bibi
16
96
0
17 May 2023
RC3: Regularized Contrastive Cross-lingual Cross-modal Pre-training
Chulun Zhou
Yunlong Liang
Fandong Meng
Jinan Xu
Jinsong Su
Jie Zhou
VLM
16
4
0
13 May 2023
Perturbation-based QE: An Explainable, Unsupervised Word-level Quality Estimation Method for Blackbox Machine Translation
Tu Anh Dinh
J. Niehues
18
5
0
12 May 2023
SemEval-2023 Task 2: Fine-grained Multilingual Named Entity Recognition (MultiCoNER 2)
B. Fetahu
Sudipta Kar
Zhiyu Zoey Chen
Oleg Rokhlenko
S. Malmasi
22
56
0
11 May 2023
Train Global, Tailor Local: Minimalist Multilingual Translation into Endangered Languages
Zhong Zhou
J. Niehues
Alexander Waibel
22
0
0
05 May 2023
GreekBART: The First Pretrained Greek Sequence-to-Sequence Model
Iakovos Evdaimon
Hadi Abdine
Christos Xypolopoulos
Stamatis Outsios
Michalis Vazirgiannis
Giorgos Stamou
VLM
23
7
0
03 Apr 2023
Indian Language Summarization using Pretrained Sequence-to-Sequence Models
Ashok Urlana
S. Bhatt
Nirmal Surange
Manish Shrivastava
46
13
0
25 Mar 2023
PR-MCS: Perturbation Robust Metric for MultiLingual Image Captioning
Yongil Kim
Yerin Hwang
Hyeongu Yun
Seunghyun Yoon
Trung Bui
Kyomin Jung
17
6
0
15 Mar 2023
ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation
Bang-ju Yang
Fenglin Liu
Yuexian Zou
Xian Wu
Yaowei Wang
David A. Clifton
23
8
0
11 Mar 2023
CroCoSum: A Benchmark Dataset for Cross-Lingual Code-Switched Summarization
Ruochen Zhang
Carsten Eickhoff
40
5
0
07 Mar 2023
Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!
Shiwei Liu
Tianlong Chen
Zhenyu (Allen) Zhang
Xuxi Chen
Tianjin Huang
Ajay Jaiswal
Zhangyang Wang
19
29
0
03 Mar 2023
A Persian Benchmark for Joint Intent Detection and Slot Filling
M. Akbari
Amir-Hossein Karimi
Tayyebeh Saeedi
Zeinab Saeidi
Kiana Ghezelbash
Fatemeh Shamsezat
Mohammad Akbari
Ali Mohades
15
3
0
01 Mar 2023
Evaluating and Improving the Coreference Capabilities of Machine Translation Models
Asaf Yehudai
Arie Cattan
Omri Abend
Gabriel Stanovsky
LRM
ELM
22
2
0
16 Feb 2023
READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises
Chenglei Si
Zhengyan Zhang
Yingfa Chen
Xiaozhi Wang
Zhiyuan Liu
Maosong Sun
AAML
19
1
0
14 Feb 2023
Training-free Lexical Backdoor Attacks on Language Models
Yujin Huang
Terry Yue Zhuo
Qiongkai Xu
Han Hu
Xingliang Yuan
Chunyang Chen
SILM
17
42
0
08 Feb 2023
TransFool: An Adversarial Attack against Neural Machine Translation Models
Sahar Sadrizadeh
Ljiljana Dolamic
P. Frossard
SILM
AAML
27
12
0
02 Feb 2023
LoRaLay: A Multilingual and Multimodal Dataset for Long Range and Layout-Aware Summarization
Laura Nguyen
Thomas Scialom
Benjamin Piwowarski
Jacopo Staiano
19
7
0
26 Jan 2023
Cross-lingual Argument Mining in the Medical Domain
Anar Yeginbergenova
Rodrigo Agerri
42
7
0
25 Jan 2023
SWING: Balancing Coverage and Faithfulness for Dialogue Summarization
Kung-Hsiang Huang
Siffi Singh
Xiaofei Ma
Wei Xiao
Wei Xiao
Nicholas Dingwall
William Yang Wang
Kathleen McKeown
HILM
21
13
0
25 Jan 2023
Adapting Multilingual Speech Representation Model for a New, Underresourced Language through Multilingual Fine-tuning and Continued Pretraining
Karol Nowakowski
M. Ptaszynski
Kyoko Murasaki
Jagna Nieuwazny
15
23
0
18 Jan 2023
EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records
Gyubok Lee
Hyeonji Hwang
Seongsu Bae
Yeonsu Kwon
W. Shin
Seongjun Yang
Minjoon Seo
Jong-Yeup Kim
E. Choi
13
18
0
16 Jan 2023
Extrinsic Evaluation of Machine Translation Metrics
Nikita Moghe
Tom Sherborne
Mark Steedman
Alexandra Birch
ELM
18
17
0
20 Dec 2022
Memory-efficient NLLB-200: Language-specific Expert Pruning of a Massively Multilingual Machine Translation Model
Yeskendir Koishekenov
Alexandre Berard
Vassilina Nikoulina
MoE
22
28
0
19 Dec 2022
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
Ioannis Tsiamas
José A. R. Fonollosa
Marta R. Costa-jussá
31
6
0
19 Dec 2022
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units
H. Inaguma
Sravya Popuri
Ilia Kulikov
Peng-Jen Chen
Changhan Wang
Yu-An Chung
Yun Tang
Ann Lee
Shinji Watanabe
J. Pino
43
51
0
15 Dec 2022
Fixing MoE Over-Fitting on Low-Resource Languages in Multilingual Machine Translation
Maha Elbayad
Anna Y. Sun
Shruti Bhosale
MoE
38
8
0
15 Dec 2022
DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding
Jianhao Yan
Jin Xu
Fandong Meng
Jie Zhou
Yue Zhang
16
3
0
08 Dec 2022
Findings of the WMT 2022 Shared Task on Translation Suggestion
Zhen Yang
Fandong Meng
Yingxue Zhang
Ernan Li
Jie Zhou
LRM
22
2
0
30 Nov 2022
Extending the Subwording Model of Multilingual Pretrained Models for New Languages
K. Imamura
Eiichiro Sumita
VLM
6
3
0
29 Nov 2022
TSMind: Alibaba and Soochow University's Submission to the WMT22 Translation Suggestion Task
Xin Ge
Ke Min Wang
Jiayi Wang
Nini Xiao
Xiangyu Duan
Yu Zhao
Yuqi Zhang
14
2
0
16 Nov 2022
A Benchmark and Dataset for Post-OCR text correction in Sanskrit
Ayush Maheshwari
Nikhil Singh
Amrith Krishna
Ganesh Ramakrishnan
13
12
0
15 Nov 2022
Hierarchical Phrase-based Sequence-to-Sequence Learning
Bailin Wang
Ivan Titov
Jacob Andreas
Yoon Kim
17
7
0
15 Nov 2022
Efficient Speech Translation with Pre-trained Models
Zhaolin Li
J. Niehues
11
2
0
09 Nov 2022
Continual Learning of Neural Machine Translation within Low Forgetting Risk Regions
Shuhao Gu
Bojie Hu
Yang Feng
CLL
23
11
0
03 Nov 2022
Robust Domain Adaptation for Pre-trained Multilingual Neural Machine Translation Models
Mathieu Grosso
Pirashanth Ratnamogan
Alexis Mathey
William Vanhuffel
Michael Fotso Fotso
17
3
0
26 Oct 2022
LANS: Large-scale Arabic News Summarization Corpus
Abdulaziz Alhamadani
Xuchao Zhang
Jianfeng He
Chang-Tien Lu
16
2
0
24 Oct 2022
RuCoLA: Russian Corpus of Linguistic Acceptability
Vladislav Mikhailov
T. Shamardina
Max Ryabinin
A. Pestova
I. Smurov
Ekaterina Artemova
17
28
0
23 Oct 2022
Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings
Iker García-Ferrero
Rodrigo Agerri
German Rigau
61
21
0
23 Oct 2022
SIT at MixMT 2022: Fluent Translation Built on Giant Pre-trained Models
A. Khan
Hrishikesh Kanade
G. Budhrani
Preet Jhanglani
Jia Xu
71
2
0
21 Oct 2022
Retrofitting Multilingual Sentence Embeddings with Abstract Meaning Representation
Deng Cai
Xin Li
Jackie Chun-Sing Ho
Lidong Bing
W. Lam
10
6
0
18 Oct 2022
RedApt: An Adaptor for wav2vec 2 Encoding \\ Faster and Smaller Speech Translation without Quality Compromise
Jinming Zhao
Haomiao Yang
Gholamreza Haffari
Ehsan Shareghi
VLM
11
2
0
16 Oct 2022
COFFEE: Counterfactual Fairness for Personalized Text Generation in Explainable Recommendation
Nan Wang
Qifan Wang
Yi-Chia Wang
Maziar Sanjabi
Jingzhou Liu
Hamed Firooz
Hongning Wang
Shaoliang Nie
28
6
0
14 Oct 2022
Investigating Massive Multilingual Pre-Trained Machine Translation Models for Clinical Domain via Transfer Learning
Lifeng Han
G. Erofeev
Irina Sorokina
Serge Gladkoff
Goran Nenadic
20
8
0
12 Oct 2022
Language-Family Adapters for Low-Resource Multilingual Neural Machine Translation
Alexandra Chronopoulou
Dario Stojanovski
Alexander M. Fraser
21
17
0
30 Sep 2022
Embarrassingly Easy Document-Level MT Metrics: How to Convert Any Pretrained Metric Into a Document-Level Metric
Giorgos Vernikos
Brian Thompson
Prashant Mathur
Marcello Federico
36
40
0
27 Sep 2022
Cem Mil Podcasts: A Spoken Portuguese Document Corpus For Multi-modal, Multi-lingual and Multi-Dialect Information Access Research
Ekaterina Garmash
Edgar Tanaka
Ann Clifton
Joana Correia
Sharmistha Jat
Winstead Zhu
R. Jones
Jussi Karlgren
6
3
0
23 Sep 2022
The first neural machine translation system for the Erzya language
David Dale
63
7
0
19 Sep 2022
Rethinking Round-Trip Translation for Machine Translation Evaluation
Terry Yue Zhuo
Qiongkai Xu
Xuanli He
Trevor Cohn
LRM
22
2
0
15 Sep 2022
Previous
1
2
3
4
5
Next