ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2008.00401
  4. Cited By
Multilingual Translation with Extensible Multilingual Pretraining and
  Finetuning

Multilingual Translation with Extensible Multilingual Pretraining and Finetuning

2 August 2020
Y. Tang
C. Tran
Xian Li
Peng-Jen Chen
Naman Goyal
Vishrav Chaudhary
Jiatao Gu
Angela Fan
    CLL
ArXivPDFHTML

Papers citing "Multilingual Translation with Extensible Multilingual Pretraining and Finetuning"

50 / 232 papers shown
Title
DUB: Discrete Unit Back-translation for Speech Translation
DUB: Discrete Unit Back-translation for Speech Translation
Dong Zhang
Rong Ye
Tom Ko
Mingxuan Wang
Yaqian Zhou
11
23
0
19 May 2023
Accelerating Transformer Inference for Translation via Parallel Decoding
Accelerating Transformer Inference for Translation via Parallel Decoding
Andrea Santilli
Silvio Severino
Emilian Postolache
Valentino Maiorca
Michele Mancusi
R. Marin
Emanuele Rodolà
14
77
0
17 May 2023
Language Model Tokenizers Introduce Unfairness Between Languages
Language Model Tokenizers Introduce Unfairness Between Languages
Aleksandar Petrov
Emanuele La Malfa
Philip H. S. Torr
Adel Bibi
16
96
0
17 May 2023
RC3: Regularized Contrastive Cross-lingual Cross-modal Pre-training
RC3: Regularized Contrastive Cross-lingual Cross-modal Pre-training
Chulun Zhou
Yunlong Liang
Fandong Meng
Jinan Xu
Jinsong Su
Jie Zhou
VLM
16
4
0
13 May 2023
Perturbation-based QE: An Explainable, Unsupervised Word-level Quality
  Estimation Method for Blackbox Machine Translation
Perturbation-based QE: An Explainable, Unsupervised Word-level Quality Estimation Method for Blackbox Machine Translation
Tu Anh Dinh
J. Niehues
18
5
0
12 May 2023
SemEval-2023 Task 2: Fine-grained Multilingual Named Entity Recognition
  (MultiCoNER 2)
SemEval-2023 Task 2: Fine-grained Multilingual Named Entity Recognition (MultiCoNER 2)
B. Fetahu
Sudipta Kar
Zhiyu Zoey Chen
Oleg Rokhlenko
S. Malmasi
22
56
0
11 May 2023
Train Global, Tailor Local: Minimalist Multilingual Translation into
  Endangered Languages
Train Global, Tailor Local: Minimalist Multilingual Translation into Endangered Languages
Zhong Zhou
J. Niehues
Alexander Waibel
22
0
0
05 May 2023
GreekBART: The First Pretrained Greek Sequence-to-Sequence Model
GreekBART: The First Pretrained Greek Sequence-to-Sequence Model
Iakovos Evdaimon
Hadi Abdine
Christos Xypolopoulos
Stamatis Outsios
Michalis Vazirgiannis
Giorgos Stamou
VLM
23
7
0
03 Apr 2023
Indian Language Summarization using Pretrained Sequence-to-Sequence
  Models
Indian Language Summarization using Pretrained Sequence-to-Sequence Models
Ashok Urlana
S. Bhatt
Nirmal Surange
Manish Shrivastava
46
13
0
25 Mar 2023
PR-MCS: Perturbation Robust Metric for MultiLingual Image Captioning
PR-MCS: Perturbation Robust Metric for MultiLingual Image Captioning
Yongil Kim
Yerin Hwang
Hyeongu Yun
Seunghyun Yoon
Trung Bui
Kyomin Jung
17
6
0
15 Mar 2023
ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and
  Multilingual Natural Language Generation
ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation
Bang-ju Yang
Fenglin Liu
Yuexian Zou
Xian Wu
Yaowei Wang
David A. Clifton
23
8
0
11 Mar 2023
CroCoSum: A Benchmark Dataset for Cross-Lingual Code-Switched
  Summarization
CroCoSum: A Benchmark Dataset for Cross-Lingual Code-Switched Summarization
Ruochen Zhang
Carsten Eickhoff
40
5
0
07 Mar 2023
Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!
Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!
Shiwei Liu
Tianlong Chen
Zhenyu (Allen) Zhang
Xuxi Chen
Tianjin Huang
Ajay Jaiswal
Zhangyang Wang
19
29
0
03 Mar 2023
A Persian Benchmark for Joint Intent Detection and Slot Filling
A Persian Benchmark for Joint Intent Detection and Slot Filling
M. Akbari
Amir-Hossein Karimi
Tayyebeh Saeedi
Zeinab Saeidi
Kiana Ghezelbash
Fatemeh Shamsezat
Mohammad Akbari
Ali Mohades
15
3
0
01 Mar 2023
Evaluating and Improving the Coreference Capabilities of Machine
  Translation Models
Evaluating and Improving the Coreference Capabilities of Machine Translation Models
Asaf Yehudai
Arie Cattan
Omri Abend
Gabriel Stanovsky
LRM
ELM
22
2
0
16 Feb 2023
READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input
  Noises
READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises
Chenglei Si
Zhengyan Zhang
Yingfa Chen
Xiaozhi Wang
Zhiyuan Liu
Maosong Sun
AAML
19
1
0
14 Feb 2023
Training-free Lexical Backdoor Attacks on Language Models
Training-free Lexical Backdoor Attacks on Language Models
Yujin Huang
Terry Yue Zhuo
Qiongkai Xu
Han Hu
Xingliang Yuan
Chunyang Chen
SILM
17
42
0
08 Feb 2023
TransFool: An Adversarial Attack against Neural Machine Translation
  Models
TransFool: An Adversarial Attack against Neural Machine Translation Models
Sahar Sadrizadeh
Ljiljana Dolamic
P. Frossard
SILM
AAML
27
12
0
02 Feb 2023
LoRaLay: A Multilingual and Multimodal Dataset for Long Range and
  Layout-Aware Summarization
LoRaLay: A Multilingual and Multimodal Dataset for Long Range and Layout-Aware Summarization
Laura Nguyen
Thomas Scialom
Benjamin Piwowarski
Jacopo Staiano
19
7
0
26 Jan 2023
Cross-lingual Argument Mining in the Medical Domain
Cross-lingual Argument Mining in the Medical Domain
Anar Yeginbergenova
Rodrigo Agerri
42
7
0
25 Jan 2023
SWING: Balancing Coverage and Faithfulness for Dialogue Summarization
SWING: Balancing Coverage and Faithfulness for Dialogue Summarization
Kung-Hsiang Huang
Siffi Singh
Xiaofei Ma
Wei Xiao
Wei Xiao
Nicholas Dingwall
William Yang Wang
Kathleen McKeown
HILM
21
13
0
25 Jan 2023
Adapting Multilingual Speech Representation Model for a New,
  Underresourced Language through Multilingual Fine-tuning and Continued
  Pretraining
Adapting Multilingual Speech Representation Model for a New, Underresourced Language through Multilingual Fine-tuning and Continued Pretraining
Karol Nowakowski
M. Ptaszynski
Kyoko Murasaki
Jagna Nieuwazny
15
23
0
18 Jan 2023
EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records
EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records
Gyubok Lee
Hyeonji Hwang
Seongsu Bae
Yeonsu Kwon
W. Shin
Seongjun Yang
Minjoon Seo
Jong-Yeup Kim
E. Choi
13
18
0
16 Jan 2023
Extrinsic Evaluation of Machine Translation Metrics
Extrinsic Evaluation of Machine Translation Metrics
Nikita Moghe
Tom Sherborne
Mark Steedman
Alexandra Birch
ELM
18
17
0
20 Dec 2022
Memory-efficient NLLB-200: Language-specific Expert Pruning of a
  Massively Multilingual Machine Translation Model
Memory-efficient NLLB-200: Language-specific Expert Pruning of a Massively Multilingual Machine Translation Model
Yeskendir Koishekenov
Alexandre Berard
Vassilina Nikoulina
MoE
22
28
0
19 Dec 2022
SegAugment: Maximizing the Utility of Speech Translation Data with
  Segmentation-based Augmentations
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
Ioannis Tsiamas
José A. R. Fonollosa
Marta R. Costa-jussá
31
6
0
19 Dec 2022
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units
H. Inaguma
Sravya Popuri
Ilia Kulikov
Peng-Jen Chen
Changhan Wang
Yu-An Chung
Yun Tang
Ann Lee
Shinji Watanabe
J. Pino
43
51
0
15 Dec 2022
Fixing MoE Over-Fitting on Low-Resource Languages in Multilingual
  Machine Translation
Fixing MoE Over-Fitting on Low-Resource Languages in Multilingual Machine Translation
Maha Elbayad
Anna Y. Sun
Shruti Bhosale
MoE
38
8
0
15 Dec 2022
DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding
DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding
Jianhao Yan
Jin Xu
Fandong Meng
Jie Zhou
Yue Zhang
16
3
0
08 Dec 2022
Findings of the WMT 2022 Shared Task on Translation Suggestion
Findings of the WMT 2022 Shared Task on Translation Suggestion
Zhen Yang
Fandong Meng
Yingxue Zhang
Ernan Li
Jie Zhou
LRM
22
2
0
30 Nov 2022
Extending the Subwording Model of Multilingual Pretrained Models for New
  Languages
Extending the Subwording Model of Multilingual Pretrained Models for New Languages
K. Imamura
Eiichiro Sumita
VLM
6
3
0
29 Nov 2022
TSMind: Alibaba and Soochow University's Submission to the WMT22
  Translation Suggestion Task
TSMind: Alibaba and Soochow University's Submission to the WMT22 Translation Suggestion Task
Xin Ge
Ke Min Wang
Jiayi Wang
Nini Xiao
Xiangyu Duan
Yu Zhao
Yuqi Zhang
14
2
0
16 Nov 2022
A Benchmark and Dataset for Post-OCR text correction in Sanskrit
A Benchmark and Dataset for Post-OCR text correction in Sanskrit
Ayush Maheshwari
Nikhil Singh
Amrith Krishna
Ganesh Ramakrishnan
13
12
0
15 Nov 2022
Hierarchical Phrase-based Sequence-to-Sequence Learning
Hierarchical Phrase-based Sequence-to-Sequence Learning
Bailin Wang
Ivan Titov
Jacob Andreas
Yoon Kim
17
7
0
15 Nov 2022
Efficient Speech Translation with Pre-trained Models
Efficient Speech Translation with Pre-trained Models
Zhaolin Li
J. Niehues
11
2
0
09 Nov 2022
Continual Learning of Neural Machine Translation within Low Forgetting
  Risk Regions
Continual Learning of Neural Machine Translation within Low Forgetting Risk Regions
Shuhao Gu
Bojie Hu
Yang Feng
CLL
23
11
0
03 Nov 2022
Robust Domain Adaptation for Pre-trained Multilingual Neural Machine
  Translation Models
Robust Domain Adaptation for Pre-trained Multilingual Neural Machine Translation Models
Mathieu Grosso
Pirashanth Ratnamogan
Alexis Mathey
William Vanhuffel
Michael Fotso Fotso
17
3
0
26 Oct 2022
LANS: Large-scale Arabic News Summarization Corpus
LANS: Large-scale Arabic News Summarization Corpus
Abdulaziz Alhamadani
Xuchao Zhang
Jianfeng He
Chang-Tien Lu
16
2
0
24 Oct 2022
RuCoLA: Russian Corpus of Linguistic Acceptability
RuCoLA: Russian Corpus of Linguistic Acceptability
Vladislav Mikhailov
T. Shamardina
Max Ryabinin
A. Pestova
I. Smurov
Ekaterina Artemova
17
28
0
23 Oct 2022
Model and Data Transfer for Cross-Lingual Sequence Labelling in
  Zero-Resource Settings
Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings
Iker García-Ferrero
Rodrigo Agerri
German Rigau
61
21
0
23 Oct 2022
SIT at MixMT 2022: Fluent Translation Built on Giant Pre-trained Models
SIT at MixMT 2022: Fluent Translation Built on Giant Pre-trained Models
A. Khan
Hrishikesh Kanade
G. Budhrani
Preet Jhanglani
Jia Xu
71
2
0
21 Oct 2022
Retrofitting Multilingual Sentence Embeddings with Abstract Meaning
  Representation
Retrofitting Multilingual Sentence Embeddings with Abstract Meaning Representation
Deng Cai
Xin Li
Jackie Chun-Sing Ho
Lidong Bing
W. Lam
10
6
0
18 Oct 2022
RedApt: An Adaptor for wav2vec 2 Encoding \\ Faster and Smaller Speech
  Translation without Quality Compromise
RedApt: An Adaptor for wav2vec 2 Encoding \\ Faster and Smaller Speech Translation without Quality Compromise
Jinming Zhao
Haomiao Yang
Gholamreza Haffari
Ehsan Shareghi
VLM
11
2
0
16 Oct 2022
COFFEE: Counterfactual Fairness for Personalized Text Generation in
  Explainable Recommendation
COFFEE: Counterfactual Fairness for Personalized Text Generation in Explainable Recommendation
Nan Wang
Qifan Wang
Yi-Chia Wang
Maziar Sanjabi
Jingzhou Liu
Hamed Firooz
Hongning Wang
Shaoliang Nie
28
6
0
14 Oct 2022
Investigating Massive Multilingual Pre-Trained Machine Translation
  Models for Clinical Domain via Transfer Learning
Investigating Massive Multilingual Pre-Trained Machine Translation Models for Clinical Domain via Transfer Learning
Lifeng Han
G. Erofeev
Irina Sorokina
Serge Gladkoff
Goran Nenadic
20
8
0
12 Oct 2022
Language-Family Adapters for Low-Resource Multilingual Neural Machine
  Translation
Language-Family Adapters for Low-Resource Multilingual Neural Machine Translation
Alexandra Chronopoulou
Dario Stojanovski
Alexander M. Fraser
21
17
0
30 Sep 2022
Embarrassingly Easy Document-Level MT Metrics: How to Convert Any
  Pretrained Metric Into a Document-Level Metric
Embarrassingly Easy Document-Level MT Metrics: How to Convert Any Pretrained Metric Into a Document-Level Metric
Giorgos Vernikos
Brian Thompson
Prashant Mathur
Marcello Federico
36
40
0
27 Sep 2022
Cem Mil Podcasts: A Spoken Portuguese Document Corpus For Multi-modal,
  Multi-lingual and Multi-Dialect Information Access Research
Cem Mil Podcasts: A Spoken Portuguese Document Corpus For Multi-modal, Multi-lingual and Multi-Dialect Information Access Research
Ekaterina Garmash
Edgar Tanaka
Ann Clifton
Joana Correia
Sharmistha Jat
Winstead Zhu
R. Jones
Jussi Karlgren
6
3
0
23 Sep 2022
The first neural machine translation system for the Erzya language
The first neural machine translation system for the Erzya language
David Dale
63
7
0
19 Sep 2022
Rethinking Round-Trip Translation for Machine Translation Evaluation
Rethinking Round-Trip Translation for Machine Translation Evaluation
Terry Yue Zhuo
Qiongkai Xu
Xuanli He
Trevor Cohn
LRM
22
2
0
15 Sep 2022
Previous
12345
Next