ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.06748
  4. Cited By
Balancing Training for Multilingual Neural Machine Translation
v1v2v3v4 (latest)

Balancing Training for Multilingual Neural Machine Translation

Annual Meeting of the Association for Computational Linguistics (ACL), 2020
14 April 2020
Xinyi Wang
Yulia Tsvetkov
Graham Neubig
ArXiv (abs)PDFHTML

Papers citing "Balancing Training for Multilingual Neural Machine Translation"

50 / 73 papers shown
Flexing in 73 Languages: A Single Small Model for Multilingual Inflection
Flexing in 73 Languages: A Single Small Model for Multilingual InflectionInternational Conference on Text, Speech and Dialogue (TSD), 2025
Tomáš Sourada
Jana Straková
148
1
0
27 Oct 2025
Using Temperature Sampling to Effectively Train Robot Learning Policies on Imbalanced Datasets
Using Temperature Sampling to Effectively Train Robot Learning Policies on Imbalanced Datasets
Basavasagar Patil
Sydney Belt
Jayjun Lee
Nima Fazeli
Bernadette Bucher
169
0
0
22 Oct 2025
DynamixSFT: Dynamic Mixture Optimization of Instruction Tuning Collections
DynamixSFT: Dynamic Mixture Optimization of Instruction Tuning Collections
Haebin Shin
Lei Ji
Xiao Liu
Zhiwei Yu
Qi Chen
Yeyun Gong
178
2
0
16 Aug 2025
LLaVA-NeuMT: Selective Layer-Neuron Modulation for Efficient Multilingual Multimodal Translation
LLaVA-NeuMT: Selective Layer-Neuron Modulation for Efficient Multilingual Multimodal Translation
Jingxuan Wei
Caijun Jia
Qi Chen
Yujun Cai
Linzhuang Sun
Xiangxiang Zhang
Gaowei Wu
Bihui Yu
205
0
0
25 Jul 2025
HBO: Hierarchical Balancing Optimization for Fine-Tuning Large Language Models
HBO: Hierarchical Balancing Optimization for Fine-Tuning Large Language Models
Weixuan Wang
Minghao Wu
Barry Haddow
Alexandra Birch
616
2
0
18 May 2025
DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization
DRPruning: Efficient Large Language Model Pruning through Distributionally Robust OptimizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Hexuan Deng
Wenxiang Jiao
Xuebo Liu
Min Zhang
Zhaopeng Tu
Zhaopeng Tu
VLM
652
4
0
21 Nov 2024
What is Wrong with Perplexity for Long-context Language Modeling?
What is Wrong with Perplexity for Long-context Language Modeling?International Conference on Learning Representations (ICLR), 2024
Lizhe Fang
Yifei Wang
Zhaoyang Liu
Chenheng Zhang
Stefanie Jegelka
Jinyang Gao
Bolin Ding
Yisen Wang
807
45
0
31 Oct 2024
Optimizing the Training Schedule of Multilingual NMT using Reinforcement Learning
Optimizing the Training Schedule of Multilingual NMT using Reinforcement Learning
Alexis Allemann
Àlex R. Atrio
Andrei Popescu-Belis
392
2
0
08 Oct 2024
Upsample or Upweight? Balanced Training on Heavily Imbalanced Datasets
Upsample or Upweight? Balanced Training on Heavily Imbalanced DatasetsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Tianjian Li
Haoran Xu
Weiting Tan
Kenton Murray
Daniel Khashabi
653
4
0
06 Oct 2024
Can the Variation of Model Weights be used as a Criterion for Self-Paced
  Multilingual NMT?
Can the Variation of Model Weights be used as a Criterion for Self-Paced Multilingual NMT?
Àlex R. Atrio
Alexis Allemann
Ljiljana Dolamic
Andrei Popescu-Belis
392
1
0
05 Oct 2024
NLIP_Lab-IITH Low-Resource MT System for WMT24 Indic MT Shared Task
NLIP_Lab-IITH Low-Resource MT System for WMT24 Indic MT Shared TaskConference on Machine Translation (WMT), 2024
Pramit Sahoo
Maharaj Brahma
Maunendra Sankar Desarkar
162
0
0
04 Oct 2024
Can Optimization Trajectories Explain Multi-Task Transfer?
Can Optimization Trajectories Explain Multi-Task Transfer?
David Mueller
Mark Dredze
Nicholas Andrews
509
2
0
26 Aug 2024
Low-Resource Machine Translation through the Lens of Personalized
  Federated Learning
Low-Resource Machine Translation through the Lens of Personalized Federated LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Viktor Moskvoretskii
N. Tupitsa
Chris Biemann
Samuel Horváth
Eduard A. Gorbunov
Irina Nikishina
FedML
215
1
0
18 Jun 2024
Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large
  Language Models
Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models
Minghao Wu
Thuy-Trang Vu
Zhuang Li
Gholamreza Haffari
239
12
0
13 Jun 2024
To Label or Not to Label: Hybrid Active Learning for Neural Machine
  Translation
To Label or Not to Label: Hybrid Active Learning for Neural Machine TranslationInternational Conference on Computational Linguistics (COLING), 2024
Abdul Hameed Azeemi
I. Qazi
Agha Ali Raza
AI4CE
262
9
0
14 Mar 2024
Aya Model: An Instruction Finetuned Open-Access Multilingual Language
  Model
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model
Ahmet Üstün
Viraat Aryabumi
Zheng-Xin Yong
Wei-Yin Ko
Daniel D'souza
...
Shayne Longpre
Niklas Muennighoff
Marzieh Fadaee
Julia Kreutzer
Sara Hooker
ALMELMSyDaLRM
339
351
0
12 Feb 2024
Order Matters in the Presence of Dataset Imbalance for Multilingual
  Learning
Order Matters in the Presence of Dataset Imbalance for Multilingual Learning
Dami Choi
Derrick Xin
Hamid Dadkhahi
Justin Gilmer
Ankush Garg
Orhan Firat
Chih-Kuan Yeh
Andrew M. Dai
Behrooz Ghorbani
398
8
0
11 Dec 2023
Error Norm Truncation: Robust Training in the Presence of Data Noise for
  Text Generation Models
Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation ModelsInternational Conference on Learning Representations (ICLR), 2023
Tianjian Li
Haoran Xu
Philipp Koehn
Daniel Khashabi
Kenton W. Murray
251
6
0
02 Oct 2023
Neural Machine Translation for the Indigenous Languages of the Americas:
  An Introduction
Neural Machine Translation for the Indigenous Languages of the Americas: An Introduction
Manuel Mager
Rajat Bhatnagar
Graham Neubig
Ngoc Thang Vu
Katharina Kann
221
15
0
11 Jun 2023
Towards Higher Pareto Frontier in Multilingual Machine Translation
Towards Higher Pareto Frontier in Multilingual Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Yi-Chong Huang
Xiaocheng Feng
Xinwei Geng
Baohang Li
Bing Qin
221
17
0
25 May 2023
LIMIT: Language Identification, Misidentification, and Translation using
  Hierarchical Models in 350+ Languages
LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ LanguagesConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
M. Agarwal
Md Mahfuz Ibn Alam
Antonios Anastasopoulos
349
13
0
23 May 2023
A Pretrainer's Guide to Training Data: Measuring the Effects of Data
  Age, Domain Coverage, Quality, & Toxicity
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & ToxicityNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Shayne Longpre
Gregory Yauney
Emily Reif
Katherine Lee
Adam Roberts
...
Denny Zhou
Jason W. Wei
Kevin Robinson
David M. Mimno
Daphne Ippolito
461
226
0
22 May 2023
RECKONING: Reasoning through Dynamic Knowledge Encoding
RECKONING: Reasoning through Dynamic Knowledge EncodingNeural Information Processing Systems (NeurIPS), 2023
Zeming Chen
Gail Weiss
E. Mitchell
Asli Celikyilmaz
Antoine Bosselut
KELMLRM
407
16
0
10 May 2023
Learning Language-Specific Layers for Multilingual Machine Translation
Learning Language-Specific Layers for Multilingual Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Telmo Pires
Robin M. Schmidt
Yi-Hsiu Liao
Stephan Peitz
364
22
0
04 May 2023
UniMax: Fairer and more Effective Language Sampling for Large-Scale
  Multilingual Pretraining
UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual PretrainingInternational Conference on Learning Representations (ICLR), 2023
Hyung Won Chung
Noah Constant
Xavier Garcia
Adam Roberts
Yi Tay
Sharan Narang
Orhan Firat
336
127
0
18 Apr 2023
On the Pareto Front of Multilingual Neural Machine Translation
On the Pareto Front of Multilingual Neural Machine TranslationNeural Information Processing Systems (NeurIPS), 2023
Liang Chen
Shuming Ma
Dongdong Zhang
Furu Wei
Baobao Chang
MoE
408
8
0
06 Apr 2023
Towards Reliable Neural Machine Translation with Consistency-Aware
  Meta-Learning
Towards Reliable Neural Machine Translation with Consistency-Aware Meta-LearningAAAI Conference on Artificial Intelligence (AAAI), 2023
Rongxiang Weng
Qiang Wang
Wensen Cheng
Changfeng Zhu
Min Zhang
366
3
0
20 Mar 2023
Scaling Laws for Multilingual Neural Machine Translation
Scaling Laws for Multilingual Neural Machine TranslationInternational Conference on Machine Learning (ICML), 2023
Patrick Fernandes
Behrooz Ghorbani
Xavier Garcia
Markus Freitag
Orhan Firat
291
37
0
19 Feb 2023
Measuring The Impact Of Programming Language Distribution
Measuring The Impact Of Programming Language DistributionInternational Conference on Machine Learning (ICML), 2023
Gabriel Orlanski
Kefan Xiao
Xavier Garcia
Jeffrey Hui
Joshua Howland
J. Malmaud
Jacob Austin
Rishah Singh
Michele Catasta
513
49
0
03 Feb 2023
Causes and Cures for Interference in Multilingual Translation
Causes and Cures for Interference in Multilingual TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Uri Shaham
Maha Elbayad
Vedanuj Goswami
Omer Levy
Shruti Bhosale
368
32
0
14 Dec 2022
Domain Curricula for Code-Switched MT at MixMT 2022
Domain Curricula for Code-Switched MT at MixMT 2022Conference on Machine Translation (WMT), 2022
Lekan Raheem
Maab Elrashid
193
1
0
31 Oct 2022
Forging Multiple Training Objectives for Pre-trained Language Models via
  Meta-Learning
Forging Multiple Training Objectives for Pre-trained Language Models via Meta-LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Hongqiu Wu
Ruixue Ding
Haizhen Zhao
Boli Chen
Pengjun Xie
Fei Huang
Min Zhang
MoMe
318
8
0
19 Oct 2022
Tencent's Multilingual Machine Translation System for WMT22 Large-Scale
  African Languages
Tencent's Multilingual Machine Translation System for WMT22 Large-Scale African LanguagesConference on Machine Translation (WMT), 2022
Wenxiang Jiao
Zhaopeng Tu
Jiarui Li
Wenxuan Wang
Shu Yang
Shuming Shi
300
16
0
18 Oct 2022
You Can Have Your Data and Balance It Too: Towards Balanced and
  Efficient Multilingual Models
You Can Have Your Data and Balance It Too: Towards Balanced and Efficient Multilingual Models
Tomasz Limisiewicz
Daniel Malkin
Gabriel Stanovsky
219
4
0
13 Oct 2022
Language Tokens: A Frustratingly Simple Approach Improves Zero-Shot
  Performance of Multilingual Translation
Language Tokens: A Frustratingly Simple Approach Improves Zero-Shot Performance of Multilingual TranslationConference of the Association for Machine Translation in the Americas (AMTA), 2022
Muhammad N. ElNokrashy
Amr Hendy
Mohamed Maher
Mohamed Afify
Hany Awadalla
247
2
0
11 Aug 2022
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional
  MoEs
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEsNeural Information Processing Systems (NeurIPS), 2022
Jinguo Zhu
Xizhou Zhu
Wenhai Wang
Xiaohua Wang
Jiaming Song
Xiaogang Wang
Jifeng Dai
MoMeMoE
363
89
0
09 Jun 2022
Multilingual Neural Machine Translation with Deep Encoder and Multiple
  Shallow Decoders
Multilingual Neural Machine Translation with Deep Encoder and Multiple Shallow DecodersConference of the European Chapter of the Association for Computational Linguistics (EACL), 2022
Xiang Kong
Adithya Renduchintala
James Cross
Yuqing Tang
Jiatao Gu
Xian Li
236
32
0
05 Jun 2022
Unifying the Convergences in Multilingual Neural Machine Translation
Unifying the Convergences in Multilingual Neural Machine TranslationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yi-Chong Huang
Xiaocheng Feng
Xinwei Geng
Bing Qin
276
7
0
03 May 2022
Meta Learning for Natural Language Processing: A Survey
Meta Learning for Natural Language Processing: A SurveyNorth American Chapter of the Association for Computational Linguistics (NAACL), 2022
Hung-yi Lee
Shang-Wen Li
Ngoc Thang Vu
450
55
0
03 May 2022
Por Qué Não Utiliser Alla Språk? Mixed Training with Gradient
  Optimization in Few-Shot Cross-Lingual Transfer
Por Qué Não Utiliser Alla Språk? Mixed Training with Gradient Optimization in Few-Shot Cross-Lingual Transfer
Haoran Xu
Kenton W. Murray
217
12
0
29 Apr 2022
PAEG: Phrase-level Adversarial Example Generation for Neural Machine
  Translation
PAEG: Phrase-level Adversarial Example Generation for Neural Machine TranslationInternational Conference on Computational Linguistics (COLING), 2022
Juncheng Wan
Jian Yang
Shuming Ma
Dongdong Zhang
Weinan Zhang
Yong Yu
Zhoujun Li
SILMAAML
329
5
0
06 Jan 2022
Multilingual Machine Translation Systems from Microsoft for WMT21 Shared
  Task
Multilingual Machine Translation Systems from Microsoft for WMT21 Shared TaskConference on Machine Translation (WMT), 2021
Jian Yang
Shuming Ma
Haoyang Huang
Dongdong Zhang
Li Dong
...
Alexandre Muzio
Saksham Singhal
Hany Awadalla
Xia Song
Furu Wei
167
46
0
03 Nov 2021
Tricks for Training Sparse Translation Models
Tricks for Training Sparse Translation Models
Dheeru Dua
Shruti Bhosale
Vedanuj Goswami
James Cross
M. Lewis
Angela Fan
MoE
360
23
0
15 Oct 2021
Multilingual Neural Machine Translation:Can Linguistic Hierarchies Help?
Multilingual Neural Machine Translation:Can Linguistic Hierarchies Help?
Fahimeh Saleh
Wray Buntine
Gholamreza Haffari
Lan Du
220
8
0
15 Oct 2021
Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation
  with Multi-Armed Bandits
Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits
Julia Kreutzer
David Vilar
Artem Sokolov
267
18
0
13 Oct 2021
Sequential Reptile: Inter-Task Gradient Alignment for Multilingual
  Learning
Sequential Reptile: Inter-Task Gradient Alignment for Multilingual Learning
Seanie Lee
Haebeom Lee
Juho Lee
Sung Ju Hwang
MoMeCLL
446
19
0
06 Oct 2021
Improving Multilingual Translation by Representation and Gradient
  Regularization
Improving Multilingual Translation by Representation and Gradient RegularizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Yilin Yang
Akiko Eriguchi
Alexandre Muzio
Prasad Tadepalli
Stefan Lee
Hany Hassan
240
42
0
10 Sep 2021
Distributionally Robust Multilingual Machine Translation
Distributionally Robust Multilingual Machine TranslationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Chunting Zhou
Daniel Levy
Xian Li
Marjan Ghazvininejad
Graham Neubig
261
27
0
09 Sep 2021
Competence-based Curriculum Learning for Multilingual Machine
  Translation
Competence-based Curriculum Learning for Multilingual Machine TranslationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Mingliang Zhang
Fandong Meng
Y. Tong
Jie Zhou
251
18
0
09 Sep 2021
Uncertainty-Aware Balancing for Multilingual and Multi-Domain Neural
  Machine Translation Training
Uncertainty-Aware Balancing for Multilingual and Multi-Domain Neural Machine Translation TrainingConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Minghao Wu
Yitong Li
Meng Zhang
Liangyou Li
Gholamreza Haffari
Qun Liu
236
27
0
06 Sep 2021
12
Next
Page 1 of 2