ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.06823
  4. Cited By
Incorporating BERT into Neural Machine Translation

Incorporating BERT into Neural Machine Translation

International Conference on Learning Representations (ICLR), 2020
17 February 2020
Jinhua Zhu
Ziheng Lu
Lijun Wu
Di He
Tao Qin
Wen-gang Zhou
Houqiang Li
Tie-Yan Liu
    FedMLAIMat
ArXiv (abs)PDFHTMLGithub (362★)

Papers citing "Incorporating BERT into Neural Machine Translation"

50 / 182 papers shown
Improving Non-autoregressive Generation with Mixup Training
Improving Non-autoregressive Generation with Mixup Training
Ting Jiang
Shaohan Huang
Zihan Zhang
Deqing Wang
Fuzhen Zhuang
Furu Wei
Haizhen Huang
Liangjie Zhang
Tao Gui
112
9
0
21 Oct 2021
Interpreting Deep Learning Models in Natural Language Processing: A
  Review
Interpreting Deep Learning Models in Natural Language Processing: A Review
Xiaofei Sun
Diyi Yang
Xiaoya Li
Tianwei Zhang
Yuxian Meng
Han Qiu
Guoyin Wang
Eduard H. Hovy
Jiwei Li
210
53
0
20 Oct 2021
Neural Medication Extraction: A Comparison of Recent Models in
  Supervised and Semi-supervised Learning Settings
Neural Medication Extraction: A Comparison of Recent Models in Supervised and Semi-supervised Learning Settings
A. Kocabiyikoglu
François Portet
Raheel Qader
Jean-Marc Babouchkine
MedIm
161
5
0
19 Oct 2021
HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain
  Language Model Compression
HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression
Chenhe Dong
Yaliang Li
Ying Shen
Minghui Qiu
VLM
308
8
0
16 Oct 2021
MSP: Multi-Stage Prompting for Making Pre-trained Language Models Better
  Translators
MSP: Multi-Stage Prompting for Making Pre-trained Language Models Better Translators
Zhixing Tan
Xiangwen Zhang
Shuo Wang
Yang Liu
VLMLRM
510
58
0
13 Oct 2021
On the Complementarity between Pre-Training and Back-Translation for
  Neural Machine Translation
On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation
Xuebo Liu
Longyue Wang
Yang Li
Liang Ding
Lidia S. Chao
Shuming Shi
Zhaopeng Tu
146
29
0
05 Oct 2021
Discovering Drug-Target Interaction Knowledge from Biomedical Literature
Discovering Drug-Target Interaction Knowledge from Biomedical Literature
Yutai Hou
Ziheng Lu
Lijun Wu
Shufang Xie
Yang Fan
Jinhua Zhu
Wanxiang Che
Tao Qin
Tie-Yan Liu
180
16
0
27 Sep 2021
Everything Is All It Takes: A Multipronged Strategy for Zero-Shot
  Cross-Lingual Information Extraction
Everything Is All It Takes: A Multipronged Strategy for Zero-Shot Cross-Lingual Information Extraction
M. Yarmohammadi
Shijie Wu
Marc Marone
Haoran Xu
Seth Ebner
...
Craig Harman
Kenton W. Murray
Aaron Steven White
Mark Dredze
Benjamin Van Durme
166
29
0
14 Sep 2021
Multilingual Translation via Grafting Pre-trained Language Models
Multilingual Translation via Grafting Pre-trained Language Models
Zewei Sun
Mingxuan Wang
Lei Li
AI4CE
421
22
0
11 Sep 2021
BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural
  Machine Translation
BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine TranslationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Haoran Xu
Benjamin Van Durme
Kenton W. Murray
258
73
0
09 Sep 2021
Paraphrase Generation as Unsupervised Machine Translation
Paraphrase Generation as Unsupervised Machine TranslationInternational Conference on Computational Linguistics (COLING), 2021
Xiaofei Sun
Yufei Tian
Yuxian Meng
Nanyun Peng
Leilei Gan
Jiwei Li
Chun Fan
LRM
153
7
0
07 Sep 2021
On the Copying Behaviors of Pre-Training for Neural Machine Translation
On the Copying Behaviors of Pre-Training for Neural Machine TranslationFindings (Findings), 2021
Xuebo Liu
Longyue Wang
Yang Li
Liang Ding
Lidia S. Chao
Shuming Shi
Zhaopeng Tu
146
25
0
17 Jul 2021
Self Training with Ensemble of Teacher Models
Self Training with Ensemble of Teacher Models
Soumyadeep Ghosh
Sanjay Kumar
Janu Verma
Awanish Kumar
120
3
0
17 Jul 2021
Noise Stability Regularization for Improving BERT Fine-tuning
Noise Stability Regularization for Improving BERT Fine-tuningNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Hang Hua
Xingjian Li
Dejing Dou
Chengzhong Xu
Jiebo Luo
187
46
0
10 Jul 2021
A Primer on Pretrained Multilingual Language Models
A Primer on Pretrained Multilingual Language Models
Sumanth Doddapaneni
Gowtham Ramesh
Mitesh M. Khapra
Anoop Kunchukuttan
Pratyush Kumar
LRM
216
86
0
01 Jul 2021
ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin
  Information
ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin InformationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Zijun Sun
Xiaoya Li
Xiaofei Sun
Yuxian Meng
Xiang Ao
Qing He
Leilei Gan
Jiwei Li
SSeg
258
208
0
30 Jun 2021
Neural Machine Translation for Low-Resource Languages: A Survey
Neural Machine Translation for Low-Resource Languages: A SurveyACM Computing Surveys (CSUR), 2021
Surangika Ranathunga
E. Lee
Marjana Prifti Skenduli
Ravi Shekhar
Mehreen Alam
Rishemjit Kaur
320
322
0
29 Jun 2021
R-Drop: Regularized Dropout for Neural Networks
R-Drop: Regularized Dropout for Neural NetworksNeural Information Processing Systems (NeurIPS), 2021
Xiaobo Liang
Lijun Wu
Juntao Li
Yue Wang
Qi Meng
Tao Qin
Wei Chen
Hao Fei
Tie-Yan Liu
298
512
0
28 Jun 2021
Dual-view Molecule Pre-training
Dual-view Molecule Pre-training
Jinhua Zhu
Ziheng Lu
Tao Qin
Wen-gang Zhou
Houqiang Li
Tie-Yan Liu
AI4CE
246
55
0
17 Jun 2021
Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language
  Generation
Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Xin Liu
Baosong Yang
Dayiheng Liu
Haibo Zhang
Weihua Luo
Min Zhang
Haiying Zhang
Jinsong Su
156
19
0
11 Jun 2021
AUGVIC: Exploiting BiText Vicinity for Low-Resource NMT
AUGVIC: Exploiting BiText Vicinity for Low-Resource NMTFindings (Findings), 2021
Tasnim Mohiuddin
M Saiful Bari
Shafiq Joty
189
8
0
09 Jun 2021
Self-supervised and Supervised Joint Training for Resource-rich Machine
  Translation
Self-supervised and Supervised Joint Training for Resource-rich Machine TranslationInternational Conference on Machine Learning (ICML), 2021
Yong Cheng
Wei Wang
Lu Jiang
Wolfgang Macherey
172
18
0
08 Jun 2021
Diverse Pretrained Context Encodings Improve Document Translation
Diverse Pretrained Context Encodings Improve Document TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Domenic Donato
Lei Yu
Chris Dyer
129
16
0
07 Jun 2021
BERTGEN: Multi-task Generation through BERT
BERTGEN: Multi-task Generation through BERTAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Faidon Mitzalis
Ozan Caglayan
Pranava Madhyastha
Lucia Specia
VLM
111
7
0
07 Jun 2021
BERTTune: Fine-Tuning Neural Machine Translation with BERTScore
BERTTune: Fine-Tuning Neural Machine Translation with BERTScoreAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Inigo Jauregi Unanue
Jacob Parnell
Massimo Piccardi
116
35
0
04 Jun 2021
Transfer Learning for Sequence Generation: from Single-source to
  Multi-source
Transfer Learning for Sequence Generation: from Single-source to Multi-sourceAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Xuancheng Huang
Jingfang Xu
Maosong Sun
Yang Liu
123
6
0
31 May 2021
On Compositional Generalization of Neural Machine Translation
On Compositional Generalization of Neural Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Yafu Li
Yongjing Yin
Yulong Chen
Yue Zhang
351
52
0
31 May 2021
Fast Nearest Neighbor Machine Translation
Fast Nearest Neighbor Machine TranslationFindings (Findings), 2021
Yuxian Meng
Xiaoya Li
Xiayu Zheng
Leilei Gan
Xiaofei Sun
Tianwei Zhang
Jiwei Li
LRM
287
52
0
30 May 2021
Good for Misconceived Reasons: An Empirical Revisiting on the Need for
  Visual Context in Multimodal Machine Translation
Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Zhiyong Wu
Lingpeng Kong
W. Bi
Xiang Li
B. Kao
LRM
134
97
0
30 May 2021
Verb Sense Clustering using Contextualized Word Representations for
  Semantic Frame Induction
Verb Sense Clustering using Contextualized Word Representations for Semantic Frame InductionFindings (Findings), 2021
Kosuke Yamada
Ryohei Sasano
Koichi Takeda
100
9
0
27 May 2021
Prevent the Language Model from being Overconfident in Neural Machine
  Translation
Prevent the Language Model from being Overconfident in Neural Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Mengqi Miao
Fandong Meng
Yijin Liu
Xiao-Hua Zhou
Jie Zhou
271
45
0
24 May 2021
MathBERT: A Pre-Trained Model for Mathematical Formula Understanding
MathBERT: A Pre-Trained Model for Mathematical Formula Understanding
Shuai Peng
Ke Yuan
Liangcai Gao
Zhi Tang
AIMat
219
117
0
02 May 2021
Mitigating Political Bias in Language Models Through Reinforced
  Calibration
Mitigating Political Bias in Language Models Through Reinforced CalibrationAAAI Conference on Artificial Intelligence (AAAI), 2021
Ruibo Liu
Chenyan Jia
Jason W. Wei
Guangxuan Xu
Lili Wang
Soroush Vosoughi
175
109
0
30 Apr 2021
MOROCCO: Model Resource Comparison Framework
MOROCCO: Model Resource Comparison Framework
Valentin Malykh
Alexander Kukushkin
Ekaterina Artemova
Vladislav Mikhailov
Maria Tikhonova
Tatiana Shavrina
147
0
0
29 Apr 2021
Zero-shot Cross-lingual Transfer of Neural Machine Translation with
  Multilingual Pretrained Encoders
Zero-shot Cross-lingual Transfer of Neural Machine Translation with Multilingual Pretrained EncodersConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Guanhua Chen
Shuming Ma
Yun-Nung Chen
Li Dong
Dongdong Zhang
Jianxiong Pan
Wenping Wang
Furu Wei
149
41
0
18 Apr 2021
TransVG: End-to-End Visual Grounding with Transformers
TransVG: End-to-End Visual Grounding with TransformersIEEE International Conference on Computer Vision (ICCV), 2021
Jiajun Deng
Zhengyuan Yang
Tianlang Chen
Wen-gang Zhou
Houqiang Li
ViT
612
442
0
17 Apr 2021
Context-Adaptive Document-Level Neural Machine Translation
Context-Adaptive Document-Level Neural Machine TranslationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Linlin Zhang
154
1
0
16 Apr 2021
Reward Optimization for Neural Machine Translation with Learned Metrics
Reward Optimization for Neural Machine Translation with Learned Metrics
Raphael Shu
Kang Min Yoo
Jung-Woo Ha
209
14
0
15 Apr 2021
On the Inductive Bias of Masked Language Modeling: From Statistical to
  Syntactic Dependencies
On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic DependenciesNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Tianyi Zhang
Tatsunori Hashimoto
AI4CE
211
30
0
12 Apr 2021
UniDrop: A Simple yet Effective Technique to Improve Transformer without
  Extra Cost
UniDrop: A Simple yet Effective Technique to Improve Transformer without Extra CostNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Zhen Wu
Lijun Wu
Qi Meng
Ziheng Lu
Shufang Xie
Tao Qin
Xinyu Dai
Tie-Yan Liu
201
25
0
11 Apr 2021
Better Neural Machine Translation by Extracting Linguistic Information
  from BERT
Better Neural Machine Translation by Extracting Linguistic Information from BERTConference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Hassan S. Shavarani
Anoop Sarkar
193
17
0
07 Apr 2021
ODE Transformer: An Ordinary Differential Equation-Inspired Model for
  Neural Machine Translation
ODE Transformer: An Ordinary Differential Equation-Inspired Model for Neural Machine Translation
Bei Li
Quan Du
Tao Zhou
Shuhan Zhou
Xin Zeng
Tong Xiao
Jingbo Zhu
188
23
0
06 Apr 2021
Neural Inverse Text Normalization
Neural Inverse Text NormalizationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Monica Sunkara
Chaitanya P. Shivade
S. Bodapati
Katrin Kirchhoff
205
33
0
12 Feb 2021
Speech Recognition by Simply Fine-tuning BERT
Speech Recognition by Simply Fine-tuning BERTIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Wen-Chin Huang
Chia-Hua Wu
Shang-Bao Luo
Kuan-Yu Chen
Hsin-Min Wang
Tomoki Toda
255
32
0
30 Jan 2021
Natural Language Specification of Reinforcement Learning Policies
  through Differentiable Decision Trees
Natural Language Specification of Reinforcement Learning Policies through Differentiable Decision TreesIEEE Robotics and Automation Letters (RA-L), 2021
Pradyumna Tambwekar
Andrew Silva
N. Gopalan
Matthew C. Gombolay
244
10
0
18 Jan 2021
To Understand Representation of Layer-aware Sequence Encoders as
  Multi-order-graph
To Understand Representation of Layer-aware Sequence Encoders as Multi-order-graph
Sufeng Duan
Hai Zhao
MILM
309
0
0
16 Jan 2021
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Prefix-Tuning: Optimizing Continuous Prompts for GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Xiang Lisa Li
Abigail Z. Jacobs
654
5,179
0
01 Jan 2021
Neural Machine Translation: A Review of Methods, Resources, and Tools
Neural Machine Translation: A Review of Methods, Resources, and ToolsAI Open (AO), 2020
Zhixing Tan
Shuo Wang
Zonghan Yang
Gang Chen
Xuancheng Huang
Maosong Sun
Yang Liu
3DVAI4TS
252
123
0
31 Dec 2020
Dynamic Curriculum Learning for Low-Resource Neural Machine Translation
Dynamic Curriculum Learning for Low-Resource Neural Machine TranslationInternational Conference on Computational Linguistics (COLING), 2020
Chen Xu
Bojie Hu
Yufan Jiang
Kai Feng
Zeyang Wang
Shen Huang
Qi Ju
Tong Xiao
Jingbo Zhu
257
23
0
30 Nov 2020
Empowering Things with Intelligence: A Survey of the Progress,
  Challenges, and Opportunities in Artificial Intelligence of Things
Empowering Things with Intelligence: A Survey of the Progress, Challenges, and Opportunities in Artificial Intelligence of ThingsIEEE Internet of Things Journal (IEEE IoT J.), 2020
Jing Zhang
Dacheng Tao
220
561
0
17 Nov 2020
Previous
1234
Next