ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.06823
  4. Cited By
Incorporating BERT into Neural Machine Translation

Incorporating BERT into Neural Machine Translation

International Conference on Learning Representations (ICLR), 2020
17 February 2020
Jinhua Zhu
Ziheng Lu
Lijun Wu
Di He
Tao Qin
Wen-gang Zhou
Houqiang Li
Tie-Yan Liu
    FedMLAIMat
ArXiv (abs)PDFHTMLGithub (362★)

Papers citing "Incorporating BERT into Neural Machine Translation"

50 / 182 papers shown
RE$^2$: Improving Chinese Grammatical Error Correction via Retrieving Appropriate Examples with Explanation
RE2^22: Improving Chinese Grammatical Error Correction via Retrieving Appropriate Examples with Explanation
Baoxin Wang
Yumeng Luo
Yixuan Wang
Dayong Wu
Wanxiang Che
Shijin Wang
LRM
88
0
0
30 Sep 2025
A comparison of pipelines for the translation of a low resource language based on transformers
A comparison of pipelines for the translation of a low resource language based on transformers
Chiara Bonfanti
Michele Colombino
Giulia Coucourde
Faeze Memari
Stefano Pinardi
Rosa Meo
94
0
0
15 Sep 2025
GIIFT: Graph-guided Inductive Image-free Multimodal Machine Translation
GIIFT: Graph-guided Inductive Image-free Multimodal Machine Translation
Jiafeng Xiong
Yuting Zhao
237
0
0
24 Jul 2025
PDFMathTranslate: Scientific Document Translation Preserving Layouts
PDFMathTranslate: Scientific Document Translation Preserving Layouts
Rongxin Ouyang
Chang Chu
Zhikuang Xin
Xiangyao Ma
189
0
0
02 Jul 2025
Self-supervised Latent Space Optimization with Nebula Variational Coding
Self-supervised Latent Space Optimization with Nebula Variational CodingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yida Wang
D. Tan
Nassir Navab
Federico Tombari
DRLSSL
333
1
0
02 Jun 2025
Towards Cultural Bridge by Bahnaric-Vietnamese Translation Using Transfer Learning of Sequence-To-Sequence Pre-training Language Model
Towards Cultural Bridge by Bahnaric-Vietnamese Translation Using Transfer Learning of Sequence-To-Sequence Pre-training Language Model
Phan Tran Minh Dat
Vo Hoang Nhat Khang
Quan Thanh Tho
202
0
0
16 May 2025
Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation
Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation
Biao Zhang
Fedor Moiseev
Joshua Ainslie
Paul Suganthan
Min Ma
Surya Bhupatiraju
Fede Lebron
Orhan Firat
Armand Joulin
Zhe Dong
AI4CE
198
11
0
08 Apr 2025
Exploring Various Sequential Learning Methods for Deformation History Modeling
Exploring Various Sequential Learning Methods for Deformation History ModelingInternational Conference on Engineering Applications of Neural Networks (ICEANN), 2025
Muhammed Adil Yatkin
Mihkel Korgesaar
Jani Romanoff
Umit Islak
Hasan Kurban
AI4TS
168
0
0
04 Apr 2025
A Framework for Lightweight Responsible Prompting Recommendation
A Framework for Lightweight Responsible Prompting Recommendation
Tiago Machado
Sara E. Berger
Cassia Sanctos
Vagner Figueiredo de Santana
Lemara Williams
Zhaoqing Wu
144
0
0
29 Mar 2025
MoLEx: Mixture of Layer Experts for Finetuning with Sparse UpcyclingInternational Conference on Learning Representations (ICLR), 2025
R. Teo
T. Nguyen
MoE
372
5
0
14 Mar 2025
LM2: Large Memory Models
LM2: Large Memory Models
Jikun Kang
Wenqi Wu
Filippos Christianos
Alex J. Chan
Fraser Greenlee
George Thomas
Marvin Purtorab
Andy Toulis
KELM
316
7
0
09 Feb 2025
Self-Evolution Knowledge Distillation for LLM-based Machine Translation
Self-Evolution Knowledge Distillation for LLM-based Machine TranslationInternational Conference on Computational Linguistics (COLING), 2024
Yuncheng Song
Liang Ding
Changtong Zan
Shujian Huang
397
0
0
19 Dec 2024
Get Large Language Models Ready to Speak: A Late-fusion Approach for
  Speech Generation
Get Large Language Models Ready to Speak: A Late-fusion Approach for Speech Generation
Maohao Shen
Shun Zhang
Jilong Wu
Zhiping Xiu
Ehab AlBadawy
Yiting Lu
M. Seltzer
Qing He
205
6
0
27 Oct 2024
MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers
MCUBERT: Memory-Efficient BERT Inference on Commodity MicrocontrollersInternational Conference on Computer Aided Design (ICCAD), 2024
Zebin Yang
Renze Chen
Taiqiang Wu
Ngai Wong
Yun Liang
Runsheng Wang
R. Huang
Meng Li
MQ
251
2
0
23 Oct 2024
A Data Selection Approach for Enhancing Low Resource Machine Translation
  Using Cross-Lingual Sentence Representations
A Data Selection Approach for Enhancing Low Resource Machine Translation Using Cross-Lingual Sentence Representations
Nidhi Kowtal
Tejas Deshpande
Raviraj Joshi
191
3
0
04 Sep 2024
Hybrid Dynamic Pruning: A Pathway to Efficient Transformer Inference
Hybrid Dynamic Pruning: A Pathway to Efficient Transformer Inference
Ghadeer Jaradat
M. Tolba
Ghada Alsuhli
Hani Saleh
Mahmoud Al-Qutayri
Thanos Stouraitis
Baker Mohammad
134
1
0
17 Jul 2024
Enhancing Low-Resource NMT with a Multilingual Encoder and Knowledge
  Distillation: A Case Study
Enhancing Low-Resource NMT with a Multilingual Encoder and Knowledge Distillation: A Case Study
Aniruddha Roy
Pretam Ray
Ayush Maheshwari
Sudeshna Sarkar
Pawan Goyal
248
2
0
09 Jul 2024
Uncertainty-Guided Likelihood Tree Search
Uncertainty-Guided Likelihood Tree Search
Julia Grosse
Ruotian Wu
Ahmad Rashid
Cheng Zhang
Philipp Hennig
Pascal Poupart
Agustinus Kristiadi
376
3
0
04 Jul 2024
WeatherFormer: A Pretrained Encoder Model for Learning Robust Weather
  Representations from Small Datasets
WeatherFormer: A Pretrained Encoder Model for Learning Robust Weather Representations from Small Datasets
Adib Hasan
Mardavij Roozbehani
M. Dahleh
AI4TS
163
1
0
22 May 2024
A Combination of BERT and Transformer for Vietnamese Spelling Correction
A Combination of BERT and Transformer for Vietnamese Spelling CorrectionAsian Conference on Intelligent Information and Database Systems (ACIIDS), 2024
Trung Hieu Ngo
Ham Duong Tran
Tin Huynh
Kiem Hoang
179
6
0
04 May 2024
Efficient infusion of self-supervised representations in Automatic
  Speech Recognition
Efficient infusion of self-supervised representations in Automatic Speech Recognition
Darshan Prabhu
Sai Ganesh Mirishkar
Pankaj Wasnik
93
0
0
19 Apr 2024
Select and Reorder: A Novel Approach for Neural Sign Language Production
Select and Reorder: A Novel Approach for Neural Sign Language Production
Harry Walsh
Ben Saunders
Richard Bowden
SLR
181
3
0
17 Apr 2024
HDLdebugger: Streamlining HDL debugging with Large Language Models
HDLdebugger: Streamlining HDL debugging with Large Language ModelsACM Transactions on Design Automation of Electronic Systems (TODAES), 2024
Xufeng Yao
Haoyang Li
T. H. Chan
Wenyi Xiao
Mingxuan Yuan
Yu Huang
Lei Chen
Bei Yu
153
39
0
18 Mar 2024
Thermometer: Towards Universal Calibration for Large Language Models
Thermometer: Towards Universal Calibration for Large Language Models
Maohao Shen
Subhro Das
Kristjan Greenewald
P. Sattigeri
Greg Wornell
Soumya Ghosh
298
23
0
20 Feb 2024
Direct Large Language Model Alignment Through Self-Rewarding Contrastive
  Prompt Distillation
Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation
Aiwei Liu
Haoping Bai
Zhiyun Lu
Xiang Kong
Simon Wang
Jiulong Shan
Mengsi Cao
Lijie Wen
ALM
111
22
0
19 Feb 2024
LLMs for Robotic Object Disambiguation
LLMs for Robotic Object Disambiguation
Connie Jiang
Yiqing Xu
David Hsu
LM&Ro
180
2
0
07 Jan 2024
Enhancing Context Through Contrast
Enhancing Context Through Contrast
Kshitij Ambilduke
Aneesh Shetye
Diksha Bagade
Rishika Bhagwatkar
Khurshed Fitter
P. Vagdargi
Shital S. Chiddarwar
229
0
0
06 Jan 2024
A Survey of Text Watermarking in the Era of Large Language Models
A Survey of Text Watermarking in the Era of Large Language ModelsACM Computing Surveys (ACM Comput. Surv.), 2023
Aiwei Liu
Leyi Pan
Yijian Lu
Jingjing Li
Xuming Hu
Xi Zhang
Lijie Wen
Irwin King
Hui Xiong
Philip S. Yu
WaLM
319
120
0
13 Dec 2023
Conditional Prompt Tuning for Multimodal Fusion
Conditional Prompt Tuning for Multimodal Fusion
Ruixia Jiang
Lingbo Liu
Changwen Chen
178
0
0
28 Nov 2023
Interpreting Pretrained Language Models via Concept Bottlenecks
Interpreting Pretrained Language Models via Concept Bottlenecks
Zhen Tan
Lu Cheng
Song Wang
Yuan Bo
Wenlin Yao
Huan Liu
LRM
222
33
0
08 Nov 2023
Integrating Pre-trained Language Model into Neural Machine Translation
Integrating Pre-trained Language Model into Neural Machine Translation
Soon-Jae Hwang
Chang-Sung Jeong
320
2
0
30 Oct 2023
UniMAP: Universal SMILES-Graph Representation Learning
UniMAP: Universal SMILES-Graph Representation Learning
Shikun Feng
Lixin Yang
Wei-Ying Ma
Yanyan Lan
OffRL
182
9
0
22 Oct 2023
On Synthetic Data for Back Translation
On Synthetic Data for Back Translation
Jiahao Xu
Yubin Ruan
Wei Bi
Guoping Huang
Shuming Shi
Lihui Chen
Lemao Liu
121
14
0
20 Oct 2023
Document-Level Language Models for Machine Translation
Document-Level Language Models for Machine TranslationConference on Machine Translation (WMT), 2023
Frithjof Petrick
Christian Herold
Pavel Petrushkov
Shahram Khadivi
Hermann Ney
143
14
0
18 Oct 2023
Diversifying Question Generation over Knowledge Base via External Natural Questions
Diversifying Question Generation over Knowledge Base via External Natural QuestionsInternational Conference on Language Resources and Evaluation (LREC), 2023
Shasha Guo
Jing Zhang
Xirui Ke
Cuiping Li
Hong Chen
347
7
0
23 Sep 2023
Sign Language Translation with Iterative Prototype
Sign Language Translation with Iterative PrototypeIEEE International Conference on Computer Vision (ICCV), 2023
Huijie Yao
Wen-gang Zhou
Hao Feng
Hezhen Hu
Hao Zhou
Houqiang Li
SLR
82
26
0
23 Aug 2023
EM-Network: Oracle Guided Self-distillation for Sequence Learning
EM-Network: Oracle Guided Self-distillation for Sequence LearningInternational Conference on Machine Learning (ICML), 2023
J. Yoon
Sunghwan Ahn
Hyeon Seung Lee
Minchan Kim
Seokhwan Kim
N. Kim
VLM
284
2
0
14 Jun 2023
Neural Machine Translation for the Indigenous Languages of the Americas:
  An Introduction
Neural Machine Translation for the Indigenous Languages of the Americas: An Introduction
Manuel Mager
Rajat Bhatnagar
Graham Neubig
Ngoc Thang Vu
Katharina Kann
170
14
0
11 Jun 2023
Improving Non-autoregressive Translation Quality with Pretrained
  Language Model, Embedding Distillation and Upsampling Strategy for CTC
Improving Non-autoregressive Translation Quality with Pretrained Language Model, Embedding Distillation and Upsampling Strategy for CTCIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Shensian Syu
Jun Xie
Hung-yi Lee
239
1
0
10 Jun 2023
Improving Language Model Integration for Neural Machine Translation
Improving Language Model Integration for Neural Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Christian Herold
Yingbo Gao
Mohammad Zeineldeen
Hermann Ney
189
3
0
08 Jun 2023
LIC-GAN: Language Information Conditioned Graph Generative GAN Model
LIC-GAN: Language Information Conditioned Graph Generative GAN Model
Robert Lo
Arnhav Datar
Abishek Sridhar
GAN
172
3
0
02 Jun 2023
GPT4Image: Large Pre-trained Models Help Vision Models Learn Better on Perception Task
GPT4Image: Large Pre-trained Models Help Vision Models Learn Better on Perception TaskThe Web Conference (WWW), 2023
Ning Ding
Yehui Tang
Zhongqian Fu
Chaoting Xu
Kai Han
Yunhe Wang
MLLMVLM
123
2
0
01 Jun 2023
Sequential Integrated Gradients: a simple but effective method for
  explaining language models
Sequential Integrated Gradients: a simple but effective method for explaining language modelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Joseph Enguehard
168
56
0
25 May 2023
Syntactic Knowledge via Graph Attention with BERT in Machine Translation
Syntactic Knowledge via Graph Attention with BERT in Machine Translation
Yuqian Dai
S. Sharoff
M. Kamps
101
1
0
22 May 2023
Scene Graph as Pivoting: Inference-time Image-free Unsupervised
  Multimodal Machine Translation with Visual Scene Hallucination
Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene HallucinationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Hao Fei
Qianfeng Liu
Meishan Zhang
Hao Fei
Tat-Seng Chua
LRM
291
61
0
20 May 2023
Dynamic Transformers Provide a False Sense of Efficiency
Dynamic Transformers Provide a False Sense of EfficiencyAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Yiming Chen
Simin Chen
Zexin Li
Wei Yang
Cong Liu
R. Tan
Haizhou Li
AAML
190
14
0
20 May 2023
The eBible Corpus: Data and Model Benchmarks for Bible Translation for
  Low-Resource Languages
The eBible Corpus: Data and Model Benchmarks for Bible Translation for Low-Resource Languages
Vesa Akerman
David Baines
Damien Daspit
Ulf Hermjakob
Tae Young Jang
Colin Leong
Michael Martin
Joel Mathew
J. Robie
Marcus Schwarting
131
2
0
19 Apr 2023
Decoder-Only or Encoder-Decoder? Interpreting Language Model as a
  Regularized Encoder-Decoder
Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder
Z. Fu
W. Lam
Qian Yu
Anthony Man-Cho So
Shengding Hu
Zhiyuan Liu
Nigel Collier
AuLLM
167
59
0
08 Apr 2023
Document-Level Machine Translation with Large Language Models
Document-Level Machine Translation with Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Longyue Wang
Chenyang Lyu
Tianbo Ji
Zhirui Zhang
Dian Yu
Shuming Shi
Zhaopeng Tu
ELM
238
171
0
05 Apr 2023
How to Design Translation Prompts for ChatGPT: An Empirical Study
How to Design Translation Prompts for ChatGPT: An Empirical Study
Yuan Gao
Ruili Wang
Feng Hou
243
66
0
05 Apr 2023
1234
Next