Incorporating BERT into Neural Machine Translation

International Conference on Learning Representations (ICLR), 2020

17 February 2020

Papers citing "Incorporating BERT into Neural Machine Translation"

50 / 182 papers shown

Improving Chinese Grammatical Error Correction via Retrieving Appropriate Examples with Explanation

30 Sep 2025

A comparison of pipelines for the translation of a low resource language based on transformers

15 Sep 2025

GIIFT: Graph-guided Inductive Image-free Multimodal Machine Translation

Jiafeng Xiong

Yuting Zhao

237

24 Jul 2025

PDFMathTranslate: Scientific Document Translation Preserving Layouts

189

02 Jul 2025

Self-supervised Latent Space Optimization with Nebula Variational Coding

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

333

02 Jun 2025

Towards Cultural Bridge by Bahnaric-Vietnamese Translation Using Transfer Learning of Sequence-To-Sequence Pre-training Language Model

Phan Tran Minh Dat

Vo Hoang Nhat Khang

Quan Thanh Tho

202

16 May 2025

Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation

198

08 Apr 2025

Exploring Various Sequential Learning Methods for Deformation History Modeling

International Conference on Engineering Applications of Neural Networks (ICEANN), 2025

168

04 Apr 2025

A Framework for Lightweight Responsible Prompting Recommendation

Tiago Machado

Sara E. Berger

Cassia Sanctos

Vagner Figueiredo de Santana

Lemara Williams

Zhaoqing Wu

144

29 Mar 2025

MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling

International Conference on Learning Representations (ICLR), 2025

R. Teo

T. Nguyen

MoE

372

14 Mar 2025

LM2: Large Memory Models

316

09 Feb 2025

Self-Evolution Knowledge Distillation for LLM-based Machine Translation

International Conference on Computational Linguistics (COLING), 2024

397

19 Dec 2024

Get Large Language Models Ready to Speak: A Late-fusion Approach for Speech Generation

205

27 Oct 2024

MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers

International Conference on Computer Aided Design (ICCAD), 2024

Meng Li

251

23 Oct 2024

A Data Selection Approach for Enhancing Low Resource Machine Translation Using Cross-Lingual Sentence Representations

Nidhi Kowtal

Tejas Deshpande

Raviraj Joshi

191

04 Sep 2024

Hybrid Dynamic Pruning: A Pathway to Efficient Transformer Inference

134

17 Jul 2024

Enhancing Low-Resource NMT with a Multilingual Encoder and Knowledge Distillation: A Case Study

Sudeshna Sarkar

248

09 Jul 2024

Uncertainty-Guided Likelihood Tree Search

376

04 Jul 2024

WeatherFormer: A Pretrained Encoder Model for Learning Robust Weather Representations from Small Datasets

163

22 May 2024

A Combination of BERT and Transformer for Vietnamese Spelling Correction

Asian Conference on Intelligent Information and Database Systems (ACIIDS), 2024

179

04 May 2024

Efficient infusion of self-supervised representations in Automatic Speech Recognition

Darshan Prabhu

Sai Ganesh Mirishkar

Pankaj Wasnik

19 Apr 2024

Select and Reorder: A Novel Approach for Neural Sign Language Production

181

17 Apr 2024

HDLdebugger: Streamlining HDL debugging with Large Language Models

ACM Transactions on Design Automation of Electronic Systems (TODAES), 2024

Mingxuan Yuan

153

18 Mar 2024

Thermometer: Towards Universal Calibration for Large Language Models

298

20 Feb 2024

Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation

Aiwei Liu

Lijie Wen

111

19 Feb 2024

LLMs for Robotic Object Disambiguation

180

07 Jan 2024

Enhancing Context Through Contrast

229

06 Jan 2024

A Survey of Text Watermarking in the Era of Large Language Models

ACM Computing Surveys (ACM Comput. Surv.), 2023

Aiwei Liu

Yijian Lu

Lijie Wen

Irwin King

Hui Xiong

Philip S. Yu

WaLM

319

120

13 Dec 2023

Conditional Prompt Tuning for Multimodal Fusion

Ruixia Jiang

Lingbo Liu

Changwen Chen

178

28 Nov 2023

Interpreting Pretrained Language Models via Concept Bottlenecks

Huan Liu

222

08 Nov 2023

Integrating Pre-trained Language Model into Neural Machine Translation

Soon-Jae Hwang

Chang-Sung Jeong

320

30 Oct 2023

UniMAP: Universal SMILES-Graph Representation Learning

182

22 Oct 2023

On Synthetic Data for Back Translation

Wei Bi

121

20 Oct 2023

Document-Level Language Models for Machine Translation

Conference on Machine Translation (WMT), 2023

143

18 Oct 2023

Diversifying Question Generation over Knowledge Base via External Natural Questions

International Conference on Language Resources and Evaluation (LREC), 2023

347

23 Sep 2023

Sign Language Translation with Iterative Prototype

IEEE International Conference on Computer Vision (ICCV), 2023

Hao Feng

23 Aug 2023

EM-Network: Oracle Guided Self-distillation for Sequence Learning

International Conference on Machine Learning (ICML), 2023

284

14 Jun 2023

Neural Machine Translation for the Indigenous Languages of the Americas: An Introduction

Graham Neubig

170

11 Jun 2023

Improving Non-autoregressive Translation Quality with Pretrained Language Model, Embedding Distillation and Upsampling Strategy for CTC

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

Shensian Syu

Jun Xie

Hung-yi Lee

239

10 Jun 2023

Improving Language Model Integration for Neural Machine Translation

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

189

08 Jun 2023

LIC-GAN: Language Information Conditioned Graph Generative GAN Model

172

02 Jun 2023

GPT4Image: Large Pre-trained Models Help Vision Models Learn Better on Perception Task

The Web Conference (WWW), 2023

123

01 Jun 2023

Sequential Integrated Gradients: a simple but effective method for explaining language models

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Joseph Enguehard

168

25 May 2023

Syntactic Knowledge via Graph Attention with BERT in Machine Translation

Yuqian Dai

S. Sharoff

M. Kamps

101

22 May 2023

Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Hao Fei

291

20 May 2023

Dynamic Transformers Provide a False Sense of Efficiency

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Yiming Chen

Haizhou Li

190

20 May 2023

The eBible Corpus: Data and Model Benchmarks for Bible Translation for Low-Resource Languages

131

19 Apr 2023

Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder

Zhiyuan Liu

167

08 Apr 2023

Document-Level Machine Translation with Large Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Dian Yu

238

171

05 Apr 2023

How to Design Translation Prompts for ChatGPT: An Empirical Study

Yuan Gao

Ruili Wang

Feng Hou

243

05 Apr 2023