2002.06823
Cited By
Incorporating BERT into Neural Machine Translation
International Conference on Learning Representations (ICLR), 2020
17 February 2020
Jinhua Zhu
Ziheng Lu
Lijun Wu
Di He
Tao Qin
Wen-gang Zhou
Houqiang Li
Tie-Yan Liu
FedML
AIMat
Github (362★)
Papers citing "Incorporating BERT into Neural Machine Translation"
50 / 182 papers shown
Title
RE²: Improving Chinese Grammatical Error Correction via Retrieving Appropriate Examples with Explanation
Baoxin Wang
Yumeng Luo
Yixuan Wang
Dayong Wu
Wanxiang Che
Shijin Wang
LRM
84
0
0
30 Sep 2025
A comparison of pipelines for the translation of a low resource language based on transformers
Chiara Bonfanti
Michele Colombino
Giulia Coucourde
Faeze Memari
Stefano Pinardi
Rosa Meo
90
0
0
15 Sep 2025
GIIFT: Graph-guided Inductive Image-free Multimodal Machine Translation
Jiafeng Xiong
Yuting Zhao
217
0
0
24 Jul 2025
PDFMathTranslate: Scientific Document Translation Preserving Layouts
Rongxin Ouyang
Chang Chu
Zhikuang Xin
Xiangyao Ma
185
0
0
02 Jul 2025
Self-supervised Latent Space Optimization with Nebula Variational Coding
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yida Wang
D. Tan
Nassir Navab
Federico Tombari
DRL
SSL
333
1
0
02 Jun 2025
Towards Cultural Bridge by Bahnaric-Vietnamese Translation Using Transfer Learning of Sequence-To-Sequence Pre-training Language Model
Phan Tran Minh Dat
Vo Hoang Nhat Khang
Quan Thanh Tho
186
0
0
16 May 2025
Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation
Biao Zhang
Fedor Moiseev
Joshua Ainslie
Paul Suganthan
Min Ma
Surya Bhupatiraju
Fede Lebron
Orhan Firat
Armand Joulin
Zhe Dong
AI4CE
198
11
0
08 Apr 2025
Exploring Various Sequential Learning Methods for Deformation History Modeling
International Conference on Engineering Applications of Neural Networks (ICEANN), 2025
Muhammed Adil Yatkin
Mihkel Korgesaar
Jani Romanoff
Umit Islak
Hasan Kurban
AI4TS
164
0
0
04 Apr 2025
A Framework for Lightweight Responsible Prompting Recommendation
Tiago Machado
Sara E. Berger
Cassia Sanctos
Vagner Figueiredo de Santana
Lemara Williams
Zhaoqing Wu
140
0
0
29 Mar 2025
MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling
International Conference on Learning Representations (ICLR), 2025
R. Teo
T. Nguyen
MoE
368
5
0
14 Mar 2025
LM2: Large Memory Models
Jikun Kang
Wenqi Wu
Filippos Christianos
Alex J. Chan
Fraser Greenlee
George Thomas
Marvin Purtorab
Andy Toulis
KELM
316
7
0
09 Feb 2025
Self-Evolution Knowledge Distillation for LLM-based Machine Translation
International Conference on Computational Linguistics (COLING), 2024
Yuncheng Song
Liang Ding
Changtong Zan
Shujian Huang
385
0
0
19 Dec 2024
Get Large Language Models Ready to Speak: A Late-fusion Approach for Speech Generation
Maohao Shen
Shun Zhang
Jilong Wu
Zhiping Xiu
Ehab AlBadawy
Yiting Lu
M. Seltzer
Qing He
201
6
0
27 Oct 2024
MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers
International Conference on Computer Aided Design (ICCAD), 2024
Zebin Yang
Renze Chen
Taiqiang Wu
Ngai Wong
Yun Liang
Runsheng Wang
R. Huang
Meng Li
MQ
251
2
0
23 Oct 2024
A Data Selection Approach for Enhancing Low Resource Machine Translation Using Cross-Lingual Sentence Representations
Nidhi Kowtal
Tejas Deshpande
Raviraj Joshi
179
3
0
04 Sep 2024
Hybrid Dynamic Pruning: A Pathway to Efficient Transformer Inference
Ghadeer Jaradat
M. Tolba
Ghada Alsuhli
Hani Saleh
Mahmoud Al-Qutayri
Thanos Stouraitis
Baker Mohammad
134
1
0
17 Jul 2024
Enhancing Low-Resource NMT with a Multilingual Encoder and Knowledge Distillation: A Case Study
Aniruddha Roy
Pretam Ray
Ayush Maheshwari
Sudeshna Sarkar
Pawan Goyal
248
2
0
09 Jul 2024
Uncertainty-Guided Likelihood Tree Search
Julia Grosse
Ruotian Wu
Ahmad Rashid
Cheng Zhang
Philipp Hennig
Pascal Poupart
Agustinus Kristiadi
376
3
0
04 Jul 2024
WeatherFormer: A Pretrained Encoder Model for Learning Robust Weather Representations from Small Datasets
Adib Hasan
Mardavij Roozbehani
M. Dahleh
AI4TS
155
1
0
22 May 2024
A Combination of BERT and Transformer for Vietnamese Spelling Correction
Asian Conference on Intelligent Information and Database Systems (ACIIDS), 2024
Trung Hieu Ngo
Ham Duong Tran
Tin Huynh
Kiem Hoang
175
6
0
04 May 2024
Efficient infusion of self-supervised representations in Automatic Speech Recognition
Darshan Prabhu
Sai Ganesh Mirishkar
Pankaj Wasnik
89
0
0
19 Apr 2024
Select and Reorder: A Novel Approach for Neural Sign Language Production
Harry Walsh
Ben Saunders
Richard Bowden
SLR
169
3
0
17 Apr 2024
HDLdebugger: Streamlining HDL debugging with Large Language Models
ACM Transactions on Design Automation of Electronic Systems (TODAES), 2024
Xufeng Yao
Haoyang Li
T. H. Chan
Wenyi Xiao
Mingxuan Yuan
Yu Huang
Lei Chen
Bei Yu
145
39
0
18 Mar 2024
Thermometer: Towards Universal Calibration for Large Language Models
Maohao Shen
Subhro Das
Kristjan Greenewald
P. Sattigeri
Greg Wornell
Soumya Ghosh
294
23
0
20 Feb 2024
Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation
Aiwei Liu
Haoping Bai
Zhiyun Lu
Xiang Kong
Simon Wang
Jiulong Shan
Mengsi Cao
Lijie Wen
ALM
107
22
0
19 Feb 2024
LLMs for Robotic Object Disambiguation
Connie Jiang
Yiqing Xu
David Hsu
LM&Ro
176
2
0
07 Jan 2024
Enhancing Context Through Contrast
Kshitij Ambilduke
Aneesh Shetye
Diksha Bagade
Rishika Bhagwatkar
Khurshed Fitter
P. Vagdargi
Shital S. Chiddarwar
217
0
0
06 Jan 2024
A Survey of Text Watermarking in the Era of Large Language Models
ACM Computing Surveys (ACM Comput. Surv.), 2023
Aiwei Liu
Leyi Pan
Yijian Lu
Jingjing Li
Xuming Hu
Xi Zhang
Lijie Wen
Irwin King
Hui Xiong
Philip S. Yu
WaLM
319
120
0
13 Dec 2023
Conditional Prompt Tuning for Multimodal Fusion
Ruixia Jiang
Lingbo Liu
Changwen Chen
178
0
0
28 Nov 2023
Interpreting Pretrained Language Models via Concept Bottlenecks
Zhen Tan
Lu Cheng
Song Wang
Yuan Bo
Wenlin Yao
Huan Liu
LRM
214
33
0
08 Nov 2023
Integrating Pre-trained Language Model into Neural Machine Translation
Soon-Jae Hwang
Chang-Sung Jeong
316
2
0
30 Oct 2023
UniMAP: Universal SMILES-Graph Representation Learning
Shikun Feng
Lixin Yang
Wei-Ying Ma
Yanyan Lan
OffRL
178
9
0
22 Oct 2023
On Synthetic Data for Back Translation
Jiahao Xu
Yubin Ruan
Wei Bi
Guoping Huang
Shuming Shi
Lihui Chen
Lemao Liu
105
14
0
20 Oct 2023
Document-Level Language Models for Machine Translation
Conference on Machine Translation (WMT), 2023
Frithjof Petrick
Christian Herold
Pavel Petrushkov
Shahram Khadivi
Hermann Ney
139
14
0
18 Oct 2023
Diversifying Question Generation over Knowledge Base via External Natural Questions
International Conference on Language Resources and Evaluation (LREC), 2023
Shasha Guo
Jing Zhang
Xirui Ke
Cuiping Li
Hong Chen
343
7
0
23 Sep 2023
Sign Language Translation with Iterative Prototype
IEEE International Conference on Computer Vision (ICCV), 2023
Huijie Yao
Wen-gang Zhou
Hao Feng
Hezhen Hu
Hao Zhou
Houqiang Li
SLR
78
26
0
23 Aug 2023
EM-Network: Oracle Guided Self-distillation for Sequence Learning
International Conference on Machine Learning (ICML), 2023
J. Yoon
Sunghwan Ahn
Hyeon Seung Lee
Minchan Kim
Seokhwan Kim
N. Kim
VLM
280
2
0
14 Jun 2023
Neural Machine Translation for the Indigenous Languages of the Americas: An Introduction
Manuel Mager
Rajat Bhatnagar
Graham Neubig
Ngoc Thang Vu
Katharina Kann
166
14
0
11 Jun 2023
Improving Non-autoregressive Translation Quality with Pretrained Language Model, Embedding Distillation and Upsampling Strategy for CTC
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Shensian Syu
Jun Xie
Hung-yi Lee
235
1
0
10 Jun 2023
Improving Language Model Integration for Neural Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Christian Herold
Yingbo Gao
Mohammad Zeineldeen
Hermann Ney
177
3
0
08 Jun 2023
LIC-GAN: Language Information Conditioned Graph Generative GAN Model
Robert Lo
Arnhav Datar
Abishek Sridhar
GAN
164
3
0
02 Jun 2023
GPT4Image: Large Pre-trained Models Help Vision Models Learn Better on Perception Task
The Web Conference (WWW), 2023
Ning Ding
Yehui Tang
Zhongqian Fu
Chaoting Xu
Kai Han
Yunhe Wang
MLLM
VLM
119
2
0
01 Jun 2023
Sequential Integrated Gradients: a simple but effective method for explaining language models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Joseph Enguehard
164
56
0
25 May 2023
Syntactic Knowledge via Graph Attention with BERT in Machine Translation
Yuqian Dai
S. Sharoff
M. Kamps
97
1
0
22 May 2023
Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Hao Fei
Qianfeng Liu
Meishan Zhang
Hao Fei
Tat-Seng Chua
LRM
287
61
0
20 May 2023
Dynamic Transformers Provide a False Sense of Efficiency
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yiming Chen
Simin Chen
Zexin Li
Wei Yang
Cong Liu
R. Tan
Haizhou Li
AAML
186
14
0
20 May 2023
The eBible Corpus: Data and Model Benchmarks for Bible Translation for Low-Resource Languages
Vesa Akerman
David Baines
Damien Daspit
Ulf Hermjakob
Tae Young Jang
Colin Leong
Michael Martin
Joel Mathew
J. Robie
Marcus Schwarting
127
2
0
19 Apr 2023
Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder
Z. Fu
W. Lam
Qian Yu
Anthony Man-Cho So
Shengding Hu
Zhiyuan Liu
Nigel Collier
AuLLM
155
59
0
08 Apr 2023
Document-Level Machine Translation with Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Longyue Wang
Chenyang Lyu
Tianbo Ji
Zhirui Zhang
Dian Yu
Shuming Shi
Zhaopeng Tu
ELM
234
171
0
05 Apr 2023
How to Design Translation Prompts for ChatGPT: An Empirical Study
Yuan Gao
Ruili Wang
Feng Hou
231
66
0
05 Apr 2023