2002.06823
Cited By
Incorporating BERT into Neural Machine Translation
International Conference on Learning Representations (ICLR), 2020
17 February 2020
Jinhua Zhu
Yingce Xia
Lijun Wu
Di He
Tao Qin
Wen-gang Zhou
Houqiang Li
Tie-Yan Liu
FedML
AIMat
Github (362★)
Papers citing
"Incorporating BERT into Neural Machine Translation"
50 / 182 papers shown
Towards Reliable Neural Machine Translation with Consistency-Aware Meta-Learning
AAAI Conference on Artificial Intelligence (AAAI), 2023
Rongxiang Weng
Qiang Wang
Wensen Cheng
Changfeng Zhu
Min Zhang
292
3
0
20 Mar 2023
AMOM: Adaptive Masking over Masking for Conditional Masked Language Model
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yisheng Xiao
Ruiyang Xu
Lijun Wu
Juntao Li
Tao Qin
Tie-Yan Liu
Hao Fei
153
13
0
13 Mar 2023
AccelTran: A Sparsity-Aware Accelerator for Dynamic Inference with Transformers
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE TCAD), 2023
Shikhar Tuli
N. Jha
316
50
0
28 Feb 2023
How to prepare your task head for finetuning
International Conference on Learning Representations (ICLR), 2023
Yi Ren
Shangmin Guo
Wonho Bae
Danica J. Sutherland
135
19
0
11 Feb 2023
Plan-then-Seam: Towards Efficient Table-to-Text Generation
Findings of the Association for Computational Linguistics: EACL (Findings), 2023
Liang Li
Ruiying Geng
Chengyang Fang
Bing Li
Can Ma
Binhua Li
Yongbin Li
LMTD
186
6
0
10 Feb 2023
Better Datastore, Better Translation: Generating Datastores from Pre-Trained Models for Nearest Neural Machine Translation
Jiahuan Li
Shanbo Cheng
Zewei Sun
Mingxuan Wang
Shujian Huang
236
2
0
17 Dec 2022
UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Hirofumi Inaguma
Sravya Popuri
Ilia Kulikov
Peng-Jen Chen
Changhan Wang
Yu-An Chung
Yun Tang
Ann Lee
Shinji Watanabe
J. Pino
320
76
0
15 Dec 2022
Zero-Shot Dynamic Quantization for Transformer Inference
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yousef El-Kurdi
Jerry Quinn
Avirup Sil
MQ
175
1
0
17 Nov 2022
Findings of the Covid-19 MLIA Machine Translation Task
F. Casacuberta
Alexandru Ceausu
K. Choukri
Miltos Deligiannis
Miguel Domingo
...
V. Papavassiliou
Stelios Piperidis
Prokopis Prokopidis
Dimitris Roussis
M. Salah
133
0
0
14 Nov 2022
Mask More and Mask Later: Efficient Pre-training of Masked Language Models by Disentangling the [MASK] Token
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Baohao Liao
David Thulke
Sanjika Hewavitharana
Hermann Ney
Christof Monz
213
9
0
09 Nov 2022
RoChBert: Towards Robust BERT Fine-tuning for Chinese
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Zihan Zhang
Jinfeng Li
Ning Shi
Bo Yuan
Xiangyu Liu
Rong Zhang
Hui Xue
Donghong Sun
Chao Zhang
AAML
154
7
0
28 Oct 2022
Active Countermeasures for Email Fraud
European Symposium on Security and Privacy (Euro S&P), 2022
Wentao Chen
Fuzhou Wang
Matthew Edwards
227
6
0
26 Oct 2022
The Shared Task on Gender Rewriting
Workshop on Arabic Natural Language Processing (WANLP), 2022
Bashar Alhafni
Farah E. Shamout
Houda Bouamor
Ossama Obeid
Sultan Alrowili
...
Mohamed Gabr
Abderrahmane Issam
Abdelrahim Qaddoumi
K. Vijay-Shanker
Mahmoud Zyate
232
2
0
22 Oct 2022
PATS: Sensitivity-aware Noisy Learning for Pretrained Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yupeng Zhang
Hongzhi Zhang
Sirui Wang
Wei Wu
Zhoujun Li
AAML
224
2
0
22 Oct 2022
Diffuser: Efficient Transformers with Multi-hop Attention Diffusion for Long Sequences
AAAI Conference on Artificial Intelligence (AAAI), 2022
Aosong Feng
Irene Li
Yuang Jiang
Rex Ying
207
18
0
21 Oct 2022
A baseline revisited: Pushing the limits of multi-segment models for context-aware translation
Suvodeep Majumder
Stanislas Lauly
Maria Nadejde
Marcello Federico
Georgiana Dinu
163
14
0
19 Oct 2022
Changing the Representation: Examining Language Representation for Neural Sign Language Production
Harry Walsh
Ben Saunders
Richard Bowden
SLR
245
29
0
16 Sep 2022
On the Complementarity between Pre-Training and Random-Initialization for Resource-Rich Machine Translation
International Conference on Computational Linguistics (COLING), 2022
Changtong Zan
Liang Ding
Li Shen
Yu Cao
Weifeng Liu
Dacheng Tao
341
21
0
07 Sep 2022
Not All GPUs Are Created Equal: Characterizing Variability in Large-Scale, Accelerator-Rich Systems
International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2022
Prasoon Sinha
Akhil Guliani
Rutwik Jain
Brandon Tran
Matthew D. Sinclair
Shivaram Venkataraman
191
29
0
23 Aug 2022
Discourse Cohesion Evaluation for Document-Level Neural Machine Translation
Xin Tan
Longyin Zhang
Guodong Zhou
88
2
0
19 Aug 2022
Domain-Specific Text Generation for Machine Translation
Conference of the Association for Machine Translation in the Americas (AMTA), 2022
Yasmin Moslem
Rejwanul Haque
John D. Kelleher
Andy Way
199
24
0
11 Aug 2022
Improving Pre-trained Language Model Fine-tuning with Noise Stability Regularization
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Hang Hua
Xingjian Li
Dejing Dou
Chengzhong Xu
Jiebo Luo
214
17
0
12 Jun 2022
Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Pengzhi Gao
Zhongjun He
Hua Wu
Haifeng Wang
184
15
0
06 Jun 2022
Improving VAE-based Representation Learning
Mingtian Zhang
Tim Z. Xiao
Brooks Paige
David Barber
SSL
DRL
265
12
0
28 May 2022
PERT: A New Solution to Pinyin to Character Conversion Task
Jinghui Xiao
Qun Liu
Xin Jiang
Yuanfeng Xiong
Haiteng Wu
Zhe Zhang
105
2
0
24 May 2022
Artificial intelligence for topic modelling in Hindu philosophy: mapping themes between the Upanishads and the Bhagavad Gita
PLoS ONE (PLoS ONE), 2022
Rohitash Chandra
Mukul Ranjan
AI4CE
251
19
0
23 May 2022
Progressive Class Semantic Matching for Semi-supervised Text Classification
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Hai-Ming Xu
Lingqiao Liu
Ehsan Abbasnejad
VLM
131
12
0
20 May 2022
Controlling Translation Formality Using Pre-trained Multilingual Language Models
International Workshop on Spoken Language Translation (IWSLT), 2022
Elijah Matthew Rippeth
Sweta Agrawal
Marine Carpuat
AI4CE
227
20
0
13 May 2022
Self-paced Multi-grained Cross-modal Interaction Modeling for Referring Expression Comprehension
IEEE Transactions on Image Processing (IEEE TIP), 2022
Peihan Miao
Wei Su
Gaoang Wang
Xuewei Li
Xi Li
ObjD
335
13
0
21 Apr 2022
A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yisheng Xiao
Lijun Wu
Junliang Guo
Juntao Li
Hao Fei
Tao Qin
Tie-Yan Liu
3DV
MedIm
AI4CE
258
112
0
20 Apr 2022
Dynamic Position Encoding for Transformers
International Conference on Computational Linguistics (COLING), 2022
Joyce Zheng
Mehdi Rezagholizadeh
Peyman Passban
110
4
0
18 Apr 2022
Bridging Cross-Lingual Gaps During Leveraging the Multilingual Sequence-to-Sequence Pretraining for Text Generation and Understanding
Changtong Zan
Liang Ding
Li Shen
Yu Cao
Weifeng Liu
Dacheng Tao
LRM
186
8
0
16 Apr 2022
Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Xiangpeng Wei
Heng Yu
Yue Hu
Rongxiang Weng
Weihua Luo
Jun Xie
Rong Jin
CLL
271
26
0
14 Apr 2022
Explore More Guidance: A Task-aware Instruction Network for Sign Language Translation Enhanced with Data Augmentation
Yong Cao
Wei Li
Xianzhi Li
Min Chen
Guan-bin Chen
Long Hu
Zhengdao Li
Kai Hwang
SLR
258
18
0
12 Apr 2022
CipherDAug: Ciphertext based Data Augmentation for Neural Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Nishant Kambhatla
Logan Born
Anoop Sarkar
187
18
0
01 Apr 2022
elBERto: Self-supervised Commonsense Learning for Question Answering
Knowledge-Based Systems (KBS), 2022
Xunlin Zhan
Yuan Li
Xiao Dong
Xiaodan Liang
Zhiting Hu
Lawrence Carin
SSL
RALM
LRM
184
9
0
17 Mar 2022
Universal Conditional Masked Language Pre-training for Neural Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Pengfei Li
Liangyou Li
Meng Zhang
Minghao Wu
Qun Liu
AI4CE
215
31
0
17 Mar 2022
ODE Transformer: An Ordinary Differential Equation-Inspired Model for Sequence Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Bei Li
Quan Du
Tao Zhou
Yi Jing
Shuhan Zhou
Xin Zeng
Tong Xiao
JingBo Zhu
Xuebo Liu
Min Zhang
201
41
0
17 Mar 2022
Understanding and Improving Sequence-to-Sequence Pretraining for Neural Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Wenxuan Wang
Wenxiang Jiao
Yongchang Hao
Xing Wang
Shuming Shi
Zhaopeng Tu
Michael Lyu
AIMat
176
29
0
16 Mar 2022
BERTVision -- A Parameter-Efficient Approach for Question Answering
Siduo Jiang
Cristopher Benge
Will King
98
1
0
24 Feb 2022
Prompt-Learning for Short Text Classification
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Yi Zhu
Xinke Zhou
Jipeng Qiang
Yun Li
Yunhao Yuan
Xindong Wu
VLM
196
55
0
23 Feb 2022
Pre-Trained Language Models for Interactive Decision-Making
Neural Information Processing Systems (NeurIPS), 2022
Shuang Li
Xavier Puig
Chris Paxton
Yilun Du
Clinton Jia Wang
...
Anima Anandkumar
Jacob Andreas
Igor Mordatch
Antonio Torralba
Yuke Zhu
LM&Ro
403
307
0
03 Feb 2022
Neural Grapheme-to-Phoneme Conversion with Pre-trained Grapheme Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Yi Liu
Zhiyuan Guo
Chao-Hong Tan
Ya-Jun Hu
Yuan Jiang
Zhenhua Ling
140
12
0
26 Jan 2022
Pretrained Language Models for Text Generation: A Survey
ACM Computing Surveys (ACM CSUR), 2022
Junyi Li
Tianyi Tang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
AI4CE
519
263
0
14 Jan 2022
Faster Nearest Neighbor Machine Translation
Shuhe Wang
Jiwei Li
Yuxian Meng
Rongbin Ouyang
Guoyin Wang
Xiaoya Li
Tianwei Zhang
Shi Zong
153
12
0
15 Dec 2021
Semi-supervised Domain Adaptive Structure Learning
Can Qin
Lichen Wang
Qianqian Ma
Yu Yin
Huan Wang
Y. Fu
TTA
247
28
0
12 Dec 2021
Simple Contrastive Representation Adversarial Learning for NLP Tasks
Deshui Miao
Jiaqi Zhang
Wenbo Xie
Jian Song
Xin Li
Lijuan Jia
Ning Guo
SSL
158
19
0
26 Nov 2021
Say What? Collaborative Pop Lyric Generation Using Multitask Transfer Learning
International Conference on Human-Agent Interaction (HAI), 2021
Naveen Ram
Tanay Gummadi
Rahul Bhethanabotla
Richard J. Savery
Gil Weinberg
176
9
0
15 Nov 2021
Edge-Cloud Polarization and Collaboration: A Comprehensive Survey for AI
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2021
Jiangchao Yao
Shengyu Zhang
Yang Yao
Feng Wang
Jianxin Ma
...
Kun Kuang
Chao-Xiang Wu
Leilei Gan
Jingren Zhou
Hongxia Yang
388
143
0
11 Nov 2021
Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey
ACM Computing Surveys (CSUR), 2021
Bonan Min
Hayley L Ross
Elior Sulem
Amir Pouran Ben Veyseh
Thien Huu Nguyen
Oscar Sainz
Eneko Agirre
Ilana Heinz
Dan Roth
LM&MA
VLM
AI4CE
435
1,378
0
01 Nov 2021