Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1611.02683
Cited By
Unsupervised Pretraining for Sequence to Sequence Learning
8 November 2016
Prajit Ramachandran
Peter J. Liu
Quoc V. Le
SSL
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Unsupervised Pretraining for Sequence to Sequence Learning"
50 / 53 papers shown
Title
Improving Language Model Integration for Neural Machine Translation
Christian Herold
Yingbo Gao
Mohammad Zeineldeen
Hermann Ney
18
2
0
08 Jun 2023
Uncertainty-DTW for Time Series and Sequences
Lei Wang
Piotr Koniusz
11
33
0
30 Oct 2022
Rethinking Clustering-Based Pseudo-Labeling for Unsupervised Meta-Learning
Xingping Dong
Jianbing Shen
Ling Shao
32
7
0
27 Sep 2022
DBT-DMAE: An Effective Multivariate Time Series Pre-Train Model under Missing Data
Kai Zhang
Qinmin Yang
Chong Li
AI4TS
19
0
0
16 Sep 2022
E2S2: Encoding-Enhanced Sequence-to-Sequence Pretraining for Language Understanding and Generation
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
49
27
0
30 May 2022
Graph Enhanced BERT for Query Understanding
Juanhui Li
Yao Ma
Weizhen Zeng
Suqi Cheng
Jiliang Tang
Shuaiqiang Wang
Dawei Yin
23
7
0
03 Apr 2022
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Andy Zeng
Maria Attarian
Brian Ichter
K. Choromanski
Adrian S. Wong
...
Michael S. Ryoo
Vikas Sindhwani
Johnny Lee
Vincent Vanhoucke
Peter R. Florence
ReLM
LRM
45
572
0
01 Apr 2022
Survey of Low-Resource Machine Translation
Barry Haddow
Rachel Bawden
Antonio Valerio Miceli Barone
Jindvrich Helcl
Alexandra Birch
AIMat
31
148
0
01 Sep 2021
A Survey on Low-Resource Neural Machine Translation
Rui Wang
Xu Tan
Renqian Luo
Tao Qin
Tie-Yan Liu
3DV
33
58
0
09 Jul 2021
Neural Machine Translation for Low-Resource Languages: A Survey
Surangika Ranathunga
E. Lee
Marjana Prifti Skenduli
Ravi Shekhar
Mehreen Alam
Rishemjit Kaur
38
236
0
29 Jun 2021
The NLP Cookbook: Modern Recipes for Transformer based Deep Learning Architectures
Sushant Singh
A. Mahmood
AI4TS
60
92
0
23 Mar 2021
BERT: A Review of Applications in Natural Language Processing and Understanding
M. V. Koroteev
VLM
25
196
0
22 Mar 2021
Code Generation from Natural Language with Less Prior and More Monolingual Data
Sajad Norouzi
Keyi Tang
Yanshuai Cao
12
19
0
01 Jan 2021
Improving Text Generation with Student-Forcing Optimal Transport
Guoyin Wang
Chunyuan Li
Jianqiao Li
Hao Fu
Yuh-Chen Lin
...
Ruiyi Zhang
Wenlin Wang
Dinghan Shen
Qian Yang
Lawrence Carin
OT
30
17
0
12 Oct 2020
Improving the Accuracy of Global Forecasting Models using Time Series Data Augmentation
Kasun Bandara
Hansika Hewamalage
Yuan-Hao Liu
Yanfei Kang
Christoph Bergmeir
AI4TS
18
114
0
06 Aug 2020
Composed Fine-Tuning: Freezing Pre-Trained Denoising Autoencoders for Improved Generalization
Sang Michael Xie
Tengyu Ma
Percy Liang
30
13
0
29 Jun 2020
Recipes for Adapting Pre-trained Monolingual and Multilingual Models to Machine Translation
Asa Cooper Stickland
Xian Li
Marjan Ghazvininejad
28
44
0
30 Apr 2020
Pre-trained Models for Natural Language Processing: A Survey
Xipeng Qiu
Tianxiang Sun
Yige Xu
Yunfan Shao
Ning Dai
Xuanjing Huang
LM&MA
VLM
243
1,452
0
18 Mar 2020
A Survey on Contextual Embeddings
Qi Liu
Matt J. Kusner
Phil Blunsom
225
146
0
16 Mar 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
X. Chu
VLM
AI4TS
AI4CE
22
138
0
18 Feb 2020
Multilingual Denoising Pre-training for Neural Machine Translation
Yinhan Liu
Jiatao Gu
Naman Goyal
Xian Li
Sergey Edunov
Marjan Ghazvininejad
M. Lewis
Luke Zettlemoyer
AI4CE
AIMat
22
1,769
0
22 Jan 2020
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization
Jingqing Zhang
Yao-Min Zhao
Mohammad Saleh
Peter J. Liu
RALM
3DGS
43
2,012
0
18 Dec 2019
FlauBERT: Unsupervised Language Model Pre-training for French
Hang Le
Loïc Vial
Jibril Frej
Vincent Segonne
Maximin Coavoux
Benjamin Lecouteux
A. Allauzen
Benoît Crabbé
Laurent Besacier
D. Schwab
AI4CE
40
395
0
11 Dec 2019
Neural Machine Translation: A Review and Survey
Felix Stahlberg
3DV
AI4TS
MedIm
20
311
0
04 Dec 2019
Pretrained Language Models for Document-Level Neural Machine Translation
Liangyou Li
Xin Jiang
Qun Liu
20
19
0
08 Nov 2019
Domain, Translationese and Noise in Synthetic Data for Neural Machine Translation
Nikolay Bogoychev
Rico Sennrich
16
50
0
06 Nov 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
91
19,440
0
23 Oct 2019
Deep Learning Based Chatbot Models
Richard Csaky
29
46
0
23 Aug 2019
Denoising based Sequence-to-Sequence Pre-training for Text Generation
Liang Wang
Wei-Ye Zhao
Ruoyu Jia
Sujian Li
Jingming Liu
VLM
AI4CE
34
37
0
22 Aug 2019
Encoder-Agnostic Adaptation for Conditional Language Generation
Zachary M. Ziegler
Luke Melas-Kyriazi
Sebastian Gehrmann
Alexander M. Rush
AI4CE
11
57
0
19 Aug 2019
Towards Making the Most of BERT in Neural Machine Translation
Jiacheng Yang
Mingxuan Wang
Hao Zhou
Chengqi Zhao
Yong Yu
Weinan Zhang
Lei Li
CLL
15
156
0
15 Aug 2019
Leveraging Pre-trained Checkpoints for Sequence Generation Tasks
S. Rothe
Shashi Narayan
Aliaksei Severyn
SILM
69
433
0
29 Jul 2019
From Caesar Cipher to Unsupervised Learning: A New Method for Classifier Parameter Estimation
Yu Liu
Li Deng
Jianshu Chen
C. Chen
SSL
26
0
0
06 Jun 2019
Sequence Tagging with Contextual and Non-Contextual Subword Representations: A Multilingual Evaluation
Benjamin Heinzerling
Michael Strube
8
36
0
04 Jun 2019
Domain Adaptation of Neural Machine Translation by Lexicon Induction
Junjie Hu
Mengzhou Xia
Graham Neubig
J. Carbonell
25
74
0
02 Jun 2019
A Generalized Framework of Sequence Generation with Application to Undirected Sequence Models
Elman Mansimov
Alex Jinpeng Wang
Sean Welleck
Kyunghyun Cho
AIMat
25
46
0
29 May 2019
Effective Cross-lingual Transfer of Neural Machine Translation Models without Shared Vocabularies
Yunsu Kim
Yingbo Gao
Hermann Ney
VLM
24
88
0
14 May 2019
Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data
Wei-Ye Zhao
Liang Wang
Kewei Shen
Ruoyu Jia
Jingming Liu
14
210
0
01 Mar 2019
An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models
Alexandra Chronopoulou
Christos Baziotis
Alexandros Potamianos
CLL
20
129
0
27 Feb 2019
Cross-lingual Language Model Pretraining
Guillaume Lample
Alexis Conneau
25
2,710
0
22 Jan 2019
Bi-Directional Differentiable Input Reconstruction for Low-Resource Neural Machine Translation
Xing Niu
Weijia Xu
Marine Carpuat
19
17
0
02 Nov 2018
Unsupervised Learning via Meta-Learning
Kyle Hsu
Sergey Levine
Chelsea Finn
SSL
OffRL
28
229
0
04 Oct 2018
Semi-Supervised Sequence Modeling with Cross-View Training
Kevin Clark
Minh-Thang Luong
Christopher D. Manning
Quoc V. Le
SSL
6
333
0
22 Sep 2018
Dissecting Contextual Word Embeddings: Architecture and Representation
Matthew E. Peters
Mark Neumann
Luke Zettlemoyer
Wen-tau Yih
33
425
0
27 Aug 2018
Hierarchical Neural Story Generation
Angela Fan
M. Lewis
Yann N. Dauphin
DiffM
36
1,584
0
13 May 2018
When and Why are Pre-trained Word Embeddings Useful for Neural Machine Translation?
Ye Qi
Devendra Singh Sachan
Matthieu Felix
Sarguna Padmanabhan
Graham Neubig
38
340
0
17 Apr 2018
Approaching Neural Grammatical Error Correction as a Low-Resource Machine Translation Task
Marcin Junczys-Dowmunt
Roman Grundkiewicz
Shubha Guha
Kenneth Heafield
33
192
0
16 Apr 2018
Learning Longer-term Dependencies in RNNs with Auxiliary Losses
Trieu H. Trinh
Andrew M. Dai
Thang Luong
Quoc V. Le
25
179
0
01 Mar 2018
Effective Use of Bidirectional Language Modeling for Transfer Learning in Biomedical Named Entity Recognition
Devendra Singh Sachan
P. Xie
Mrinmaya Sachan
Eric P. Xing
MedIm
27
56
0
21 Nov 2017
What is the Role of Recurrent Neural Networks (RNNs) in an Image Caption Generator?
Marc Tanti
Albert Gatt
K. Camilleri
21
56
0
07 Aug 2017
1
2
Next