1803.07416
Tensor2Tensor for Neural Machine Translation
16 March 2018
Ashish Vaswani
Samy Bengio
E. Brevdo
François Chollet
Aidan N. Gomez
Stephan Gouws
Llion Jones
Lukasz Kaiser
Nal Kalchbrenner
Niki Parmar
Ryan Sepassi
Noam M. Shazeer
Jakob Uszkoreit
Papers citing "Tensor2Tensor for Neural Machine Translation"
50 / 261 papers shown
Title
Analyzing the Structure of Attention in a Transformer Language Model
Jesse Vig
Yonatan Belinkov
19
357
0
07 Jun 2019
Learning Deep Transformer Models for Machine Translation
Qiang Wang
Bei Li
Tong Xiao
Jingbo Zhu
Changliang Li
Derek F. Wong
Lidia S. Chao
16
656
0
05 Jun 2019
KERMIT: Generative Insertion-Based Modeling for Sequences
William Chan
Nikita Kitaev
Kelvin Guu
Mitchell Stern
Jakob Uszkoreit
VLM
23
65
0
04 Jun 2019
Domain Adaptive Inference for Neural Machine Translation
Danielle Saunders
Felix Stahlberg
Adria de Gispert
Bill Byrne
AI4CE
11
29
0
02 Jun 2019
Learning Sparse Networks Using Targeted Dropout
Aidan N. Gomez
Ivan Zhang
Siddhartha Rao Kamalakara
Divyam Madaan
Kevin Swersky
Y. Gal
Geoffrey E. Hinton
6
98
0
31 May 2019
Assessing The Factual Accuracy of Generated Text
Ben Goodrich
Vinay Rao
Mohammad Saleh
Peter J. Liu
HILM
22
185
0
30 May 2019
Unsupervised Paraphrasing without Translation
Aurko Roy
David Grangier
BDL
LRM
11
61
0
29 May 2019
Matrix-Free Preconditioning in Online Learning
Ashok Cutkosky
Tamás Sarlós
ODL
19
16
0
29 May 2019
Momentum-Based Variance Reduction in Non-Convex SGD
Ashok Cutkosky
Francesco Orabona
ODL
18
393
0
24 May 2019
Deep Learning in Alzheimer's disease: Diagnostic Classification and Prognostic Prediction using Neuroimaging Data
T. Jo
K. Nho
A. Saykin
22
502
0
02 May 2019
Similarity of Neural Network Representations Revisited
Simon Kornblith
Mohammad Norouzi
Honglak Lee
Geoffrey E. Hinton
29
1,354
0
01 May 2019
BERTScore: Evaluating Text Generation with BERT
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
64
5,500
0
21 Apr 2019
Visualizing Attention in Transformer-Based Language Representation Models
Jesse Vig
MILM
14
21
0
04 Apr 2019
A Learned Representation for Scalable Vector Graphics
Raphael Gontijo-Lopes
David R Ha
Douglas Eck
Jonathon Shlens
GAN
OCL
30
113
0
04 Apr 2019
Consistency by Agreement in Zero-shot Neural Machine Translation
Maruan Al-Shedivat
Ankur P. Parikh
17
57
0
04 Apr 2019
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
Myle Ott
Sergey Edunov
Alexei Baevski
Angela Fan
Sam Gross
Nathan Ng
David Grangier
Michael Auli
VLM
FaML
23
3,124
0
01 Apr 2019
Neural Grammatical Error Correction with Finite State Transducers
Felix Stahlberg
Christopher Bryant
Bill Byrne
22
28
0
25 Mar 2019
Neutron: An Implementation of the Transformer Translation Model and its Variants
Hongfei Xu
Qiuhui Liu
27
19
0
18 Mar 2019
VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation
Manoj Kumar
Mohammad Babaeizadeh
D. Erhan
Chelsea Finn
Sergey Levine
Laurent Dinh
Durk Kingma
VGen
25
131
0
04 Mar 2019
Speeding up Deep Learning with Transient Servers
Shijian Li
R. Walls
Lijie Xu
Tian Guo
11
12
0
28 Feb 2019
Extreme Tensoring for Low-Memory Preconditioning
Xinyi Chen
Naman Agarwal
Elad Hazan
Cyril Zhang
Yi Zhang
17
10
0
12 Feb 2019
Insertion Transformer: Flexible Sequence Generation via Insertion Operations
Mitchell Stern
William Chan
J. Kiros
Jakob Uszkoreit
KELM
20
247
0
08 Feb 2019
Memory-Efficient Adaptive Optimization
Rohan Anil
Vineet Gupta
Tomer Koren
Y. Singer
ODL
11
49
0
30 Jan 2019
The Evolved Transformer
David R. So
Chen Liang
Quoc V. Le
ViT
24
460
0
30 Jan 2019
Semantic Redundancies in Image-Classification Datasets: The 10% You Don't Need
Vighnesh Birodkar
H. Mobahi
Samy Bengio
11
82
0
29 Jan 2019
Context in Neural Machine Translation: A Review of Models and Evaluations
Andrei Popescu-Belis
MedIm
10
28
0
25 Jan 2019
Preventing Posterior Collapse with delta-VAEs
Ali Razavi
Aaron van den Oord
Ben Poole
Oriol Vinyals
DRL
14
167
0
10 Jan 2019
Sentence-wise Smooth Regularization for Sequence to Sequence Learning
Chengyue Gong
Xu Tan
Di He
Tao Qin
AI4TS
21
8
0
12 Dec 2018
Bayesian Layers: A Module for Neural Network Uncertainty
Dustin Tran
Michael W. Dusenberry
Mark van der Wilk
Danijar Hafner
UQCV
BDL
19
120
0
10 Dec 2018
Generative Adversarial Network based Speaker Adaptation for High Fidelity WaveNet Vocoder
Qiao Tian
Bing Yang
Shan Liu
GAN
16
9
0
06 Dec 2018
Attending to Mathematical Language with Transformers
A. Wangperawong
17
22
0
05 Dec 2018
Generating High Fidelity Images with Subscale Pixel Networks and Multidimensional Upscaling
Jacob Menick
Nal Kalchbrenner
17
149
0
04 Dec 2018
Towards Accurate Generative Models of Video: A New Metric & Challenges
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
EGVM
VGen
27
682
0
03 Dec 2018
Towards Neural Machine Translation for African Languages
Jade Z. Abbott
Laura Martinus
13
20
0
13 Nov 2018
End-to-End Non-Autoregressive Neural Machine Translation with Connectionist Temporal Classification
Jindřich Libovický
Jindřich Helcl
11
167
0
12 Nov 2018
CUNI System for the WMT18 Multimodal Translation Task
Jindřich Helcl
Jindřich Libovický
Dušan Variš
11
57
0
12 Nov 2018
Blockwise Parallel Decoding for Deep Autoregressive Models
Mitchell Stern
Noam M. Shazeer
Jakob Uszkoreit
8
213
0
07 Nov 2018
Simple, Distributed, and Accelerated Probabilistic Programming
Dustin Tran
Matthew Hoffman
Dave Moore
Christopher Suter
Srinivas Vasudevan
Alexey Radul
Matthew Johnson
Rif A. Saurous
BDL
14
56
0
05 Nov 2018
Neural Machine Translation into Language Varieties
Surafel Melaku Lakew
A. Erofeeva
Marcello Federico
9
49
0
02 Nov 2018
Machine Translation between Vietnamese and English: an Empirical Study
Hong-Hai Phan-Vu
Viet-Trung Tran
V. Nguyen
Hoang-Vu Dang
Phan-Thuan Do
12
17
0
30 Oct 2018
Parallel Attention Mechanisms in Neural Machine Translation
Julian R. Medina
Jugal Kalita
4
18
0
29 Oct 2018
Deep Transfer Reinforcement Learning for Text Summarization
Yaser Keneshloo
Naren Ramakrishnan
Chandan K. Reddy
24
37
0
15 Oct 2018
Time Reversal as Self-Supervision
Suraj Nair
Mohammad Babaeizadeh
Chelsea Finn
Sergey Levine
Vikash Kumar
SSL
6
12
0
02 Oct 2018
Phrase-Based Attentions
Phi Xuan Nguyen
Shafiq R. Joty
6
8
0
30 Sep 2018
FRAGE: Frequency-Agnostic Word Representation
Chengyue Gong
Di He
Xu Tan
Tao Qin
Liwei Wang
Tie-Yan Liu
OOD
26
144
0
18 Sep 2018
Music Transformer
Cheng-Zhi Anna Huang
Ashish Vaswani
Jakob Uszkoreit
Noam M. Shazeer
Ian Simon
Curtis Hawthorne
Andrew M. Dai
Matthew D. Hoffman
Monica Dinculescu
Douglas Eck
34
470
0
12 Sep 2018
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation
Zhiting Hu
Haoran Shi
Bowen Tan
Wentao Wang
Zichao Yang
...
Zhengzhong Liu
Xiaodan Liang
Wangrong Zhu
Devendra Singh Sachan
Eric P. Xing
VLM
12
56
0
04 Sep 2018
Trivial Transfer Learning for Low-Resource Neural Machine Translation
Tom Kocmi
Ondřej Bojar
11
171
0
02 Sep 2018
Beyond Error Propagation in Neural Machine Translation: Characteristics of Language Also Matter
Lijun Wu
Xu Tan
Di He
Fei Tian
Tao Qin
Jianhuang Lai
Tie-Yan Liu
10
48
0
01 Sep 2018
An Operation Sequence Model for Explainable Neural Machine Translation
Felix Stahlberg
Danielle Saunders
Bill Byrne
LRM
MILM
35
29
0
29 Aug 2018