ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.07416
  4. Cited By
Tensor2Tensor for Neural Machine Translation

Tensor2Tensor for Neural Machine Translation

16 March 2018
Ashish Vaswani
Samy Bengio
E. Brevdo
François Chollet
Aidan Gomez
Stephan Gouws
Llion Jones
Lukasz Kaiser
Nal Kalchbrenner
Niki Parmar
Ryan Sepassi
Noam M. Shazeer
Jakob Uszkoreit
ArXiv (abs)PDFHTML

Papers citing "Tensor2Tensor for Neural Machine Translation"

50 / 264 papers shown
Title
Cued@wmt19:ewc&lms
Cued@wmt19:ewc&lmsConference on Machine Translation (WMT), 2019
Felix Stahlberg
Danielle Saunders
Adria de Gispert
Bill Byrne
141
13
0
11 Jun 2019
A Focus on Neural Machine Translation for African Languages
A Focus on Neural Machine Translation for African Languages
Laura Martinus
Jade Z. Abbott
139
41
0
11 Jun 2019
Parallel Scheduled Sampling
Parallel Scheduled Sampling
Daniel Duckworth
Arvind Neelakantan
Ben Goodrich
Lukasz Kaiser
Samy Bengio
172
24
0
11 Jun 2019
Improving Neural Language Modeling via Adversarial Training
Improving Neural Language Modeling via Adversarial TrainingInternational Conference on Machine Learning (ICML), 2019
Dilin Wang
Chengyue Gong
Qiang Liu
AAML
270
122
0
10 Jun 2019
Analyzing the Structure of Attention in a Transformer Language Model
Analyzing the Structure of Attention in a Transformer Language Model
Jesse Vig
Yonatan Belinkov
215
421
0
07 Jun 2019
Learning Deep Transformer Models for Machine Translation
Learning Deep Transformer Models for Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Qiang Wang
Bei Li
Tong Xiao
Jingbo Zhu
Changliang Li
Yang Li
Lidia S. Chao
206
733
0
05 Jun 2019
KERMIT: Generative Insertion-Based Modeling for Sequences
KERMIT: Generative Insertion-Based Modeling for Sequences
William Chan
Nikita Kitaev
Kelvin Guu
Mitchell Stern
Jakob Uszkoreit
VLM
182
65
0
04 Jun 2019
Domain Adaptive Inference for Neural Machine Translation
Domain Adaptive Inference for Neural Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Danielle Saunders
Felix Stahlberg
Adria de Gispert
Bill Byrne
AI4CE
142
30
0
02 Jun 2019
Learning Sparse Networks Using Targeted Dropout
Learning Sparse Networks Using Targeted Dropout
Aidan Gomez
Ivan Zhang
Siddhartha Rao Kamalakara
Divyam Madaan
Kevin Swersky
Y. Gal
Geoffrey E. Hinton
339
99
0
31 May 2019
Assessing The Factual Accuracy of Generated Text
Assessing The Factual Accuracy of Generated TextKnowledge Discovery and Data Mining (KDD), 2019
Ben Goodrich
Vinay Rao
Mohammad Saleh
Peter J. Liu
HILM
293
205
0
30 May 2019
Unsupervised Paraphrasing without Translation
Unsupervised Paraphrasing without TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Aurko Roy
David Grangier
BDLLRM
174
65
0
29 May 2019
Matrix-Free Preconditioning in Online Learning
Matrix-Free Preconditioning in Online LearningInternational Conference on Machine Learning (ICML), 2019
Ashok Cutkosky
Tamás Sarlós
ODL
153
18
0
29 May 2019
Momentum-Based Variance Reduction in Non-Convex SGD
Momentum-Based Variance Reduction in Non-Convex SGDNeural Information Processing Systems (NeurIPS), 2019
Ashok Cutkosky
Francesco Orabona
ODL
451
477
0
24 May 2019
Deep Learning in Alzheimer's disease: Diagnostic Classification and
  Prognostic Prediction using Neuroimaging Data
Deep Learning in Alzheimer's disease: Diagnostic Classification and Prognostic Prediction using Neuroimaging DataFrontiers in Aging Neuroscience (Front. Aging Neurosci.), 2019
T. Jo
K. Nho
A. Saykin
201
566
0
02 May 2019
Similarity of Neural Network Representations Revisited
Similarity of Neural Network Representations RevisitedInternational Conference on Machine Learning (ICML), 2019
Simon Kornblith
Mohammad Norouzi
Honglak Lee
Geoffrey E. Hinton
1.2K
1,732
0
01 May 2019
BERTScore: Evaluating Text Generation with BERT
BERTScore: Evaluating Text Generation with BERT
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
2.0K
7,367
0
21 Apr 2019
Visualizing Attention in Transformer-Based Language Representation
  Models
Visualizing Attention in Transformer-Based Language Representation Models
Jesse Vig
MILM
131
23
0
04 Apr 2019
A Learned Representation for Scalable Vector Graphics
A Learned Representation for Scalable Vector Graphics
Raphael Gontijo-Lopes
David R Ha
Douglas Eck
Jonathon Shlens
GANOCL
160
134
0
04 Apr 2019
Consistency by Agreement in Zero-shot Neural Machine Translation
Consistency by Agreement in Zero-shot Neural Machine Translation
Maruan Al-Shedivat
Ankur P. Parikh
219
57
0
04 Apr 2019
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
Myle Ott
Sergey Edunov
Alexei Baevski
Angela Fan
Sam Gross
Nathan Ng
David Grangier
Michael Auli
VLMFaML
519
3,315
0
01 Apr 2019
Neural Grammatical Error Correction with Finite State Transducers
Neural Grammatical Error Correction with Finite State Transducers
Felix Stahlberg
Christopher Bryant
Bill Byrne
154
30
0
25 Mar 2019
Neutron: An Implementation of the Transformer Translation Model and its
  Variants
Neutron: An Implementation of the Transformer Translation Model and its Variants
Hongfei Xu
Qiuhui Liu
184
19
0
18 Mar 2019
VideoFlow: A Conditional Flow-Based Model for Stochastic Video
  Generation
VideoFlow: A Conditional Flow-Based Model for Stochastic Video GenerationInternational Conference on Learning Representations (ICLR), 2019
Manoj Kumar
Mohammad Babaeizadeh
D. Erhan
Chelsea Finn
Sergey Levine
Laurent Dinh
Durk Kingma
VGen
218
137
0
04 Mar 2019
Speeding up Deep Learning with Transient Servers
Speeding up Deep Learning with Transient ServersInternational Conference on Automation and Computing (ICAC), 2019
Shijian Li
R. Walls
Lijie Xu
Tian Guo
188
13
0
28 Feb 2019
Extreme Tensoring for Low-Memory Preconditioning
Extreme Tensoring for Low-Memory PreconditioningInternational Conference on Learning Representations (ICLR), 2019
Xinyi Chen
Naman Agarwal
Elad Hazan
Cyril Zhang
Yi Zhang
172
11
0
12 Feb 2019
Insertion Transformer: Flexible Sequence Generation via Insertion
  Operations
Insertion Transformer: Flexible Sequence Generation via Insertion OperationsInternational Conference on Machine Learning (ICML), 2019
Mitchell Stern
William Chan
J. Kiros
Jakob Uszkoreit
KELM
217
262
0
08 Feb 2019
Memory-Efficient Adaptive Optimization
Memory-Efficient Adaptive OptimizationNeural Information Processing Systems (NeurIPS), 2019
Rohan Anil
Vineet Gupta
Tomer Koren
Y. Singer
ODL
163
49
0
30 Jan 2019
The Evolved Transformer
The Evolved TransformerInternational Conference on Machine Learning (ICML), 2019
David R. So
Chen Liang
Quoc V. Le
ViT
489
487
0
30 Jan 2019
Semantic Redundancies in Image-Classification Datasets: The 10% You
  Don't Need
Semantic Redundancies in Image-Classification Datasets: The 10% You Don't Need
Vighnesh Birodkar
H. Mobahi
Samy Bengio
142
88
0
29 Jan 2019
Context in Neural Machine Translation: A Review of Models and
  Evaluations
Context in Neural Machine Translation: A Review of Models and Evaluations
Andrei Popescu-Belis
MedIm
146
32
0
25 Jan 2019
Preventing Posterior Collapse with delta-VAEs
Preventing Posterior Collapse with delta-VAEs
Ali Razavi
Aaron van den Oord
Ben Poole
Oriol Vinyals
DRL
243
180
0
10 Jan 2019
Sentence-wise Smooth Regularization for Sequence to Sequence Learning
Sentence-wise Smooth Regularization for Sequence to Sequence Learning
Chengyue Gong
Xu Tan
Di He
Tao Qin
AI4TS
153
8
0
12 Dec 2018
Bayesian Layers: A Module for Neural Network Uncertainty
Bayesian Layers: A Module for Neural Network Uncertainty
Dustin Tran
Michael W. Dusenberry
Mark van der Wilk
Danijar Hafner
UQCVBDL
310
134
0
10 Dec 2018
Generative Adversarial Network based Speaker Adaptation for High
  Fidelity WaveNet Vocoder
Generative Adversarial Network based Speaker Adaptation for High Fidelity WaveNet Vocoder
Qiao Tian
Bing Yang
Shan Liu
GAN
133
9
0
06 Dec 2018
Attending to Mathematical Language with Transformers
Attending to Mathematical Language with Transformers
A. Wangperawong
192
23
0
05 Dec 2018
Generating High Fidelity Images with Subscale Pixel Networks and
  Multidimensional Upscaling
Generating High Fidelity Images with Subscale Pixel Networks and Multidimensional Upscaling
Jacob Menick
Nal Kalchbrenner
205
155
0
04 Dec 2018
Towards Accurate Generative Models of Video: A New Metric & Challenges
Towards Accurate Generative Models of Video: A New Metric & Challenges
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
EGVMVGen
733
1,015
0
03 Dec 2018
Towards Neural Machine Translation for African Languages
Towards Neural Machine Translation for African Languages
Jade Z. Abbott
Laura Martinus
125
23
0
13 Nov 2018
End-to-End Non-Autoregressive Neural Machine Translation with
  Connectionist Temporal Classification
End-to-End Non-Autoregressive Neural Machine Translation with Connectionist Temporal Classification
Jindrich Libovický
Jindřich Helcl
140
172
0
12 Nov 2018
CUNI System for the WMT18 Multimodal Translation Task
CUNI System for the WMT18 Multimodal Translation Task
Jindřich Helcl
Jindrich Libovický
Dušan Variš
186
61
0
12 Nov 2018
Blockwise Parallel Decoding for Deep Autoregressive Models
Blockwise Parallel Decoding for Deep Autoregressive ModelsNeural Information Processing Systems (NeurIPS), 2018
Mitchell Stern
Noam M. Shazeer
Ashley J. Llorens
283
304
0
07 Nov 2018
Simple, Distributed, and Accelerated Probabilistic Programming
Simple, Distributed, and Accelerated Probabilistic Programming
Like Hui
Matthew Hoffman
Siyuan Ma
Christopher Suter
Srinivas Vasudevan
Alexey Radul
M. Belkin
Rif A. Saurous
BDL
152
59
0
05 Nov 2018
Neural Machine Translation into Language Varieties
Neural Machine Translation into Language Varieties
Surafel Melaku Lakew
A. Erofeeva
Marcello Federico
263
49
0
02 Nov 2018
Machine Translation between Vietnamese and English: an Empirical Study
Machine Translation between Vietnamese and English: an Empirical Study
Hong-Hai Phan-Vu
Viet-Trung Tran
V. Nguyen
Hoang-Vu Dang
Phan-Thuan Do
123
17
0
30 Oct 2018
Parallel Attention Mechanisms in Neural Machine Translation
Parallel Attention Mechanisms in Neural Machine Translation
Julian R. Medina
Jugal Kalita
92
19
0
29 Oct 2018
Deep Transfer Reinforcement Learning for Text Summarization
Deep Transfer Reinforcement Learning for Text Summarization
Yaser Keneshloo
Naren Ramakrishnan
Chandan K. Reddy
218
38
0
15 Oct 2018
Time Reversal as Self-Supervision
Time Reversal as Self-Supervision
Suraj Nair
Mohammad Babaeizadeh
Chelsea Finn
Sergey Levine
Vikash Kumar
SSL
237
12
0
02 Oct 2018
Phrase-Based Attentions
Phrase-Based Attentions
Phi Xuan Nguyen
Shafiq Joty
121
8
0
30 Sep 2018
FRAGE: Frequency-Agnostic Word Representation
FRAGE: Frequency-Agnostic Word Representation
Chengyue Gong
Di He
Xu Tan
Tao Qin
Liwei Wang
Tie-Yan Liu
OOD
186
147
0
18 Sep 2018
Music Transformer
Music Transformer
Cheng-Zhi Anna Huang
Ashish Vaswani
Jakob Uszkoreit
Noam M. Shazeer
Ian Simon
Curtis Hawthorne
Andrew M. Dai
Matthew D. Hoffman
Monica Dinculescu
Douglas Eck
549
546
0
12 Sep 2018
Previous
123456
Next