ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.07416
  4. Cited By
Tensor2Tensor for Neural Machine Translation

Tensor2Tensor for Neural Machine Translation

16 March 2018
Ashish Vaswani
Samy Bengio
E. Brevdo
François Chollet
Aidan Gomez
Stephan Gouws
Llion Jones
Lukasz Kaiser
Nal Kalchbrenner
Niki Parmar
Ryan Sepassi
Noam M. Shazeer
Jakob Uszkoreit
ArXiv (abs)PDFHTML

Papers citing "Tensor2Tensor for Neural Machine Translation"

50 / 264 papers shown
Title
Review: Deep Learning in Electron Microscopy
Review: Deep Learning in Electron Microscopy
Jeffrey M. Ede
900
88
0
17 Sep 2020
A Computational-Graph Partitioning Method for Training
  Memory-Constrained DNNs
A Computational-Graph Partitioning Method for Training Memory-Constrained DNNs
Fareed Qararyah
Mohamed Wahib
Douga Dikbayir
M. E. Belviranli
Didem Unat
141
10
0
19 Aug 2020
Adaptable Multi-Domain Language Model for Transformer ASR
Adaptable Multi-Domain Language Model for Transformer ASR
Taewoo Lee
Min-Joong Lee
Tae Gyoon Kang
Seokyeong Jung
Minseok Kwon
...
Ho-Gyeong Kim
Jiseung Jeong
Jihyun Lee
Hosik Lee
Y. S. Choi
109
20
0
14 Aug 2020
End-to-End Neural Transformer Based Spoken Language Understanding
End-to-End Neural Transformer Based Spoken Language UnderstandingInterspeech (Interspeech), 2020
Martin H. Radfar
Athanasios Mouchtaris
Siegfried Kunzmann
182
63
0
12 Aug 2020
FastLR: Non-Autoregressive Lipreading Model with Integrate-and-Fire
FastLR: Non-Autoregressive Lipreading Model with Integrate-and-FireACM Multimedia (ACM MM), 2020
Jinglin Liu
Yi Ren
Zhou Zhao
Chen Zhang
Baoxing Huai
Jing Yuan
193
13
0
06 Aug 2020
DeLighT: Deep and Light-weight Transformer
DeLighT: Deep and Light-weight Transformer
Sachin Mehta
Marjan Ghazvininejad
Srini Iyer
Luke Zettlemoyer
Hannaneh Hajishirzi
VLM
219
33
0
03 Aug 2020
Neural Composition: Learning to Generate from Multiple Models
Neural Composition: Learning to Generate from Multiple Models
Denis Filimonov
R. Gadde
Ariya Rastrow
127
3
0
10 Jul 2020
Learning Graph Structure With A Finite-State Automaton Layer
Learning Graph Structure With A Finite-State Automaton LayerNeural Information Processing Systems (NeurIPS), 2020
Daniel D. Johnson
Hugo Larochelle
Daniel Tarlow
GNNAI4CE
139
17
0
09 Jul 2020
Best-First Beam Search
Best-First Beam SearchTransactions of the Association for Computational Linguistics (TACL), 2020
Clara Meister
Tim Vieira
Robert Bamler
360
80
0
08 Jul 2020
Announcing CzEng 2.0 Parallel Corpus with over 2 Gigawords
Announcing CzEng 2.0 Parallel Corpus with over 2 Gigawords
Tom Kocmi
Martin Popel
Ondrej Bojar
130
39
0
06 Jul 2020
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML
  Models: A Survey and Insights
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights
Shail Dave
Riyadh Baghdadi
Tony Nowatzki
Sasikanth Avancha
Aviral Shrivastava
Baoxin Li
267
98
0
02 Jul 2020
UWSpeech: Speech to Speech Translation for Unwritten Languages
UWSpeech: Speech to Speech Translation for Unwritten LanguagesAAAI Conference on Artificial Intelligence (AAAI), 2020
Chen Zhang
Xu Tan
Yi Ren
Tao Qin
Ke-jun Zhang
Tie-Yan Liu
106
65
0
14 Jun 2020
FinEst BERT and CroSloEngual BERT: less is more in multilingual models
FinEst BERT and CroSloEngual BERT: less is more in multilingual modelsWorkshop on Time-Delay Systems (TDS), 2020
Matej Ulvcar
Marko Robnik-Šikonja
107
54
0
14 Jun 2020
Wat zei je? Detecting Out-of-Distribution Translations with Variational
  Transformers
Wat zei je? Detecting Out-of-Distribution Translations with Variational Transformers
Tim Z. Xiao
Aidan Gomez
Y. Gal
UQLM
189
39
0
08 Jun 2020
$O(n)$ Connections are Expressive Enough: Universal Approximability of
  Sparse Transformers
O(n)O(n)O(n) Connections are Expressive Enough: Universal Approximability of Sparse Transformers
Chulhee Yun
Yin-Wen Chang
Srinadh Bhojanapalli
A. S. Rawat
Sashank J. Reddi
Sanjiv Kumar
202
94
0
08 Jun 2020
ELITR Non-Native Speech Translation at IWSLT 2020
ELITR Non-Native Speech Translation at IWSLT 2020
Dominik Machávcek
Jonávs Kratochvíl
Sangeet Sagar
Matúvs vZilinec
Ondrej Bojar
T. Nguyen
Felix Schneider
P. Williams
Yuekun Yao
112
11
0
05 Jun 2020
Applying the Transformer to Character-level Transduction
Applying the Transformer to Character-level Transduction
Shijie Wu
Robert Bamler
Mans Hulden
AI4CE
224
114
0
20 May 2020
It's Easier to Translate out of English than into it: Measuring Neural
  Translation Difficulty by Cross-Mutual Information
It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual InformationAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Emanuele Bugliarello
Sabrina J. Mielke
Antonios Anastasopoulos
Robert Bamler
Naoaki Okazaki
221
28
0
05 May 2020
Successfully Applying the Stabilized Lottery Ticket Hypothesis to the
  Transformer Architecture
Successfully Applying the Stabilized Lottery Ticket Hypothesis to the Transformer ArchitectureAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Christopher Brix
Parnia Bahar
Hermann Ney
223
39
0
04 May 2020
Using Context in Neural Machine Translation Training Objectives
Using Context in Neural Machine Translation Training ObjectivesAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Danielle Saunders
Felix Stahlberg
Bill Byrne
119
20
0
04 May 2020
Monitoring COVID-19 social distancing with person detection and tracking
  via fine-tuned YOLO v3 and Deepsort techniques
Monitoring COVID-19 social distancing with person detection and tracking via fine-tuned YOLO v3 and Deepsort techniques
Narinder Singh Punn
S. K. Sonbhadra
Sonali Agarwal
Gaurav Rai
251
250
0
04 May 2020
Generalized Entropy Regularization or: There's Nothing Special about
  Label Smoothing
Generalized Entropy Regularization or: There's Nothing Special about Label SmoothingAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Clara Meister
Elizabeth Salesky
Robert Bamler
UQCV
163
66
0
02 May 2020
ESPnet-ST: All-in-One Speech Translation Toolkit
ESPnet-ST: All-in-One Speech Translation Toolkit
Hirofumi Inaguma
Shun Kiyono
Kevin Duh
Shigeki Karita
Nelson Yalta
Tomoki Hayashi
Shinji Watanabe
220
172
0
21 Apr 2020
Fast and Accurate Deep Bidirectional Language Representations for
  Unsupervised Learning
Fast and Accurate Deep Bidirectional Language Representations for Unsupervised LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Joongbo Shin
Yoonhyung Lee
Seunghyun Yoon
Kyomin Jung
OOD
141
12
0
17 Apr 2020
Non-Autoregressive Machine Translation with Latent Alignments
Non-Autoregressive Machine Translation with Latent AlignmentsConference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Chitwan Saharia
William Chan
Saurabh Saxena
Mohammad Norouzi
258
163
0
16 Apr 2020
Reducing Gender Bias in Neural Machine Translation as a Domain
  Adaptation Problem
Reducing Gender Bias in Neural Machine Translation as a Domain Adaptation ProblemAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Danielle Saunders
Bill Byrne
AI4CE
235
150
0
09 Apr 2020
Characterizing and Modeling Distributed Training with Transient Cloud
  GPU Servers
Characterizing and Modeling Distributed Training with Transient Cloud GPU ServersIEEE International Conference on Distributed Computing Systems (ICDCS), 2020
Shijian Li
R. Walls
Tian Guo
150
25
0
07 Apr 2020
AR: Auto-Repair the Synthetic Data for Neural Machine Translation
AR: Auto-Repair the Synthetic Data for Neural Machine Translation
Shanbo Cheng
Shaohui Kuang
Rongxiang Weng
Heng Yu
Changfeng Zhu
Weihua Luo
SyDa
157
3
0
05 Apr 2020
On-the-Fly Adaptation of Source Code Models using Meta-Learning
Disha Shrivastava
Hugo Larochelle
Daniel Tarlow
TTA
136
7
0
26 Mar 2020
VisuoSpatial Foresight for Multi-Step, Multi-Task Fabric Manipulation
VisuoSpatial Foresight for Multi-Step, Multi-Task Fabric Manipulation
Ryan Hoque
Daniel Seita
Ashwin Balakrishna
Aditya Ganapathi
A. Tanwani
Nawid Jamali
K. Yamane
Soshi Iba
Ken Goldberg
258
105
0
19 Mar 2020
Capturing document context inside sentence-level neural machine
  translation models with self-training
Capturing document context inside sentence-level neural machine translation models with self-training
Elman Mansimov
Gábor Melis
Lei Yu
129
14
0
11 Mar 2020
Teaching Temporal Logics to Neural Networks
Teaching Temporal Logics to Neural NetworksInternational Conference on Learning Representations (ICLR), 2020
Christopher Hahn
Frederik Schmitt
Jens U. Kreber
M. Rabe
Bernd Finkbeiner
NAI
329
75
0
06 Mar 2020
Train Large, Then Compress: Rethinking Model Size for Efficient Training
  and Inference of Transformers
Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers
Zhuohan Li
Eric Wallace
Sheng Shen
Kevin Lin
Kurt Keutzer
Dan Klein
Joseph E. Gonzalez
279
152
0
26 Feb 2020
Sparse Sinkhorn Attention
Sparse Sinkhorn AttentionInternational Conference on Machine Learning (ICML), 2020
Yi Tay
Dara Bahri
Liu Yang
Donald Metzler
Da-Cheng Juan
175
374
0
26 Feb 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
Xiaowen Chu
VLMAI4TSAI4CE
303
148
0
18 Feb 2020
Controlling Computation versus Quality for Neural Sequence Models
Controlling Computation versus Quality for Neural Sequence Models
Ankur Bapna
N. Arivazhagan
Orhan Firat
205
33
0
17 Feb 2020
Low-Rank Bottleneck in Multi-head Attention Models
Low-Rank Bottleneck in Multi-head Attention ModelsInternational Conference on Machine Learning (ICML), 2020
Srinadh Bhojanapalli
Chulhee Yun
A. S. Rawat
Sashank J. Reddi
Sanjiv Kumar
159
121
0
17 Feb 2020
Stress Test Evaluation of Transformer-based Models in Natural Language
  Understanding Tasks
Stress Test Evaluation of Transformer-based Models in Natural Language Understanding TasksInternational Conference on Language Resources and Evaluation (LREC), 2020
Carlos Aspillaga
Andrés Carvallo
Vladimir Araujo
ELM
164
33
0
14 Feb 2020
On Layer Normalization in the Transformer Architecture
On Layer Normalization in the Transformer ArchitectureInternational Conference on Machine Learning (ICML), 2020
Ruibin Xiong
Yunchang Yang
Di He
Kai Zheng
Shuxin Zheng
Chen Xing
Huishuai Zhang
Yanyan Lan
Liwei Wang
Tie-Yan Liu
AI4CE
372
1,209
0
12 Feb 2020
Towards a Human-like Open-Domain Chatbot
Towards a Human-like Open-Domain Chatbot
Daniel De Freitas
Minh-Thang Luong
David R. So
Jamie Hall
Noah Fiedel
...
Zi Yang
Apoorv Kulshreshtha
Gaurav Nemade
Yifeng Lu
Quoc V. Le
506
991
0
27 Jan 2020
Pre-training via Leveraging Assisting Languages and Data Selection for
  Neural Machine Translation
Pre-training via Leveraging Assisting Languages and Data Selection for Neural Machine Translation
Israfel Salazar
Mary Dabre
Zhuoyuan Mao
Fei Cheng
Sadao Kurohashi
Eiichiro Sumita
128
2
0
23 Jan 2020
Shifted and Squeezed 8-bit Floating Point format for Low-Precision
  Training of Deep Neural Networks
Shifted and Squeezed 8-bit Floating Point format for Low-Precision Training of Deep Neural NetworksInternational Conference on Learning Representations (ICLR), 2020
Léopold Cambier
Anahita Bhiwandiwalla
Ting Gong
M. Nekuii
Oguz H. Elibol
Hanlin Tang
MQ
231
52
0
16 Jan 2020
Learning Accurate Integer Transformer Machine-Translation Models
Learning Accurate Integer Transformer Machine-Translation ModelsSN Computer Science (SN Comput. Sci.), 2020
Ephrem Wu
92
4
0
03 Jan 2020
Learning from Learning Machines: Optimisation, Rules, and Social Norms
Learning from Learning Machines: Optimisation, Rules, and Social Norms
Travis LaCroix
Yoshua Bengio
81
7
0
29 Dec 2019
Synthetic Datasets for Neural Program Synthesis
Synthetic Datasets for Neural Program SynthesisInternational Conference on Learning Representations (ICLR), 2019
Richard Shin
Neel Kant
Kavi Gupta
Christopher M. Bender
Brandon Trabucco
Rishabh Singh
Basel Alomair
NAI
183
46
0
27 Dec 2019
Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures
  Translation
Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures TranslationInternational Conference on Language Resources and Evaluation (LREC), 2019
Israfel Salazar
Mary Dabre
Atsushi Fujita
Sadao Kurohashi
182
6
0
26 Dec 2019
Learning and Evaluating Contextual Embedding of Source Code
Learning and Evaluating Contextual Embedding of Source Code
Aditya Kanade
Petros Maniatis
Gogul Balakrishnan
Kensen Shi
ELM
218
82
0
21 Dec 2019
Measuring Compositional Generalization: A Comprehensive Method on
  Realistic Data
Measuring Compositional Generalization: A Comprehensive Method on Realistic DataInternational Conference on Learning Representations (ICLR), 2019
Daniel Keysers
Nathanael Scharli
Nathan Scales
Hylke Buisman
Daniel Furrer
...
Tibor Tihon
Dmitry Tsarkov
Tianlin Li
Marc van Zee
Olivier Bousquet
CoGe
220
382
0
20 Dec 2019
A Survey on Document-level Neural Machine Translation: Methods and
  Evaluation
A Survey on Document-level Neural Machine Translation: Methods and Evaluation
Sameen Maruf
Fahimeh Saleh
Gholamreza Haffari
AI4TS
175
25
0
18 Dec 2019
In Nomine Function: Naming Functions in Stripped Binaries with Neural
  Networks
In Nomine Function: Naming Functions in Stripped Binaries with Neural Networks
Fiorella Artuso
Giuseppe Antonio Di Luna
Luca Massarelli
Leonardo Querzoni
135
5
0
17 Dec 2019
Previous
123456
Next