ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.07416
  4. Cited By
Tensor2Tensor for Neural Machine Translation

Tensor2Tensor for Neural Machine Translation

16 March 2018
Ashish Vaswani
Samy Bengio
E. Brevdo
François Chollet
Aidan Gomez
Stephan Gouws
Llion Jones
Lukasz Kaiser
Nal Kalchbrenner
Niki Parmar
Ryan Sepassi
Noam M. Shazeer
Jakob Uszkoreit
ArXiv (abs)PDFHTML

Papers citing "Tensor2Tensor for Neural Machine Translation"

50 / 264 papers shown
Title
FlauBERT: Unsupervised Language Model Pre-training for French
FlauBERT: Unsupervised Language Model Pre-training for FrenchInternational Conference on Language Resources and Evaluation (LREC), 2019
Hang Le
Loïc Vial
Jibril Frej
Vincent Segonne
Maximin Coavoux
Benjamin Lecouteux
A. Allauzen
Benoît Crabbé
Laurent Besacier
D. Schwab
AI4CE
330
429
0
11 Dec 2019
Neural Machine Translation: A Review and Survey
Neural Machine Translation: A Review and SurveyJournal of Artificial Intelligence Research (JAIR), 2019
Felix Stahlberg
3DVAI4TSMedIm
351
381
0
04 Dec 2019
Neural Academic Paper Generation
Neural Academic Paper Generation
Samet Demir
Uras Mutlu
Özgür Özdemir
69
3
0
02 Dec 2019
Multimodal Machine Translation through Visuals and Speech
Multimodal Machine Translation through Visuals and SpeechMachine Translation (MT), 2019
U. Sulubacak
Ozan Caglayan
Stig-Arne Gronroos
Aku Rouhe
Desmond Elliott
Lucia Specia
Jörg Tiedemann
188
87
0
28 Nov 2019
How to Ask Better Questions? A Large-Scale Multi-Domain Dataset for
  Rewriting Ill-Formed Questions
How to Ask Better Questions? A Large-Scale Multi-Domain Dataset for Rewriting Ill-Formed QuestionsAAAI Conference on Artificial Intelligence (AAAI), 2019
Zewei Chu
Mingda Chen
Jiehua Chen
Miaosen Wang
Kevin Gimpel
Manaal Faruqui
Xiance Si
182
21
0
21 Nov 2019
Neural Duplicate Question Detection without Labeled Training Data
Neural Duplicate Question Detection without Labeled Training DataConference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Andreas Rucklé
N. Moosavi
Iryna Gurevych
OODAAML
189
13
0
13 Nov 2019
Data Efficient Direct Speech-to-Text Translation with Modality Agnostic
  Meta-Learning
Data Efficient Direct Speech-to-Text Translation with Modality Agnostic Meta-Learning
Sathish Indurthi
HyoJung Han
Nikhil Kumar Lakumarapu
Beomseok Lee
Insoo Chung
Sangha Kim
Chanwoo Kim
183
28
0
11 Nov 2019
ConveRT: Efficient and Accurate Conversational Representations from
  Transformers
ConveRT: Efficient and Accurate Conversational Representations from TransformersFindings (Findings), 2019
Matthew Henderson
I. Casanueva
Nikola Mrkvsić
Pei-hao Su
Tsung-Hsien
Ivan Vulić
412
206
0
09 Nov 2019
Lipschitz Constrained Parameter Initialization for Deep Transformers
Lipschitz Constrained Parameter Initialization for Deep TransformersAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Hongfei Xu
Qiuhui Liu
Josef van Genabith
Deyi Xiong
Jingyi Zhang
ODL
258
26
0
08 Nov 2019
Neural Assistant: Joint Action Prediction, Response Generation, and
  Latent Knowledge Reasoning
Neural Assistant: Joint Action Prediction, Response Generation, and Latent Knowledge Reasoning
Arvind Neelakantan
Semih Yavuz
Sharan Narang
V. Prasad
Ben Goodrich
Daniel Duckworth
Chinnadhurai Sankar
Xifeng Yan
123
15
0
31 Oct 2019
Transfer Learning from Transformers to Fake News Challenge Stance
  Detection (FNC-1) Task
Transfer Learning from Transformers to Fake News Challenge Stance Detection (FNC-1) TaskInternational Conference on Language Resources and Evaluation (LREC), 2019
Valeriya Slovikovskaya
112
46
0
31 Oct 2019
Transformer-based Cascaded Multimodal Speech Translation
Transformer-based Cascaded Multimodal Speech TranslationInternational Workshop on Spoken Language Translation (IWSLT), 2019
Zixiu "Alex" Wu
Ozan Caglayan
Julia Ive
Josiah Wang
Lucia Specia
163
7
0
29 Oct 2019
Fast Structured Decoding for Sequence Models
Fast Structured Decoding for Sequence ModelsNeural Information Processing Systems (NeurIPS), 2019
Zhiqing Sun
Zhuohan Li
Haoqing Wang
Zi Lin
Di He
Zhihong Deng
243
128
0
25 Oct 2019
PyTorchPipe: a framework for rapid prototyping of pipelines combining
  language and vision
PyTorchPipe: a framework for rapid prototyping of pipelines combining language and vision
Tomasz Kornuta
115
3
0
18 Oct 2019
Efficiency through Auto-Sizing: Notre Dame NLP's Submission to the WNGT
  2019 Efficiency Task
Efficiency through Auto-Sizing: Notre Dame NLP's Submission to the WNGT 2019 Efficiency TaskConference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Kenton W. Murray
Brian DuSell
David Chiang
86
2
0
16 Oct 2019
Transformers without Tears: Improving the Normalization of
  Self-Attention
Transformers without Tears: Improving the Normalization of Self-AttentionInternational Workshop on Spoken Language Translation (IWSLT), 2019
Toan Q. Nguyen
Julian Salazar
251
246
0
14 Oct 2019
HuggingFace's Transformers: State-of-the-art Natural Language Processing
HuggingFace's Transformers: State-of-the-art Natural Language Processing
Thomas Wolf
Lysandre Debut
Victor Sanh
Julien Chaumond
Clement Delangue
...
Teven Le Scao
Sylvain Gugger
Mariama Drame
Quentin Lhoest
Alexander M. Rush
AI4CE
420
3,183
0
09 Oct 2019
A Case Study on Combining ASR and Visual Features for Generating
  Instructional Video Captions
A Case Study on Combining ASR and Visual Features for Generating Instructional Video CaptionsConference on Computational Natural Language Learning (CoNLL), 2019
Jack Hessel
Bo Pang
Zhenhai Zhu
Radu Soricut
165
39
0
07 Oct 2019
Synthetic Data for Deep Learning
Synthetic Data for Deep Learning
Sergey I. Nikolenko
324
409
0
25 Sep 2019
Breaking the Data Barrier: Towards Robust Speech Translation via
  Adversarial Stability Training
Breaking the Data Barrier: Towards Robust Speech Translation via Adversarial Stability TrainingInternational Workshop on Spoken Language Translation (IWSLT), 2019
Qiao Cheng
Meiyuan Fang
Yaqian Han
Jin Huang
Yitao Duan
236
18
0
25 Sep 2019
Hint-Based Training for Non-Autoregressive Machine Translation
Hint-Based Training for Non-Autoregressive Machine TranslationConference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Zhuohan Li
Zi Lin
Di He
Fei Tian
Tao Qin
Liwei Wang
Tie-Yan Liu
170
75
0
15 Sep 2019
NeMo: a toolkit for building AI applications using Neural Modules
NeMo: a toolkit for building AI applications using Neural Modules
Oleksii Kuchaiev
Jason Chun Lok Li
Huyen Nguyen
Oleksii Hrinchuk
Ryan Leary
...
Jack Cook
P. Castonguay
Mariya Popova
Jocelyn Huang
Jonathan M. Cohen
455
351
0
14 Sep 2019
A Universal Parent Model for Low-Resource Neural Machine Translation
  Transfer
A Universal Parent Model for Low-Resource Neural Machine Translation Transfer
Mozhdeh Gheini
Jonathan May
78
22
0
14 Sep 2019
A Comparative Study on Transformer vs RNN in Speech Applications
A Comparative Study on Transformer vs RNN in Speech ApplicationsAutomatic Speech Recognition & Understanding (ASRU), 2019
Shigeki Karita
Nanxin Chen
Tomoki Hayashi
Takaaki Hori
Hirofumi Inaguma
...
Ryuichi Yamamoto
Xiao-fei Wang
Shinji Watanabe
Takenori Yoshimura
Wangyou Zhang
260
778
0
13 Sep 2019
Hierarchical Foresight: Self-Supervised Learning of Long-Horizon Tasks
  via Visual Subgoal Generation
Hierarchical Foresight: Self-Supervised Learning of Long-Horizon Tasks via Visual Subgoal GenerationInternational Conference on Learning Representations (ICLR), 2019
Suraj Nair
Chelsea Finn
VGen
167
143
0
12 Sep 2019
Self-Attentional Models Application in Task-Oriented Dialogue Generation
  Systems
Self-Attentional Models Application in Task-Oriented Dialogue Generation SystemsRecent Advances in Natural Language Processing (RANLP), 2019
Mansour Saffar Mehrjardi
Amine Trabelsi
Osmar R. Zaiane
LRM
84
5
0
11 Sep 2019
Question Generation by Transformers
Question Generation by Transformers
Kettip Kriangchaivech
A. Wangperawong
163
30
0
09 Sep 2019
Enhancing Machine Translation with Dependency-Aware Self-Attention
Enhancing Machine Translation with Dependency-Aware Self-AttentionAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Emanuele Bugliarello
Naoaki Okazaki
175
73
0
06 Sep 2019
Taskmaster-1: Toward a Realistic and Diverse Dialog Dataset
Taskmaster-1: Toward a Realistic and Diverse Dialog DatasetConference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Bill Byrne
Karthikeyan K
Chinnadhurai Sankar
Arvind Neelakantan
Daniel Duckworth
Semih Yavuz
Ben Goodrich
Amit Dubey
A. Cedilnik
Kyu-Young Kim
160
228
0
01 Sep 2019
Maximizing Mutual Information for Tacotron
Maximizing Mutual Information for Tacotron
Peng Liu
Xixin Wu
Shiyin Kang
Guangzhi Li
Jane Polak Scowcroft
Dong Yu
205
16
0
30 Aug 2019
Improving Deep Transformer with Depth-Scaled Initialization and Merged
  Attention
Improving Deep Transformer with Depth-Scaled Initialization and Merged AttentionConference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Biao Zhang
Ivan Titov
Rico Sennrich
167
115
0
29 Aug 2019
On NMT Search Errors and Model Errors: Cat Got Your Tongue?
On NMT Search Errors and Model Errors: Cat Got Your Tongue?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Felix Stahlberg
Bill Byrne
LRM
213
160
0
27 Aug 2019
Multilingual Neural Machine Translation with Language Clustering
Multilingual Neural Machine Translation with Language ClusteringConference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Xu Tan
Jiale Chen
Di He
Ziheng Lu
Tao Qin
Tie-Yan Liu
379
119
0
25 Aug 2019
Leveraging Sentence Similarity in Natural Language Generation: Improving
  Beam Search using Range Voting
Leveraging Sentence Similarity in Natural Language Generation: Improving Beam Search using Range VotingWorkshop on Neural Generation and Translation (WNGT), 2019
Sebastian Borgeaud
Guy Edward Toh Emerson
170
21
0
17 Aug 2019
Incorporating Word and Subword Units in Unsupervised Machine Translation
  Using Language Model Rescoring
Incorporating Word and Subword Units in Unsupervised Machine Translation Using Language Model RescoringConference on Machine Translation (WMT), 2019
Zihan Liu
Yan Xu
Genta Indra Winata
Pascale Fung
316
22
0
16 Aug 2019
Fast and Accurate Capitalization and Punctuation for Automatic Speech
  Recognition Using Transformer and Chunk Merging
Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk MergingOriental COCOSDA International Conference on Speech Database and Assessments (COCOSDA), 2019
Thai-Binh Nguyen
V. H. Nguyen
Hien Nguyen
Pham Ngoc Phuong
The-Loc Nguyen
Quoc Truong Do
Luong Chi Mai
99
48
0
07 Aug 2019
Predicting Actions to Help Predict Translations
Predicting Actions to Help Predict Translations
Zixiu "Alex" Wu
Julia Ive
Josiah Wang
Pranava Madhyastha
Lucia Specia
167
7
0
05 Aug 2019
DELTA: A DEep learning based Language Technology plAtform
DELTA: A DEep learning based Language Technology plAtform
Kun Han
Junwen Chen
Hui Zhang
Haiyang Xu
Yiping Peng
...
Cheng Gong
Yunbo Wang
Wei Zou
Hui Song
Xiangang Li
VLM
94
10
0
02 Aug 2019
Learning Question-Guided Video Representation for Multi-Turn Video
  Question Answering
Learning Question-Guided Video Representation for Multi-Turn Video Question Answering
Guan-Lin Chao
Abhinav Rastogi
Semih Yavuz
Dilek Z. Hakkani-Tür
Jindong Chen
Ian Lane
79
6
0
31 Jul 2019
English-Czech Systems in WMT19: Document-Level Transformer
English-Czech Systems in WMT19: Document-Level TransformerConference on Machine Translation (WMT), 2019
Martin Popel
Dominik Machácek
Michal Auersperger
Ondrej Bojar
Pavel Pecina
122
22
0
30 Jul 2019
Representation Degeneration Problem in Training Natural Language
  Generation Models
Representation Degeneration Problem in Training Natural Language Generation ModelsInternational Conference on Learning Representations (ICLR), 2019
Jun Gao
Di He
Xu Tan
Tao Qin
Liwei Wang
Tie-Yan Liu
206
305
0
28 Jul 2019
Hindi Visual Genome: A Dataset for Multimodal English-to-Hindi Machine
  Translation
Hindi Visual Genome: A Dataset for Multimodal English-to-Hindi Machine TranslationJournal of Computacion y Sistemas (JCYS), 2019
Shantipriya Parida
Ondrej Bojar
S. Dash
142
65
0
21 Jul 2019
BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional
  Encoder Representations from Transformer
BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from TransformerInterspeech (Interspeech), 2019
Guan-Lin Chao
Ian Lane
153
109
0
05 Jul 2019
The CUED's Grammatical Error Correction Systems for BEA-2019
The CUED's Grammatical Error Correction Systems for BEA-2019
Felix Stahlberg
Bill Byrne
173
8
0
29 Jun 2019
Widening the Representation Bottleneck in Neural Machine Translation
  with Lexical Shortcuts
Widening the Representation Bottleneck in Neural Machine Translation with Lexical ShortcutsConference on Machine Translation (WMT), 2019
Denis Emelin
Ivan Titov
Rico Sennrich
122
10
0
28 Jun 2019
Comparing Semi-Parametric Model Learning Algorithms for Dynamic Model
  Estimation in Robotics
Comparing Semi-Parametric Model Learning Algorithms for Dynamic Model Estimation in Robotics
Sebastian Riedel
F. Stulp
125
7
0
27 Jun 2019
CUNI System for the WMT19 Robustness Task
CUNI System for the WMT19 Robustness TaskConference on Machine Translation (WMT), 2019
Jindřich Helcl
Jindrich Libovický
Martin Popel
111
10
0
21 Jun 2019
Distilling Translations with Visual Awareness
Distilling Translations with Visual AwarenessAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Julia Ive
Pranava Madhyastha
Lucia Specia
VLM
204
85
0
18 Jun 2019
UCAM Biomedical translation at WMT19: Transfer learning multi-domain
  ensembles
UCAM Biomedical translation at WMT19: Transfer learning multi-domain ensemblesConference on Machine Translation (WMT), 2019
Danielle Saunders
Felix Stahlberg
Bill Byrne
MedIm
123
14
0
13 Jun 2019
A Multiscale Visualization of Attention in the Transformer Model
A Multiscale Visualization of Attention in the Transformer ModelAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Jesse Vig
ViT
187
651
0
12 Jun 2019
Previous
123456
Next