Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1803.07416
Cited By
Tensor2Tensor for Neural Machine Translation
16 March 2018
Ashish Vaswani
Samy Bengio
E. Brevdo
François Chollet
Aidan Gomez
Stephan Gouws
Llion Jones
Lukasz Kaiser
Nal Kalchbrenner
Niki Parmar
Ryan Sepassi
Noam M. Shazeer
Jakob Uszkoreit
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Tensor2Tensor for Neural Machine Translation"
50 / 264 papers shown
Title
CrossedWires: A Dataset of Syntactically Equivalent but Semantically Disparate Deep Learning Models
Max Zvyagin
Thomas Brettin
Arvind Ramanathan
Sumit Kumar Jha
104
1
0
29 Aug 2021
YANMTT: Yet Another Neural Machine Translation Toolkit
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Mary Dabre
Eiichiro Sumita
217
14
0
25 Aug 2021
Compositional Generalization in Multilingual Semantic Parsing over Wikidata
Transactions of the Association for Computational Linguistics (TACL), 2021
Ruixiang Cui
Rahul Aralikatte
Heather Lent
Daniel Hershcovich
220
15
0
07 Aug 2021
Residual Tree Aggregation of Layers for Neural Machine Translation
Guoliang Li
Yiyang Li
112
0
0
19 Jul 2021
Neural Machine Translation for Low-Resource Languages: A Survey
ACM Computing Surveys (CSUR), 2021
Surangika Ranathunga
E. Lee
Marjana Prifti Skenduli
Ravi Shekhar
Mehreen Alam
Rishemjit Kaur
316
318
0
29 Jun 2021
A Survey of Transformers
AI Open (AO), 2021
Tianyang Lin
Yuxin Wang
Xiangyang Liu
Xipeng Qiu
ViT
413
1,374
0
08 Jun 2021
Luna: Linear Unified Nested Attention
Neural Information Processing Systems (NeurIPS), 2021
Xuezhe Ma
Xiang Kong
Sinong Wang
Chunting Zhou
Jonathan May
Hao Ma
Luke Zettlemoyer
210
132
0
03 Jun 2021
Transformers are Deep Infinite-Dimensional Non-Mercer Binary Kernel Machines
Matthew A. Wright
Joseph E. Gonzalez
200
24
0
02 Jun 2021
Synthetic Data Generation for Grammatical Error Correction with Tagged Corruption Models
Workshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2021
Felix Stahlberg
Shankar Kumar
SyDa
210
103
0
27 May 2021
TranSmart: A Practical Interactive Machine Translation System
Guoping Huang
Lemao Liu
Xing Wang
Longyue Wang
Huayang Li
Zhaopeng Tu
Chengyang Huang
Shuming Shi
153
36
0
27 May 2021
Rethinking Skip Connection with Layer Normalization in Transformers and ResNets
International Conference on Computational Linguistics (COLING), 2020
Fenglin Liu
Xuancheng Ren
Zhiyuan Zhang
Xu Sun
Yuexian Zou
AI4CE
140
79
0
15 May 2021
Spelling Correction with Denoising Transformer
Alexandr Kuznetsov
Hector Urdiales
117
18
0
12 May 2021
Hierarchical RNNs-Based Transformers MADDPG for Mixed Cooperative-Competitive Environments
Journal of Intelligent & Fuzzy Systems (JIFS), 2021
Xiaolong Wei
Lifang Yang
Xianglin Huang
Gang Cao
Zhulin Tao
Zhengyang Du
Jing An
180
7
0
11 May 2021
EL-Attention: Memory Efficient Lossless Attention for Generation
International Conference on Machine Learning (ICML), 2021
Yu Yan
Jiusheng Chen
Weizhen Qi
Nikhil Bhendawade
Yeyun Gong
Nan Duan
Ruofei Zhang
VLM
158
9
0
11 May 2021
Billion-scale Pre-trained E-commerce Product Knowledge Graph Model
IEEE International Conference on Data Engineering (ICDE), 2021
Wen Zhang
Chi-Man Wong
Ganqiang Ye
Bo Wen
Wei Zhang
Huajun Chen
201
25
0
02 May 2021
A Simple and Effective Positional Encoding for Transformers
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Pu-Chin Chen
Henry Tsai
Srinadh Bhojanapalli
Hyung Won Chung
Yin-Wen Chang
Chun-Sung Ferng
240
74
0
18 Apr 2021
Sync-Switch: Hybrid Parameter Synchronization for Distributed Deep Learning
IEEE International Conference on Distributed Computing Systems (ICDCS), 2021
Shijian Li
Oren Mangoubi
Lijie Xu
Tian Guo
222
22
0
16 Apr 2021
Counter-Interference Adapter for Multilingual Machine Translation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Yaoming Zhu
Jiangtao Feng
Chengqi Zhao
Mingxuan Wang
Lei Li
252
69
0
16 Apr 2021
First the worst: Finding better gender translations during beam search
Findings (Findings), 2021
D. Saunders
Rosie Sallis
Bill Byrne
181
33
0
15 Apr 2021
WHOSe Heritage: Classification of UNESCO World Heritage "Outstanding Universal Value" Documents with Soft Labels
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Nan Bai
Renqian Luo
Pirouz Nourian
A. Roders
124
7
0
12 Apr 2021
Extended Parallel Corpus for Amharic-English Machine Translation
International Conference on Language Resources and Evaluation (LREC), 2021
A. Gezmu
A. Nürnberger
T. Bati
245
19
0
08 Apr 2021
Sample size estimation for comparing dynamic treatment regimens in a SMART: a Monte Carlo-based approach and case study with longitudinal overdispersed count outcomes
Statistical Methods in Medical Research (Stat Med), 2021
Jamie Yap
John J. Dziak
David Kabiito
Claire Babirye
J. McKay
Bibhas Chakraborty
J. Nakatumba‐Nabende
188
26
0
31 Mar 2021
FastMoE: A Fast Mixture-of-Expert Training System
Jiaao He
J. Qiu
Aohan Zeng
Zhilin Yang
Jidong Zhai
Jie Tang
ALM
MoE
201
125
0
24 Mar 2021
Full Page Handwriting Recognition via Image to Sequence Extraction
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2021
Sumeet S. Singh
Sergey Karayev
255
65
0
11 Mar 2021
Hurdles to Progress in Long-form Question Answering
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Kalpesh Krishna
Aurko Roy
Mohit Iyyer
230
222
0
10 Mar 2021
Do Transformer Modifications Transfer Across Implementations and Applications?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Sharan Narang
Hyung Won Chung
Yi Tay
W. Fedus
Thibault Févry
...
Wei Li
Nan Ding
Jake Marcus
Adam Roberts
Colin Raffel
202
134
0
23 Feb 2021
VisuoSpatial Foresight for Physical Sequential Fabric Manipulation
Autonomous Robots (Auton. Robots), 2021
Ryan Hoque
Daniel Seita
Ashwin Balakrishna
Aditya Ganapathi
A. Tanwani
Nawid Jamali
K. Yamane
Soshi Iba
Ken Goldberg
147
43
0
19 Feb 2021
A Deep Adversarial Model for Suffix and Remaining Time Prediction of Event Sequences
SDM (SDM), 2021
Farbod Taymouri
M. Rosa
S. Erfani
115
33
0
15 Feb 2021
MUFASA: Multimodal Fusion Architecture Search for Electronic Health Records
AAAI Conference on Artificial Intelligence (AAAI), 2021
Zhen Xu
David R. So
Andrew M. Dai
Mamba
325
64
0
03 Feb 2021
Automated Query Reformulation for Efficient Search based on Query Logs From Stack Overflow
International Conference on Software Engineering (ICSE), 2021
Kaibo Cao
Chunyang Chen
Sebastian Baltes
Christoph Treude
Xiang Chen
231
64
0
01 Feb 2021
TextBox: A Unified, Modularized, and Extensible Framework for Text Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Junyi Li
Tianyi Tang
Gaole He
Jinhao Jiang
Xiaoxuan Hu
Puzhao Xie
Zhipeng Chen
Zhuohao Yu
Wayne Xin Zhao
Ji-Rong Wen
262
27
0
06 Jan 2021
Neural Machine Translation: A Review of Methods, Resources, and Tools
AI Open (AO), 2020
Zhixing Tan
Shuo Wang
Zonghan Yang
Gang Chen
Xuancheng Huang
Maosong Sun
Yang Liu
3DV
AI4TS
239
123
0
31 Dec 2020
Why Neural Machine Translation Prefers Empty Outputs
Xing Shi
Yijun Xiao
Kevin Knight
AAML
131
9
0
24 Dec 2020
*-CFQ: Analyzing the Scalability of Machine Learning on a Compositional Task
AAAI Conference on Artificial Intelligence (AAAI), 2020
Dmitry Tsarkov
Tibor Tihon
Nathan Scales
Nikola Momchev
Danila Sinopalnikov
Nathanael Scharli
153
17
0
15 Dec 2020
MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in Turkish
Machine Translation (MT), 2020
Begum Citamak
Ozan Caglayan
Menekse Kuyu
Erkut Erdem
Aykut Erdem
Pranava Madhyastha
Lucia Specia
190
9
0
13 Dec 2020
Attentional-Biased Stochastic Gradient Descent
Q. Qi
Yi Tian Xu
Rong Jin
W. Yin
Tianbao Yang
ODL
431
12
0
13 Dec 2020
Cross-lingual Transfer of Abstractive Summarizer to Less-resource Language
Aleš Žagar
Marko Robnik-Šikonja
227
11
0
08 Dec 2020
ConVEx: Data-Efficient and Few-Shot Slot Labeling
Matthew Henderson
Ivan Vulić
CLIP
VLM
193
38
0
22 Oct 2020
CUNI Systems for the Unsupervised and Very Low Resource Translation Task in WMT20
Ivana Kvapilíková
Tom Kocmi
Ondrej Bojar
90
5
0
22 Oct 2020
Detecting ESG topics using domain-specific language models and data augmentation approaches
Timothy Nugent
N. Stelea
Jochen L. Leidner
158
13
0
16 Oct 2020
Semantic Label Smoothing for Sequence to Sequence Problems
Michal Lukasik
Himanshu Jain
A. Menon
Seungyeon Kim
Srinadh Bhojanapalli
Felix X. Yu
Sanjiv Kumar
AI4TS
120
18
0
15 Oct 2020
Addressing Exposure Bias With Document Minimum Risk Training: Cambridge at the WMT20 Biomedical Translation Task
Conference on Machine Translation (WMT), 2020
Danielle Saunders
Bill Byrne
134
10
0
11 Oct 2020
fairseq S2T: Fast Speech-to-Text Modeling with fairseq
Changhan Wang
Yun Tang
Xutai Ma
Anne Wu
Sravya Popuri
Dmytro Okhonko
J. Pino
VLM
LRM
299
316
0
11 Oct 2020
On Task-Level Dialogue Composition of Generative Transformer Model
First Workshop on Insights from Negative Results in NLP (IFNRN), 2020
Prasanna Parthasarathi
Arvind Neelakantan
Sharan Narang
109
2
0
09 Oct 2020
Query-Key Normalization for Transformers
Findings (Findings), 2020
Alex Henry
Prudhvi Raj Dachapally
S. Pawar
Yuxuan Chen
209
148
0
08 Oct 2020
Improving Sequential Latent Variable Models with Autoregressive Flows
Joseph Marino
Lei Chen
Jiawei He
Stephan Mandt
BDL
AI4TS
303
15
0
07 Oct 2020
Why Skip If You Can Combine: A Simple Knowledge Distillation Technique for Intermediate Layers
Yimeng Wu
Peyman Passban
Mehdi Rezagholizade
Qun Liu
MoE
128
37
0
06 Oct 2020
Code to Comment "Translation": Data, Metrics, Baselining & Evaluation
International Conference on Automated Software Engineering (ASE), 2020
David Gros
Hariharan Sezhiyan
Prem Devanbu
Zhou Yu
170
82
0
03 Oct 2020
Expectigrad: Fast Stochastic Optimization with Robust Convergence Properties
Brett Daley
Chris Amato
ODL
138
4
0
03 Oct 2020
Seq2Edits: Sequence Transduction Using Span-level Edit Operations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Felix Stahlberg
Shankar Kumar
BDL
184
95
0
23 Sep 2020
Previous
1
2
3
4
5
6
Next