Tensor2Tensor for Neural Machine Translation

16 March 2018

Papers citing "Tensor2Tensor for Neural Machine Translation"

50 / 264 papers shown

CrossedWires: A Dataset of Syntactically Equivalent but Semantically Disparate Deep Learning Models

109

29 Aug 2021

YANMTT: Yet Another Neural Machine Translation ToolkitAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Mary Dabre

Eiichiro Sumita

221

25 Aug 2021

Compositional Generalization in Multilingual Semantic Parsing over WikidataTransactions of the Association for Computational Linguistics (TACL), 2021

Ruixiang Cui

Rahul Aralikatte

Heather Lent

Daniel Hershcovich

242

07 Aug 2021

Residual Tree Aggregation of Layers for Neural Machine Translation

Guoliang Li

Yiyang Li

113

19 Jul 2021

Neural Machine Translation for Low-Resource Languages: A SurveyACM Computing Surveys (CSUR), 2021

Surangika Ranathunga

E. Lee

Marjana Prifti Skenduli

Ravi Shekhar

Mehreen Alam

Rishemjit Kaur

321

324

29 Jun 2021

A Survey of TransformersAI Open (AO), 2021

Tianyang Lin

Yuxin Wang

Xiangyang Liu

Xipeng Qiu

ViT

445

1,386

08 Jun 2021

Luna: Linear Unified Nested AttentionNeural Information Processing Systems (NeurIPS), 2021

Sinong Wang

Hao Ma

Luke Zettlemoyer

235

133

03 Jun 2021

Transformers are Deep Infinite-Dimensional Non-Mercer Binary Kernel Machines

Matthew A. Wright

Joseph E. Gonzalez

226

02 Jun 2021

Synthetic Data Generation for Grammatical Error Correction with Tagged Corruption ModelsWorkshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2021

Felix Stahlberg

Shankar Kumar

SyDa

220

103

27 May 2021

TranSmart: A Practical Interactive Machine Translation System

171

27 May 2021

Rethinking Skip Connection with Layer Normalization in Transformers and ResNetsInternational Conference on Computational Linguistics (COLING), 2020

Xuancheng Ren

Yuexian Zou

148

15 May 2021

Spelling Correction with Denoising Transformer

Alexandr Kuznetsov

Hector Urdiales

123

12 May 2021

Hierarchical RNNs-Based Transformers MADDPG for Mixed Cooperative-Competitive EnvironmentsJournal of Intelligent & Fuzzy Systems (JIFS), 2021

192

11 May 2021

EL-Attention: Memory Efficient Lossless Attention for GenerationInternational Conference on Machine Learning (ICML), 2021

166

11 May 2021

Billion-scale Pre-trained E-commerce Product Knowledge Graph ModelIEEE International Conference on Data Engineering (ICDE), 2021

Wei Zhang

Huajun Chen

207

02 May 2021

A Simple and Effective Positional Encoding for TransformersConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Srinadh Bhojanapalli

252

18 Apr 2021

Sync-Switch: Hybrid Parameter Synchronization for Distributed Deep LearningIEEE International Conference on Distributed Computing Systems (ICDCS), 2021

230

16 Apr 2021

Counter-Interference Adapter for Multilingual Machine TranslationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Lei Li

275

16 Apr 2021

First the worst: Finding better gender translations during beam searchFindings (Findings), 2021

D. Saunders

Rosie Sallis

Bill Byrne

194

15 Apr 2021

WHOSe Heritage: Classification of UNESCO World Heritage "Outstanding Universal Value" Documents with Soft LabelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

141

12 Apr 2021

Extended Parallel Corpus for Amharic-English Machine TranslationInternational Conference on Language Resources and Evaluation (LREC), 2021

A. Gezmu

A. Nürnberger

T. Bati

254

08 Apr 2021

Sample size estimation for comparing dynamic treatment regimens in a SMART: a Monte Carlo-based approach and case study with longitudinal overdispersed count outcomesStatistical Methods in Medical Research (Stat Med), 2021

John J. Dziak

Bibhas Chakraborty

194

31 Mar 2021

FastMoE: A Fast Mixture-of-Expert Training System

202

129

24 Mar 2021

Full Page Handwriting Recognition via Image to Sequence ExtractionIEEE International Conference on Document Analysis and Recognition (ICDAR), 2021

Sumeet S. Singh

Sergey Karayev

256

11 Mar 2021

Hurdles to Progress in Long-form Question AnsweringNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021

Kalpesh Krishna

Aurko Roy

Mohit Iyyer

238

222

10 Mar 2021

Do Transformer Modifications Transfer Across Implementations and Applications?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Sharan Narang

...

215

134

23 Feb 2021

VisuoSpatial Foresight for Physical Sequential Fabric ManipulationAutonomous Robots (Auton. Robots), 2021

148

19 Feb 2021

A Deep Adversarial Model for Suffix and Remaining Time Prediction of Event SequencesSDM (SDM), 2021

Farbod Taymouri

M. Rosa

S. Erfani

128

15 Feb 2021

MUFASA: Multimodal Fusion Architecture Search for Electronic Health RecordsAAAI Conference on Artificial Intelligence (AAAI), 2021

342

03 Feb 2021

Automated Query Reformulation for Efficient Search based on Query Logs From Stack OverflowInternational Conference on Software Engineering (ICSE), 2021

232

01 Feb 2021

TextBox: A Unified, Modularized, and Extensible Framework for Text GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

306

06 Jan 2021

Neural Machine Translation: A Review of Methods, Resources, and ToolsAI Open (AO), 2020

Zhixing Tan

Shuo Wang

Zonghan Yang

Gang Chen

Xuancheng Huang

Maosong Sun

Yang Liu

3DV AI4TS

259

124

31 Dec 2020

Why Neural Machine Translation Prefers Empty Outputs

131

24 Dec 2020

*-CFQ: Analyzing the Scalability of Machine Learning on a Compositional TaskAAAI Conference on Artificial Intelligence (AAAI), 2020

Nikola Momchev

171

15 Dec 2020

MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in TurkishMachine Translation (MT), 2020

Pranava Madhyastha

206

13 Dec 2020

Attentional-Biased Stochastic Gradient Descent

459

13 Dec 2020

Cross-lingual Transfer of Abstractive Summarizer to Less-resource Language

Aleš Žagar

Marko Robnik-Šikonja

240

08 Dec 2020

ConVEx: Data-Efficient and Few-Shot Slot Labeling

Matthew Henderson

Ivan Vulić

CLIP VLM

209

22 Oct 2020

CUNI Systems for the Unsupervised and Very Low Resource Translation Task in WMT20

Ivana Kvapilíková

Tom Kocmi

Ondrej Bojar

22 Oct 2020

Detecting ESG topics using domain-specific language models and data augmentation approaches

Timothy Nugent

N. Stelea

Jochen L. Leidner

164

16 Oct 2020

Semantic Label Smoothing for Sequence to Sequence Problems

Srinadh Bhojanapalli

Sanjiv Kumar

125

15 Oct 2020

Addressing Exposure Bias With Document Minimum Risk Training: Cambridge at the WMT20 Biomedical Translation TaskConference on Machine Translation (WMT), 2020

Danielle Saunders

Bill Byrne

156

11 Oct 2020

fairseq S2T: Fast Speech-to-Text Modeling with fairseq

325

318

11 Oct 2020

On Task-Level Dialogue Composition of Generative Transformer ModelFirst Workshop on Insights from Negative Results in NLP (IFNRN), 2020

Prasanna Parthasarathi

Arvind Neelakantan

Sharan Narang

113

09 Oct 2020

Query-Key Normalization for TransformersFindings (Findings), 2020

Alex Henry

Prudhvi Raj Dachapally

S. Pawar

Yuxuan Chen

228

153

08 Oct 2020

Improving Sequential Latent Variable Models with Autoregressive Flows

336

07 Oct 2020

Why Skip If You Can Combine: A Simple Knowledge Distillation Technique for Intermediate Layers

Qun Liu

133

06 Oct 2020

Code to Comment "Translation": Data, Metrics, Baselining & EvaluationInternational Conference on Automated Software Engineering (ASE), 2020

170

03 Oct 2020

Expectigrad: Fast Stochastic Optimization with Robust Convergence Properties

Brett Daley

Chris Amato

ODL

138

03 Oct 2020

Seq2Edits: Sequence Transduction Using Span-level Edit OperationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2020

Felix Stahlberg

Shankar Kumar

BDL

196

23 Sep 2020