v1v2v3 (latest)

Tying Word Vectors and Word Classifiers: A Loss Framework for Language Modeling

4 November 2016

Papers citing "Tying Word Vectors and Word Classifiers: A Loss Framework for Language Modeling"

50 / 237 papers shown

Thick-Net: Parallel Network Structure for Sequential Modeling

130

19 Nov 2019

Improving Transformer Models by Reordering their SublayersAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

Ofir Press

Noah A. Smith

Omer Levy

168

10 Nov 2019

Kernelized Bayesian Softmax for Text GenerationNeural Information Processing Systems (NeurIPS), 2019

Ning Miao

Hao Zhou

Chengqi Zhao

Wenxian Shi

Lei Li

131

01 Nov 2019

Federated Evaluation of On-device Personalization

123

307

22 Oct 2019

Deep Independently Recurrent Neural Network (IndRNN)

Shuai Li

Wanqing Li

Chris Cook

Yanbo Gao

197

11 Oct 2019

Searching for A Robust Neural Architecture in Four GPU HoursComputer Vision and Pattern Recognition (CVPR), 2019

Xuanyi Dong

Yezhou Yang

432

689

10 Oct 2019

AntMan: Sparse Low-Rank Compression to Accelerate RNN inference

Samyam Rajbhandari

H. Shrivastava

J. Rho

102

02 Oct 2019

Generalization in Generation: A closer look at Exposure BiasConference on Empirical Methods in Natural Language Processing (EMNLP), 2019

Florian Schmidt

252

115

01 Oct 2019

Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASRSpoken Language Technology Workshop (SLT), 2018

22 Sep 2019

Ouroboros: On Accelerating Training of Transformer-Based Language ModelsNeural Information Processing Systems (NeurIPS), 2019

Lawrence Carin

138

14 Sep 2019

CTRL: A Conditional Transformer Language Model for Controllable Generation

863

1,362

11 Sep 2019

Subword Language Model for Query Auto-CompletionConference on Empirical Methods in Natural Language Processing (EMNLP), 2019

Gyuwan Kim

137

02 Sep 2019

Quantity doesn't buy quality syntax with neural language modelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2019

Marten van Schijndel

Aaron Mueller

Tal Linzen

166

31 Aug 2019

Restricted Recurrent Neural Networks

Enmao Diao

Jie Ding

Vahid Tarokh

161

21 Aug 2019

SenseBERT: Driving Some Sense into BERTAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

267

196

15 Aug 2019

Representation Degeneration Problem in Training Natural Language Generation ModelsInternational Conference on Learning Representations (ICLR), 2019

Xu Tan

216

311

28 Jul 2019

Adaptive Noise Injection: A Structure-Expanding Regularization for RNN

114

25 Jul 2019

Augmenting Self-attention with Persistent Memory

223

149

02 Jul 2019

Kite: Automatic speech recognition for unmanned aerial vehiclesInterspeech (Interspeech), 2019

Dan Oneaţă

H. Cucu

116

02 Jul 2019

Relating Simple Sentence Representations in Deep Neural Networks and the BrainAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

144

27 Jun 2019

A Tensorized Transformer for Language ModelingNeural Information Processing Systems (NeurIPS), 2019

354

186

24 Jun 2019

Character n-gram Embeddings to Improve RNN Language ModelsAAAI Conference on Artificial Intelligence (AAAI), 2019

Sho Takase

Jun Suzuki

Masaaki Nagata

145

13 Jun 2019

Improving Neural Language Modeling via Adversarial TrainingInternational Conference on Machine Learning (ICML), 2019

286

122

10 Jun 2019

Improving Neural Language Models by Segmenting, Attending, and Predicting the FutureAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

146

04 Jun 2019

A Cross-Sentence Latent Variable Model for Semi-Supervised Text Sequence MatchingAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

152

04 Jun 2019

Efficient Neural Architecture Search via Proximal IterationsAAAI Conference on Artificial Intelligence (AAAI), 2019

335

108

30 May 2019

Deep Residual Output Layers for Neural Language GenerationInternational Conference on Machine Learning (ICML), 2019

Nikolaos Pappas

James Henderson

206

14 May 2019

Mutual Information Scaling and Expressive Power of Sequence Models

Huitao Shen

172

10 May 2019

Differentiable Architecture Search with Ensemble Gumbel-Softmax

108

06 May 2019

Language Models with Transformers

Chenguang Wang

Mu Li

Alex Smola

263

132

20 Apr 2019

Improving Human Text Comprehension through Semi-Markov CRF-based Neural Section Title Generation

Sebastian Gehrmann

Steven Layne

Franck Dernoncourt

102

15 Apr 2019

Knowledge Distillation For Recurrent Neural Network Language Modeling With Trust Regularization

208

08 Apr 2019

WeNet: Weighted Networks for Recurrent Network Architecture Search

Zhiheng Huang

Bing Xiang

08 Apr 2019

SEQ^3: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for Unsupervised Abstractive Sentence Compression

Christos Baziotis

Ion Androutsopoulos

Ioannis Konstas

Alexandros Potamianos

225

07 Apr 2019

Conversation Model Fine-Tuning for Classifying Client Utterances in Counseling Dialogues

Sungjoon Park

Donghyun Kim

Alice Oh

125

31 Mar 2019

Pre-trained Language Model Representations for Language Generation

Sergey Edunov

Alexei Baevski

Michael Auli

229

137

22 Mar 2019

Cloze-driven Pretraining of Self-attention NetworksConference on Empirical Methods in Natural Language Processing (EMNLP), 2019

Alexei Baevski

Sergey Edunov

Yinhan Liu

Luke Zettlemoyer

Michael Auli

174

205

19 Mar 2019

Efficient Contextual Representation Learning Without Softmax Layer

112

28 Feb 2019

Context Vectors are Reflections of Word Vectors in Half the DimensionsJournal of Artificial Intelligence Research (JAIR), 2019

Z. Assylbekov

Rustem Takhanov

120

26 Feb 2019

Breaking the Softmax Bottleneck via Learnable Monotonic Pointwise Non-linearities

201

21 Feb 2019

Compression of Recurrent Neural Networks for Efficient Language Modeling

Artem M. Grachev

D. Ignatov

Andrey V. Savchenko

168

06 Feb 2019

Tensorized Embedding Layers for Efficient Model Compression

235

30 Jan 2019

Variational Smoothing in Recurrent Neural Network Language Models

Lingpeng Kong

104

27 Jan 2019

Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context

749

4,119

09 Jan 2019

FastGRNN: A Fast, Accurate, Stable and Tiny Kilobyte Sized Gated Recurrent Neural Network

192

197

08 Jan 2019

Multi-style Generative Reading Comprehension

244

08 Jan 2019

Exploring Weight Symmetry in Deep Neural Networks

S. Hu

Sergey Zagoruyko

N. Komodakis

194

28 Dec 2018

Precision Highway for Ultra Low-Precision Quantization

229

24 Dec 2018

Learning Private Neural Language Modeling with Attentive Aggregation

Zi Huang

247

164

17 Dec 2018

Neural Abstractive Text Summarization with Sequence-to-Sequence Models

408

252

05 Dec 2018