Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1708.02182
Cited By
Regularizing and Optimizing LSTM Language Models
7 August 2017
Stephen Merity
N. Keskar
R. Socher
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Regularizing and Optimizing LSTM Language Models"
50 / 508 papers shown
Title
BAMSProd: A Step towards Generalizing the Adaptive Optimization Methods to Deep Binary Model
Junjie Liu
Dongchao Wen
Deyu Wang
Wei Tao
Tse-Wei Chen
Kinya Osa
Masami Kato
MQ
13
1
0
29 Sep 2020
Identifying Automatically Generated Headlines using Transformers
Antonis Maronikolakis
Hinrich Schütze
Mark Stevenson
17
3
0
28 Sep 2020
Multi-timescale Representation Learning in LSTM Language Models
Shivangi Mahto
Vy A. Vo
Javier S. Turek
Alexander G. Huth
12
29
0
27 Sep 2020
Grounded Compositional Outputs for Adaptive Language Modeling
Nikolaos Pappas
Phoebe Mulcaire
Noah A. Smith
KELM
25
7
0
24 Sep 2020
Automated Source Code Generation and Auto-completion Using Deep Learning: Comparing and Discussing Current Language-Model-Related Approaches
Juan Cruz-Benito
Sanjay Vishwakarma
Francisco Martín-Fernández
Ismael Faro Ibm Quantum
22
30
0
16 Sep 2020
Cascaded Semantic and Positional Self-Attention Network for Document Classification
Juyong Jiang
Jie Zhang
Kai Zhang
24
6
0
15 Sep 2020
Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding
Sahar Abdelnabi
Mario Fritz
WaLM
18
143
0
07 Sep 2020
S-SGD: Symmetrical Stochastic Gradient Descent with Weight Noise Injection for Reaching Flat Minima
Wonyong Sung
Iksoo Choi
Jinhwan Park
Seokhyun Choi
Sungho Shin
ODL
17
7
0
05 Sep 2020
A Survey of Active Learning for Text Classification using Deep Neural Networks
Christopher Schröder
A. Niekler
6
98
0
17 Aug 2020
Domain-specific Communication Optimization for Distributed DNN Training
Hao Wang
Jingrong Chen
Xinchen Wan
Han Tian
Jiacheng Xia
Gaoxiong Zeng
Weiyan Wang
Kai Chen
Wei Bai
Junchen Jiang
AI4CE
6
15
0
16 Aug 2020
DeLighT: Deep and Light-weight Transformer
Sachin Mehta
Marjan Ghazvininejad
Srini Iyer
Luke Zettlemoyer
Hannaneh Hajishirzi
VLM
17
32
0
03 Aug 2020
Neural Architecture Search as Sparse Supernet
Y. Wu
Aoming Liu
Zhiwu Huang
Siwei Zhang
Luc Van Gool
17
22
0
31 Jul 2020
Composer Style Classification of Piano Sheet Music Images Using Language Model Pretraining
T. Tsai
Kevin Ji
VLM
14
17
0
29 Jul 2020
Stochastic Normalized Gradient Descent with Momentum for Large-Batch Training
Shen-Yi Zhao
Chang-Wei Shi
Yin-Peng Xie
Wu-Jun Li
ODL
13
8
0
28 Jul 2020
Representation Learning via Adversarially-Contrastive Optimal Transport
A. Cherian
Shuchin Aeron
OT
4
7
0
11 Jul 2020
Learning Over-Parametrized Two-Layer ReLU Neural Networks beyond NTK
Yuanzhi Li
Tengyu Ma
Hongyang R. Zhang
MLT
20
28
0
09 Jul 2020
Diverse and Styled Image Captioning Using SVD-Based Mixture of Recurrent Experts
Marzi Heidari
M. Ghatee
A. Nickabadi
Arash Pourhasan Nezhad
DiffM
MoE
27
1
0
07 Jul 2020
Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit
Ron Meir
K. Ciosek
OffRL
10
70
0
04 Jul 2020
Efficient Algorithms for Device Placement of DNN Graph Operators
Jakub Tarnawski
Amar Phanishayee
Nikhil R. Devanur
Divya Mahajan
Fanny Nina Paravecino
17
66
0
29 Jun 2020
Learning Sparse Prototypes for Text Generation
Junxian He
Taylor Berg-Kirkpatrick
Graham Neubig
19
23
0
29 Jun 2020
Taming GANs with Lookahead-Minmax
Tatjana Chavdarova
Matteo Pagliardini
Sebastian U. Stich
F. Fleuret
Martin Jaggi
GAN
13
25
0
25 Jun 2020
NAS-Bench-NLP: Neural Architecture Search Benchmark for Natural Language Processing
Nikita Klyuchnikov
I. Trofimov
Ekaterina Artemova
Mikhail Salnikov
M. Fedorov
Evgeny Burnaev
VLM
8
101
0
12 Jun 2020
Extrapolation for Large-batch Training in Deep Learning
Tao R. Lin
Lingjing Kong
Sebastian U. Stich
Martin Jaggi
14
36
0
10 Jun 2020
Exploring the Vulnerability of Deep Neural Networks: A Study of Parameter Corruption
Xu Sun
Zhiyuan Zhang
Xuancheng Ren
Ruixuan Luo
Liangyou Li
19
39
0
10 Jun 2020
Transfer Learning for British Sign Language Modelling
B. Mocialov
Graham Turner
H. Hastie
SLR
18
18
0
03 Jun 2020
Nurse is Closer to Woman than Surgeon? Mitigating Gender-Biased Proximities in Word Embeddings
Vaibhav Kumar
Tenzin Singhay Bhotia
Vaibhav Kumar
Tanmoy Chakraborty
CVBM
9
47
0
02 Jun 2020
A Survey on Transfer Learning in Natural Language Processing
Zaid Alyafeai
Maged S. Alshaibani
Irfan Ahmad
22
72
0
31 May 2020
Stance Prediction for Contemporary Issues: Data and Experiments
Marjan Hosseinia
Eduard Constantin Dragut
Arjun Mukherjee
16
28
0
29 May 2020
CoolMomentum: A Method for Stochastic Optimization by Langevin Dynamics with Simulated Annealing
O. Borysenko
M. Byshkin
ODL
9
14
0
29 May 2020
BRENDA: Browser Extension for Fake News Detection
Bjarte Botnevik
Eirik Sakariassen
Vinay Setty
6
41
0
27 May 2020
Stochastic Optimization with Heavy-Tailed Noise via Accelerated Gradient Clipping
Eduard A. Gorbunov
Marina Danilova
Alexander Gasnikov
6
115
0
21 May 2020
CS-NLP team at SemEval-2020 Task 4: Evaluation of State-of-the-art NLP Deep Learning Architectures on Commonsense Reasoning Task
Sirwe Saeedi
Ali (Aliakbar) Panahi
Seyran Saeedi
A. Fong
ReLM
ELM
LRM
16
12
0
17 May 2020
MicroNet for Efficient Language Modeling
Zhongxia Yan
Hanrui Wang
Demi Guo
Song Han
15
8
0
16 May 2020
Neural Networks Versus Conventional Filters for Inertial-Sensor-based Attitude Estimation
Daniel Weber
C. Gühmann
Thomas Seel
6
34
0
14 May 2020
Exploiting Syntactic Structure for Better Language Modeling: A Syntactic Distance Approach
Wenyu Du
Zhouhan Lin
Yikang Shen
Timothy J. O'Donnell
Yoshua Bengio
Yue Zhang
20
13
0
12 May 2020
Neural Polysynthetic Language Modelling
Lane Schwartz
Francis M. Tyers
Lori S. Levin
Christo Kirov
Patrick Littell
...
Vasilisa Andriyanets
Aldrian Obaja Muis
Naoki Otani
J. Park
Zhisong Zhang
11
24
0
11 May 2020
A Tale of Two Perplexities: Sensitivity of Neural Language Models to Lexical Retrieval Deficits in Dementia of the Alzheimer's Type
T. Cohen
Serguei V. S. Pakhomov
6
25
0
07 May 2020
Weakly-Supervised Neural Response Selection from an Ensemble of Task-Specialised Dialogue Agents
Asir Saeed
Khai Mai
Pham Quang Nhat Minh
Nguyen Tuan Duc
Danushka Bollegala
14
0
0
06 May 2020
Learning Architectures from an Extended Search Space for Language Modeling
Yinqiao Li
Chi Hu
Yuhao Zhang
Nuo Xu
Yufan Jiang
Tong Xiao
Jingbo Zhu
Tongran Liu
Changliang Li
14
10
0
06 May 2020
Russian Natural Language Generation: Creation of a Language Modelling Dataset and Evaluation with Modern Neural Architectures
Zein Shaheen
G. Wohlgenannt
Bassel Zaity
D. Mouromtsev
Vadim Pak
6
2
0
05 May 2020
Stolen Probability: A Structural Weakness of Neural Language Models
David Demeter
Gregory J. Kimmel
Doug Downey
17
32
0
05 May 2020
Noise Pollution in Hospital Readmission Prediction: Long Document Classification with Reinforcement Learning
Liyan Xu
J. Hogan
R. Patzer
Jinho D. Choi
12
4
0
04 May 2020
Why and when should you pool? Analyzing Pooling in Recurrent Architectures
Pratyush Maini
Keshav Kolluru
Danish Pruthi
Mausam
26
6
0
01 May 2020
Learning Music Helps You Read: Using Transfer to Study Linguistic Structure in Language Models
Isabel Papadimitriou
Dan Jurafsky
24
9
0
30 Apr 2020
An Empirical Study of Pre-trained Transformers for Arabic Information Extraction
Wuwei Lan
Yang Chen
Wei-ping Xu
Alan Ritter
14
4
0
30 Apr 2020
Politeness Transfer: A Tag and Generate Approach
Aman Madaan
Amrith Rajagopal Setlur
Tanmay Parekh
Barnabás Póczós
Graham Neubig
Yiming Yang
Ruslan Salakhutdinov
A. Black
Shrimai Prabhumoye
20
159
0
29 Apr 2020
Analyzing Political Parody in Social Media
Antonis Maronikolakis
Danae Sánchez Villegas
Daniel Preotiuc-Pietro
Nikolaos Aletras
6
21
0
28 Apr 2020
Towards a Competitive End-to-End Speech Recognition for CHiME-6 Dinner Party Transcription
A. Andrusenko
A. Laptev
Ivan Medennikov
9
16
0
22 Apr 2020
An Analysis of the Utility of Explicit Negative Examples to Improve the Syntactic Abilities of Neural Language Models
Hiroshi Noji
Hiroya Takamura
11
14
0
06 Apr 2020
Syntax-driven Iterative Expansion Language Models for Controllable Text Generation
Noe Casas
José A. R. Fonollosa
Marta R. Costa-jussá
19
11
0
05 Apr 2020
Previous
1
2
3
4
5
6
...
9
10
11
Next