Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.07868
Cited By
Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks
25 February 2016
Tim Salimans
Diederik P. Kingma
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks"
50 / 957 papers shown
Title
Fix your classifier: the marginal value of training the last weight layer
Elad Hoffer
Itay Hubara
Daniel Soudry
21
101
0
14 Jan 2018
Generating Neural Networks with Neural Networks
Lior Deutsch
31
21
0
06 Jan 2018
PixelSNAIL: An Improved Autoregressive Generative Model
Xi Chen
Nikhil Mishra
Mostafa Rohaninejad
Pieter Abbeel
DRL
DiffM
BDL
GAN
9
270
0
28 Dec 2017
Letter-Based Speech Recognition with Gated ConvNets
Vitaliy Liptchinsky
Gabriel Synnaeve
R. Collobert
16
71
0
22 Dec 2017
Sockeye: A Toolkit for Neural Machine Translation
F. Hieber
Tobias Domhan
Michael J. Denkowski
David Vilar
Artem Sokolov
Ann Clifton
Matt Post
11
215
0
15 Dec 2017
The exploding gradient problem demystified - definition, prevalence, impact, origin, tradeoffs, and solutions
George Philipp
D. Song
J. Carbonell
ODL
27
46
0
15 Dec 2017
Deep convolutional neural networks for brain image analysis on magnetic resonance imaging: a review
J. Bernal
Kaisar Kushibar
Daniel S. Asfaw
Sergi Valverde
A. Oliver
Robert Martí
Xavier Llado
14
348
0
11 Dec 2017
Gradient Descent Learns One-hidden-layer CNN: Don't be Afraid of Spurious Local Minima
S. Du
J. Lee
Yuandong Tian
Barnabás Póczós
Aarti Singh
MLT
24
234
0
03 Dec 2017
Compatibility Family Learning for Item Recommendation and Generation
Yong-Siang Shih
Kai-Yueh Chang
Hsuan-Tien Lin
Min Sun
24
55
0
02 Dec 2017
Safer Classification by Synthesis
William Wang
Angelina Wang
Aviv Tamar
Xi Chen
Pieter Abbeel
34
41
0
22 Nov 2017
Universal Denoising Networks : A Novel CNN Architecture for Image Denoising
Stamatios Lefkimmiatis
OOD
SupR
24
12
0
21 Nov 2017
A Classifying Variational Autoencoder with Application to Polyphonic Music Generation
Jay A. Hennig
Akash Umakantha
R. Williamson
MGen
BDL
14
17
0
19 Nov 2017
Global versus Localized Generative Adversarial Nets
Guo-Jun Qi
Liheng Zhang
Hao Hu
Marzieh Edraki
Jingdong Wang
Xian-Sheng Hua
GAN
22
81
0
16 Nov 2017
Decoupled Weight Decay Regularization
I. Loshchilov
Frank Hutter
OffRL
22
2,078
0
14 Nov 2017
Classical Structured Prediction Losses for Sequence to Sequence Learning
Sergey Edunov
Myle Ott
Michael Auli
David Grangier
MarcÁurelio Ranzato
AIMat
48
185
0
14 Nov 2017
Sobolev GAN
Youssef Mroueh
Chun-Liang Li
Tom Sercu
Anant Raj
Yu Cheng
8
117
0
14 Nov 2017
Convolutional Neural Network with Word Embeddings for Chinese Word Segmentation
Chunqi Wang
Bo Xu
17
46
0
13 Nov 2017
Compression-aware Training of Deep Networks
J. Álvarez
Mathieu Salzmann
10
172
0
07 Nov 2017
Implicit Weight Uncertainty in Neural Networks
Nick Pawlowski
Andrew Brock
Matthew C. H. Lee
Martin Rajchl
Ben Glocker
BDL
UQCV
16
95
0
03 Nov 2017
Smooth Neighbors on Teacher Graphs for Semi-supervised Learning
Yucen Luo
Jun Zhu
Mengxi Li
Yong Ren
Bo Zhang
19
242
0
01 Nov 2017
Revisit Fuzzy Neural Network: Demystifying Batch Normalization and ReLU with Generalized Hamming Network
Lixin Fan
8
26
0
27 Oct 2017
Progressive Growing of GANs for Improved Quality, Stability, and Variation
Tero Karras
Timo Aila
S. Laine
J. Lehtinen
GAN
25
7,274
0
27 Oct 2017
Malware Detection by Eating a Whole EXE
Edward Raff
Jon Barker
Jared Sylvester
Robert Brandon
Bryan Catanzaro
Charles K. Nicholas
19
535
0
25 Oct 2017
Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning
Wei Ping
Kainan Peng
Andrew Gibiansky
Sercan Ö. Arik
Ajay Kannan
Sharan Narang
Jonathan Raiman
John Miller
11
303
0
20 Oct 2017
Bayesian Hypernetworks
David M. Krueger
Chin-Wei Huang
Riashat Islam
Ryan Turner
Alexandre Lacoste
Aaron Courville
UQCV
BDL
17
139
0
13 Oct 2017
Projection Based Weight Normalization for Deep Neural Networks
Lei Huang
Xianglong Liu
B. Lang
Bo-wen Li
12
18
0
06 Oct 2017
Improving Lexical Choice in Neural Machine Translation
Toan Q. Nguyen
David Chiang
16
86
0
03 Oct 2017
Training Feedforward Neural Networks with Standard Logistic Activations is Feasible
Emanuele Sansone
F. D. De Natale
19
4
0
03 Oct 2017
Riemannian approach to batch normalization
Minhyung Cho
Jaehyung Lee
21
93
0
27 Sep 2017
Comparison of Batch Normalization and Weight Normalization Algorithms for the Large-scale Image Classification
Igor Gitman
Boris Ginsburg
6
65
0
24 Sep 2017
Dynamic Evaluation of Neural Sequence Models
Ben Krause
Emmanuel Kahembwe
Iain Murray
Steve Renals
19
133
0
21 Sep 2017
Orthogonal Weight Normalization: Solution to Optimization over Multiple Dependent Stiefel Manifolds in Deep Neural Networks
Lei Huang
Xianglong Liu
B. Lang
Adams Wei Yu
Yongliang Wang
Bo Li
ODL
19
223
0
16 Sep 2017
Normalized Direction-preserving Adam
Zijun Zhang
Lin Ma
Zongpeng Li
Chuan Wu
ODL
12
29
0
13 Sep 2017
Shifting Mean Activation Towards Zero with Bipolar Activation Functions
L. Eidnes
Arild Nøkland
16
18
0
12 Sep 2017
An unsupervised long short-term memory neural network for event detection in cell videos
Ha Tran Hong Phan
Ashnil Kumar
D. Feng
M. Fulham
Jinman Kim
18
5
0
07 Sep 2017
Training Spiking Neural Networks for Cognitive Tasks: A Versatile Framework Compatible to Various Temporal Codes
Chaofei Hong
11
0
0
02 Sep 2017
Proportionate gradient updates with PercentDelta
Sami Abu-El-Haija
26
7
0
24 Aug 2017
Exploiting Convolution Filter Patterns for Transfer Learning
Mehmet Aygun
Y. Aytar
H. K. Ekenel
8
12
0
23 Aug 2017
SMASH: One-Shot Model Architecture Search through HyperNetworks
Andrew Brock
Theodore Lim
J. Ritchie
Nick Weston
15
761
0
17 Aug 2017
An Effective Training Method For Deep Convolutional Neural Network
Yangzhou Jiang
Zeyang Dou
Qun Hao
Jie Cao
Kun Gao
Xi Chen
22
0
0
31 Jul 2017
Learning Algorithms for Active Learning
Philip Bachman
Alessandro Sordoni
Adam Trischler
VLM
16
154
0
31 Jul 2017
Tensor Regression Networks
Jean Kossaifi
Zachary Chase Lipton
Arinbjorn Kolbeinsson
Aran Khanna
Tommaso Furlanello
Anima Anandkumar
3DV
32
145
0
26 Jul 2017
Linear Discriminant Generative Adversarial Networks
Zhun Sun
Mete Ozay
Takayuki Okatani
GAN
19
1
0
25 Jul 2017
One-shot Face Recognition by Promoting Underrepresented Classes
Yandong Guo
Lei Zhang
CVBM
25
104
0
18 Jul 2017
Block-Normalized Gradient Method: An Empirical Study for Training Deep Neural Network
Adams Wei Yu
Lei Huang
Qihang Lin
Ruslan Salakhutdinov
J. Carbonell
ODL
10
24
0
16 Jul 2017
Adversarial Dropout for Supervised and Semi-supervised Learning
Sungrae Park
Jun-Keon Park
Su-Jin Shin
Il-Chul Moon
GAN
25
174
0
12 Jul 2017
Structured Sparse Ternary Weight Coding of Deep Neural Networks for Efficient Hardware Implementations
Yoonho Boo
Wonyong Sung
MQ
22
11
0
01 Jul 2017
Dr.VAE: Drug Response Variational Autoencoder
Ladislav Rampášek
Daniel Hidru
P. Smirnov
B. Haibe-Kains
Anna Goldenberg
DRL
11
32
0
26 Jun 2017
Learning Hierarchical Information Flow with Recurrent Neural Modules
Danijar Hafner
A. Irpan
James Davidson
N. Heess
6
9
0
18 Jun 2017
Bayesian Conditional Generative Adverserial Networks
Ehsan Abbasnejad
Javen Qinfeng Shi
Iman Abbasnejad
A. Hengel
A. Dick
GAN
11
12
0
17 Jun 2017
Previous
1
2
3
...
17
18
19
20
Next