Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1706.02515
Cited By
v1
v2
v3
v4
v5 (latest)
Self-Normalizing Neural Networks
Neural Information Processing Systems (NeurIPS), 2017
8 June 2017
Günter Klambauer
Thomas Unterthiner
Andreas Mayr
Sepp Hochreiter
Re-assign community
ArXiv (abs)
PDF
HTML
Github (1585★)
Papers citing
"Self-Normalizing Neural Networks"
50 / 926 papers shown
LipschitzLR: Using theoretically computed adaptive learning rates for fast convergence
Rahul Yedida
Snehanshu Saha
Tejas Prashanth
ODL
101
14
0
20 Feb 2019
On the Impact of the Activation Function on Deep Neural Networks Training
Soufiane Hayou
Arnaud Doucet
Judith Rousseau
ODL
268
219
0
19 Feb 2019
Fake News Detection on Social Media using Geometric Deep Learning
Federico Monti
Fabrizio Frasca
D. Eynard
Damon Mannion
M. Bronstein
GNN
223
530
0
10 Feb 2019
A simple and efficient architecture for trainable activation functions
Andrea Apicella
Francesco Isgrò
R. Prevete
173
41
0
08 Feb 2019
Artificial Intelligence for Prosthetics - challenge solutions
L. Kidzinski
Carmichael F. Ong
Sharada Mohanty
Jennifer Hicks
Sean F. Carroll
...
E. Tumer
J. Watson
M. Salathé
Sergey Levine
Scott L. Delp
113
48
0
07 Feb 2019
Attention in Natural Language Processing
Andrea Galassi
Marco Lippi
Paolo Torroni
GNN
437
551
0
04 Feb 2019
On Correlation of Features Extracted by Deep Neural Networks
IEEE International Joint Conference on Neural Network (IJCNN), 2019
B. Ayinde
T. Inanc
J. Zurada
185
25
0
30 Jan 2019
Learning Context-Dependent Choice Functions
Karlson Pfannschmidt
Pritha Gupta
Björn Haddenhorst
Eyke Hüllermeier
283
11
0
29 Jan 2019
Activation Adaptation in Neural Networks
Farnoush Farhadi
V. Nia
Andrea Lodi
AI4CE
201
15
0
28 Jan 2019
Trajectory Normalized Gradients for Distributed Optimization
Jianqiao Wangni
Ke Li
Jianbo Shi
Jitendra Malik
125
2
0
24 Jan 2019
Disentangling Video with Independent Prediction
William F. Whitney
Rob Fergus
CML
CoGe
OCL
DRL
100
1
0
17 Jan 2019
Is it Time to Swish? Comparing Deep Learning Activation Functions Across NLP tasks
Steffen Eger
Paul Youssef
Iryna Gurevych
LLMSV
172
81
0
09 Jan 2019
LiSHT: Non-Parametric Linearly Scaled Hyperbolic Tangent Activation Function for Neural Networks
Swalpa Kumar Roy
Suvojit Manna
S. Dubey
B. B. Chaudhuri
272
52
0
01 Jan 2019
Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks
Mohamed Yousef
K. Hussain
U. S. Mohammed
3DV
168
133
0
31 Dec 2018
Supervised Domain Enablement Attention for Personalized Domain Classification
Joo-Kyung Kim
Young-Bum Kim
157
11
0
18 Dec 2018
NIPS - Not Even Wrong? A Systematic Review of Empirically Complete Demonstrations of Algorithmic Effectiveness in the Machine Learning and Artificial Intelligence Literature
Franz J. Király
Bilal A. Mateen
R. Sonabend
192
10
0
18 Dec 2018
Flatten-T Swish: a thresholded ReLU-Swish-like activation function for deep learning
Hock Hung Chieng
Noorhaniza Wahid
P. Ong
Sai Raj Kishore Perla
114
43
0
15 Dec 2018
Evolutionary Neural Architecture Search for Image Restoration
Gerard Jacques van Wyk
Anna Sergeevna Bosman
114
37
0
14 Dec 2018
Guided Dropout
Rohit Keshari
Richa Singh
Mayank Vatsa
BDL
226
37
0
10 Dec 2018
Generalized Batch Normalization: Towards Accelerating Deep Neural Networks
Xiaoyong Yuan
Zheng Feng
Matthew Norton
Xiaolin Li
70
26
0
08 Dec 2018
Attention Boosted Sequential Inference Model
Guanyu Li
Pengfei Zhang
Caiyan Jia
156
3
0
05 Dec 2018
ECC: Platform-Independent Energy-Constrained Deep Neural Network Compression via a Bilinear Regression Model
Haichuan Yang
Yuhao Zhu
Ji Liu
243
43
0
05 Dec 2018
EENMF: An End-to-End Neural Matching Framework for E-Commerce Sponsored Search
Wenjin Wu
Guojun Liu
Hui Ye
Chenshuang Zhang
Tianshu Wu
Daorui Xiao
Jialin Li
Xiaoyu Zhu
335
10
0
04 Dec 2018
SwishNet: A Fast Convolutional Neural Network for Speech, Music and Noise Classification and Segmentation
Md Shamim Hussain
M. A. Haque
123
48
0
01 Dec 2018
The SWAG Algorithm; a Mathematical Approach that Outperforms Traditional Deep Learning. Theory and Implementation
S. Safaei
Vahid Safaei
Solmazi Safaei
Zerotti Woods
H. Arabnia
Juan B. Gutierrez
70
0
0
28 Nov 2018
SOC: hunting the underground inside story of the ethereum Social-network Opinion and Comment
TonTon Hsien-De Huang
Po-Wei Hong
Ying-Tse Lee
Yi-Lun Wang
Chi-Leong Lok
Hung-Yu kao
85
2
0
27 Nov 2018
Neural Non-Stationary Spectral Kernel
Sami Remes
Markus Heinonen
Samuel Kaski
BDL
157
10
0
27 Nov 2018
Driver Behavior Recognition via Interwoven Deep Convolutional Neural Nets with Multi-stream Inputs
IEEE Access (IEEE Access), 2018
Chaoyun Zhang
Rui Li
Woojin Kim
Daesub Yoon
P. Patras
266
54
0
22 Nov 2018
Regularizing by the Variance of the Activations' Sample-Variances
Neural Information Processing Systems (NeurIPS), 2018
Etai Littwin
Lior Wolf
VLM
112
12
0
21 Nov 2018
Unsupervised Multimodal Representation Learning across Medical Images and Reports
T. Hsu
W. Weng
Willie Boag
Matthew B. A. McDermott
Peter Szolovits
SSL
175
37
0
21 Nov 2018
A Deep Neural Network for Unsupervised Anomaly Detection and Diagnosis in Multivariate Time Series Data
AAAI Conference on Artificial Intelligence (AAAI), 2018
Chuxu Zhang
Dongjin Song
Yuncong Chen
Xinyang Feng
C. Lumezanu
Wei Cheng
Jingchao Ni
Bo Zong
Haifeng Chen
Nitesh Chawla
AI4TS
413
829
0
20 Nov 2018
Higher-order Network for Action Recognition
International Conference on Pattern Recognition (ICPR), 2018
Jie Shao
Xiangyang Xue
270
0
0
19 Nov 2018
Deep Determinantal Point Processes
Mike Gartrell
Elvis Dohmatob
Jon Alberdi
174
4
0
17 Nov 2018
SGR: Self-Supervised Spectral Graph Representation Learning
Anton Tsitsulin
Davide Mottin
Panagiotis Karras
A. Bronstein
Emmanuel Müller
SSL
92
6
0
15 Nov 2018
Modality Attention for End-to-End Audio-visual Speech Recognition
Pan Zhou
Wenwen Yang
Wei Chen
Yanfeng Wang
Jia Jia
144
72
0
13 Nov 2018
Activation Functions: Comparison of trends in Practice and Research for Deep Learning
S. Bodenstedt
Dominik Rivoir
A. Gachagan
S. T. Mees
271
1,394
0
08 Nov 2018
Linear Memory Networks
International Conference on Artificial Neural Networks (ICANN), 2018
Xi Chen
Ali Ghadirzadeh
Mårten Björkman
KELM
109
10
0
08 Nov 2018
Quasi-random sampling for multivariate distributions via generative neural networks
Marius Hofert
Avinash Prasad
Mu Zhu
357
17
0
01 Nov 2018
A Streamlined Encoder/Decoder Architecture for Melody Extraction
Tsung-Han Hsieh
Li Su
Yi-Hsuan Yang
178
57
0
30 Oct 2018
MPNA: A Massively-Parallel Neural Array Accelerator with Dataflow Optimization for Convolutional Neural Networks
Muhammad Abdullah Hanif
Rachmad Vidya Wicaksana Putra
Muhammad Tanvir
R. Hafiz
Semeen Rehman
Mohamed Bennai
72
19
0
30 Oct 2018
A Methodology for Automatic Selection of Activation Functions to Design Hybrid Deep Neural Networks
Alberto Marchisio
Muhammad Abdullah Hanif
Semeen Rehman
Maurizio Martina
Mohamed Bennai
133
11
0
27 Oct 2018
Batch Normalization Sampling
Zhaodong Chen
Lei Deng
Guoqi Li
Jiawei Sun
Xing Hu
Xin Ma
Yuan Xie
130
0
0
25 Oct 2018
Single-Image SVBRDF Capture with a Rendering-Aware Deep Network
Valentin Deschaintre
M. Aittala
F. Durand
G. Drettakis
Adrien Bousseau
3DH
176
303
0
23 Oct 2018
GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning
Jacky Liang
Viktor Makoviychuk
Ankur Handa
N. Chentanez
Lukasz Wawrzyniak
Dieter Fox
AI4CE
288
224
0
12 Oct 2018
Weighted Sigmoid Gate Unit for an Activation Function of Deep Neural Network
Masayuki Tanaka
157
59
0
03 Oct 2018
DeepCMB: Lensing Reconstruction of the Cosmic Microwave Background with Deep Neural Networks
J. Caldeira
W. L. K. Wu
Brian D. Nord
Camille Avestruz
Shubhendu Trivedi
K. Story
307
73
0
02 Oct 2018
Unsupervised Emergence of Spatial Structure from Sensorimotor Prediction
Alban Laflaquière
Michael Garcia Ortiz
176
3
0
02 Oct 2018
Aggregation of binary feature descriptors for compact scene model representation in large scale structure-from-motion applications
International Conference on Computer Vision and Graphics (ICCVG), 2018
J. Komorowski
Tomasz Trzciñski
3DPC
3DV
123
0
0
28 Sep 2018
SConE: Siamese Constellation Embedding Descriptor for Image Matching
Tomasz Trzciñski
J. Komorowski
Lukasz Dabala
K. Czarnota
Grzegorz Kurzejamski
Simon Lynen
87
10
0
28 Sep 2018
Dynamical Isometry is Achieved in Residual Networks in a Universal Way for any Activation Function
International Conference on Artificial Intelligence and Statistics (AISTATS), 2018
W. Tarnowski
P. Warchol
Stanislaw Jastrzebski
Jacek Tabor
M. Nowak
192
40
0
24 Sep 2018
Previous
1
2
3
...
15
16
17
18
19
Next