v1v2 (latest)

A Simple Way to Initialize Recurrent Networks of Rectified Linear Units

3 April 2015

Papers citing "A Simple Way to Initialize Recurrent Networks of Rectified Linear Units"

50 / 353 papers shown

Stabilizing RNN Gradients through Pre-training

Luca Herranz-Celotti

Jean Rouat

241

23 Aug 2023

Dynamic Analysis and an Eigen Initializer for Recurrent Neural NetworksIEEE International Joint Conference on Neural Network (IJCNN), 2023

Ran Dou

José C. Príncipe

171

28 Jul 2023

Fading memory as inductive bias in residual recurrent networksNeural Networks (Neural Netw.), 2023

I. Dubinin

Felix Effenberger

223

27 Jul 2023

Long Short-term Memory with Two-Compartment Spiking Neuron

Haizhou Li

Kay Chen Tan

171

14 Jul 2023

Optimum Output Long Short-Term Memory Cell for High-Frequency Trading Forecasting

209

17 Apr 2023

SMPConv: Self-moving Point Representations for Continuous ConvolutionComputer Vision and Pattern Recognition (CVPR), 2023

Sanghyeon Kim

Eunbyung Park

3DPC

201

05 Apr 2023

Resurrecting Recurrent Neural Networks for Long SequencesInternational Conference on Machine Learning (ICML), 2023

Antonio Orvieto

497

420

11 Mar 2023

Convolutional unitary or orthogonal recurrent neural networks

M. Magnasco

124

14 Feb 2023

A Natural Bias for Language Generation ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Wojciech Stokowiec

183

19 Dec 2022

State-Regularized Recurrent Neural Networks to Extract Automata and Explain PredictionsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Cheng Wang

Carolin (Haas) Lawrence

Mathias Niepert

217

10 Dec 2022

Gated Recurrent Neural Networks with Weighted Time-Delay FeedbackInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022

N. Benjamin Erichson

Soon Hoe Lim

Michael W. Mahoney

309

01 Dec 2022

Exploring the Long-Term Generalization of Counting Behavior in RNNs

Pranava Madhyastha

29 Nov 2022

Mohammad Mahmudul Alam

175

23 Nov 2022

On the biological plausibility of orthogonal initialisation for solving gradient instability in deep neural networks

Nikolay Manchev

Michael W. Spratling

ODL

122

27 Oct 2022

On Scrambling Phenomena for Randomly Initialized Recurrent NetworksNeural Information Processing Systems (NeurIPS), 2022

195

11 Oct 2022

Fast Saturating Gate for Learning Long Time Scales with Recurrent Neural NetworksAAAI Conference on Artificial Intelligence (AAAI), 2022

Kentaro Ohno

Sekitoshi Kanai

Yasutoshi Ida

290

04 Oct 2022

Random orthogonal additive filters: a solution to the vanishing/exploding gradient of deep neural networksIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022

Andrea Ceni

ODL

156

03 Oct 2022

An Embarrassingly Simple Approach for Intellectual Property Rights Protection on Recurrent Neural Networks

Zhi Qin Tan

H. P. Wong

Chee Seng Chan

217

03 Oct 2022

Efficient LSTM Training with Eligibility TracesInternational Conference on Artificial Neural Networks (ICANN), 2022

Mitchell L. Hoyer

Shahram Eivazi

S. Otte

30 Sep 2022

Breaking Time Invariance: Assorted-Time Normalization for RNNsNeural Processing Letters (NPL), 2022

Cole Pospisil

Vasily Zadorozhnyy

Qiang Ye

28 Sep 2022

Towards Large-Scale Small Object Detection: Survey and BenchmarksIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Junwei Han

421

529

28 Jul 2022

Privacy-Preserving Federated Recurrent Neural NetworksProceedings on Privacy Enhancing Technologies (PoPETs), 2022

Sinem Sav

Abdulrahman Diaa

Apostolos Pyrgelis

Jean-Philippe Bossuat

Jean-Pierre Hubaux

FedML

248

28 Jul 2022

Explain My Surprise: Learning Efficient Long-Term Memory by Predicting Uncertain OutcomesNeural Information Processing Systems (NeurIPS), 2022

369

27 Jul 2022

Hybrid Facial Expression Recognition (FER2013) Model for Real-Time Emotion Classification and Prediction

O. Oguine

K. J. Oguine

Hashim Ibrahim Bisallah

Daniel Ofuani

CVBM

224

19 Jun 2022

RF-Next: Efficient Receptive Field Search for Convolutional Neural NetworksIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Shanghua Gao

Zhong-Yu Li

Qi Han

Ming-Ming Cheng

Liang Wang

307

14 Jun 2022

Efficient recurrent architectures through activity sparsity and sparse back-propagation through timeInternational Conference on Learning Representations (ICLR), 2022

Anand Subramoney

Khaleelulla Khan Nazeer

Mark Schöne

Christian Mayr

David Kappel

334

13 Jun 2022

Towards a General Purpose CNN for Long Range Dependencies in

N

237

07 Jun 2022

Thalamus: a brain-inspired algorithm for biologically-plausible continual learning and disentangled representationsInternational Conference on Learning Representations (ICLR), 2022

Ali Hummos

CLL

215

24 May 2022

Feedback Gradient Descent: Efficient and Stable Optimization with Orthogonality for DNNsAAAI Conference on Artificial Intelligence (AAAI), 2022

Fanchen Bu

D. Chang

173

12 May 2022

MemFHE: End-to-End Computing with Fully Homomorphic Encryption in MemoryACM Transactions on Embedded Computing Systems (TECS), 2022

Saransh Gupta

Rosario Cammarota

Tajana Simunic

155

26 Apr 2022

Path Development Network with Finite-dimensional Lie Group Representation

Han Lou

Siran Li

Hao Ni

241

02 Apr 2022

What is the best RNN-cell structure for forecasting each time series behavior?Expert systems with applications (ESWA), 2022

15 Mar 2022

NeuroView-RNN: It's About TimeConference on Fairness, Accountability and Transparency (FAccT), 2022

Sina Alemohammad

Richard G. Baraniuk

193

23 Feb 2022

Semantic-based Data Augmentation for Math Word ProblemsInternational Conference on Database Systems for Advanced Applications (DASFAA), 2022

203

07 Jan 2022

Classification of Long Sequential Data using Circular Dilated Convolutional Neural NetworksNeurocomputing (Neurocomputing), 2022

218

06 Jan 2022

Target Propagation via Regularized Inversion

Vincent Roulet

Zaïd Harchaoui

BDL AAML

257

02 Dec 2021

Gradients are Not All You Need

239

102

10 Nov 2021

Recurrent Neural Networks for Learning Long-term Temporal Dependencies with Reanalysis of Time Scale RepresentationInternational Conference on Big Knowledge (ICBK), 2021

Kentaro Ohno

Atsutoshi Kumagai

CLL AI4CE

05 Nov 2021

Cortico-cerebellar networks as decoupling neural interfacesNeural Information Processing Systems (NeurIPS), 2021

179

21 Oct 2021

FlexConv: Continuous Kernel Convolutions with Differentiable Kernel Sizes

468

15 Oct 2021

How Does Momentum Benefit Deep Neural Networks Architecture Design? A Few Case Studies

Bao Wang

Hedi Xia

T. Nguyen

Stanley Osher

AI4CE

171

13 Oct 2021

Orthogonal Graph Neural NetworksAAAI Conference on Artificial Intelligence (AAAI), 2021

205

23 Sep 2021

Oscillatory Fourier Neural Network: A Compact and Efficient Architecture for Sequential Processing

Bing Han

Cheng Wang

Kaushik Roy

124

14 Sep 2021

CAN3D: Fast 3D Medical Image Segmentation via Compact Context Aggregation

234

12 Sep 2021

Acceleration Method for Learning Fine-Layered Optical Neural Networks

K. Aoyama

H. Sawada

150

01 Sep 2021

Working Memory Connections for LSTMNeural Networks (NN), 2021

Lorenzo Baraldi

144

230

31 Aug 2021

Dirichlet Energy Constrained Learning for Deep Graph Neural Networks

Daochen Zha

191

146

06 Jul 2021

Recurrent Neural Network from Adder's Perspective: Carry-lookahead RNNNeural Networks (NN), 2021

Yong Peng

146

22 Jun 2021

RNNs of RNNs: Recursive Construction of Stable Assemblies of Recurrent Neural Networks

L. Kozachkov

Michaela Ennis

Jean-Jacques E. Slotine

304

16 Jun 2021

A Lightweight and Gradient-Stable Neural LayerNeural Networks (NN), 2021

Yueyao Yu

Yin Zhang

503

08 Jun 2021