1507.06228
Cited By
Training Very Deep Networks
22 July 2015
R. Srivastava
Klaus Greff
Jürgen Schmidhuber
Papers citing "Training Very Deep Networks"
50 / 558 papers shown
Title
Hadamard product in deep learning: Introduction, Advances and Challenges
Grigorios G. Chrysos
Yongtao Wu
Razvan Pascanu
Philip Torr
V. Cevher
AAML
98
1
0
17 Apr 2025
PNN: A Novel Progressive Neural Network for Fault Classification in Rotating Machinery under Small Dataset Constraint
Praveen Chopra
Himanshu Kumar
Sandeep Yadav
AI4CE
65
0
0
24 Mar 2025
CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition
Yuhang Wen
Mengyuan Liu
Songtao Wu
Beichen Ding
45
0
0
31 Dec 2024
Bio-xLSTM: Generative modeling, representation and in-context learning of biological and chemical sequences
Niklas Schmidinger
Lisa Schneckenreiter
Philipp Seidl
Johannes Schimunek
Pieter-Jan Hoedt
Johannes Brandstetter
Andreas Mayr
Sohvi Luukkonen
Sepp Hochreiter
Günter Klambauer
MedIm
63
4
0
06 Nov 2024
FACTS: A Factored State-Space Framework For World Modelling
Li Nanbo
Firas Laakom
Yucheng Xu
Wenyi Wang
Jürgen Schmidhuber
AI4TS
211
0
0
28 Oct 2024
A Seesaw Model Attack Algorithm for Distributed Learning
Kun Yang
Tianyi Luo
Yanjie Dong
Aohan Li
AAML
FedML
31
0
0
07 Oct 2024
VulCatch: Enhancing Binary Vulnerability Detection through CodeT5 Decompilation and KAN Advanced Feature Extraction
Abdulrahman Hamman Adama Chukkol
Senlin Luo
Kashif Sharif
Yunusa Haruna
Muhammad Muhammad Abdullahi
33
1
0
13 Aug 2024
Weakly Contrastive Learning via Batch Instance Discrimination and Feature Clustering for Small Sample SAR ATR
Yikui Zhai
Wenlve Zhou
Bing Sun
Jingwen Li
Qirui Ke
...
Junying Gan
Chaoyun Mai
R. D. Labati
Vincenzo Piuri
F. Scotti
35
19
0
07 Aug 2024
The Role of Temporal Hierarchy in Spiking Neural Networks
Filippo Moro
Pau Vilimelis Aceituno
Laura Kriener
Melika Payvand
AI4CE
45
3
0
26 Jul 2024
Highway Networks for Improved Surface Reconstruction: The Role of Residuals and Weight Updates
A. Noorizadegan
Y. C. Hon
D. Young
C. S. Chen
3DV
28
0
0
11 Jul 2024
Stable Weight Updating: A Key to Reliable PDE Solutions Using Deep Learning
A. Noorizadegan
R. Cavoretto
D. Young
C. S. Chen
36
7
0
10 Jul 2024
Strengthening Layer Interaction via Dynamic Layer Attention
Kaishen Wang
Xun Xia
Jian Liu
Zhang Yi
Tao He
25
3
0
19 Jun 2024
Neural Residual Diffusion Models for Deep Scalable Vision Generation
Zhiyuan Ma
Liangliang Zhao
Biqing Qi
Bowen Zhou
DiffM
64
2
0
19 Jun 2024
Depth Anything V2
Lihe Yang
Bingyi Kang
Zilong Huang
Zhen Zhao
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
DiffM
VLM
MDE
59
337
0
13 Jun 2024
Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning
Yuhui Wang
Qingyuan Wu
Weida Li
Dylan R. Ashley
Francesco Faccio
Chao Huang
Jürgen Schmidhuber
AI4CE
26
0
0
12 Jun 2024
Highway Value Iteration Networks
Yuhui Wang
Weida Li
Francesco Faccio
Qingyuan Wu
Jürgen Schmidhuber
45
2
0
05 Jun 2024
Generative adversarial learning with optimal input dimension and its adaptive generator architecture
Zhiyao Tan
Ling Zhou
Huazhen Lin
GAN
42
0
0
06 May 2024
Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey
Guoping Xu
Xiaxia Wang
Xinglong Wu
Xuesong Leng
Yongchao Xu
3DPC
43
8
0
02 May 2024
A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene
Wenbo Zhang
Yifan Zhang
Jianfeng Lin
Binqiang Huang
Jinlu Zhang
Wenhao Yu
VLM
49
2
0
17 Apr 2024
Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights
Moein Heidari
Reza Azad
Sina Ghorbani Kolahi
René Arimond
Leon Niggemeier
...
Afshin Bozorgpour
Ehsan Khodapanah Aghdam
A. Kazerouni
I. Hacihaliloglu
Dorit Merhof
51
7
0
28 Mar 2024
Cell Variational Information Bottleneck Network
Zhonghua Zhai
Chen Ju
Jinsong Lan
Shuai Xiao
32
0
0
22 Mar 2024
Unleashing Network Potentials for Semantic Scene Completion
Fengyun Wang
Qianru Sun
Dong-Ming Zhang
Jinhui Tang
32
4
0
12 Mar 2024
Uncertainty Quantification on Clinical Trial Outcome Prediction
Tianyi Chen
Yingzhou Lu
Nan Hao
Capucine Van Rechem
Jintai Chen
Tianfan Fu
30
21
0
07 Jan 2024
Gradient Flossing: Improving Gradient Descent through Dynamic Control of Jacobians
Rainer Engelken
34
5
0
28 Dec 2023
Spike No More: Stabilizing the Pre-training of Large Language Models
Sho Takase
Shun Kiyono
Sosuke Kobayashi
Jun Suzuki
20
14
0
28 Dec 2023
Parallel Trust-Region Approaches in Neural Network Training: Beyond Traditional Methods
Ken Trotti
Samuel A. Cruz Alegría
Alena Kopanicáková
Rolf Krause
26
0
0
21 Dec 2023
TCNCA: Temporal Convolution Network with Chunked Attention for Scalable Sequence Processing
Aleksandar Terzić
Michael Hersche
G. Karunaratne
Zixiao Huang
Abu Sebastian
Abbas Rahimi
AI4TS
22
1
0
09 Dec 2023
GloNets: Globally Connected Neural Networks
Antonio Di Cecco
C. Metta
M. Fantozzi
F. Morandin
Maurizio Parton
27
2
0
27 Nov 2023
SENetV2: Aggregated dense layer for channelwise and global representations
Mahendran Narayanan
17
22
0
17 Nov 2023
Improved weight initialization for deep and narrow feedforward neural network
Hyunwoo Lee
Yunho Kim
Seungyeop Yang
Hayoung Choi
ODL
30
3
0
07 Nov 2023
Not all layers are equally as important: Every Layer Counts BERT
Lucas Georges Gabriel Charpentier
David Samuel
20
15
0
03 Nov 2023
Power-Enhanced Residual Network for Function Approximation and Physics-Informed Inverse Problems
A. Noorizadegan
D. Young
Benny Y. C. Hon
C. S. Chen
PINN
19
7
0
24 Oct 2023
Make Deep Networks Shallow Again
Bernhard Bermeitinger
T. Hrycej
Siegfried Handschuh
25
0
0
15 Sep 2023
When to Learn What: Model-Adaptive Data Augmentation Curriculum
Chengkai Hou
Jieyu Zhang
Dinesh Manocha
35
15
0
09 Sep 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
36
20
0
27 Aug 2023
Squeeze aggregated excitation network
N. Mahendran
FAtt
8
1
0
25 Aug 2023
Persistent learning signals and working memory without continuous attractors
Il Memming Park
Ábel Ságodi
Piotr Sokól
23
8
0
24 Aug 2023
AAFACE: Attribute-aware Attentional Network for Face Recognition
Niloufar Alipour Talemi
Hossein Kashiani
Sahar Rahimi Malakshan
Mohammad Saeed Ebrahimi Saadabadi
Nima Najafzadeh
Mohammad Akyash
Nasser M. Nasrabadi
CVBM
18
4
0
14 Aug 2023
AVScan2Vec: Feature Learning on Antivirus Scan Data for Production-Scale Malware Corpora
R. Joyce
Tirth Patel
Charles K. Nicholas
Edward Raff
23
4
0
09 Jun 2023
Layer-adaptive Structured Pruning Guided by Latency
Siyuan Pan
Linna Zhang
Jie Zhang
Xiaoshuang Li
Liang Hou
Xiaobing Tu
37
0
0
23 May 2023
Towards More Transparent and Accurate Cancer Diagnosis with an Unsupervised CAE Approach
Zahra Tabatabaei
Adrián Colomer
Javier Oliver Moll
Valery Naranjo
MedIm
42
7
0
19 May 2023
Feed-Forward Optimization With Delayed Feedback for Neural Networks
Katharina Flügel
D. Coquelin
Marie Weiel
Charlotte Debus
Achim Streit
Markus Goetz
AI4CE
40
7
0
26 Apr 2023
Phantom Embeddings: Using Embedding Space for Model Regularization in Deep Neural Networks
Mofassir ul Islam Arif
M. Jameel
Josif Grabocka
Lars Schmidt-Thieme
21
0
0
14 Apr 2023
Variations of Squeeze and Excitation networks
N. Mahendran
19
1
0
11 Apr 2023
Efficient Neural Architecture Search for Emotion Recognition
Monu Verma
Murari Mandal
Satish Kumar Reddy
Yashwanth Reddy Meedimale
Santosh Kumar Vipparthi
CVBM
27
9
0
23 Mar 2023
Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning
Haoyu He
Jianfei Cai
Jing Zhang
Dacheng Tao
Bohan Zhuang
VPVLM
22
50
0
15 Mar 2023
Generalized Video Anomaly Event Detection: Systematic Taxonomy and Comparison of Deep Models
Yang Liu
Dingkang Yang
Yan Wang
Jing Liu
Jun Liu
Azzedine Boukerche
Peng Sun
Liang Song
41
82
0
10 Feb 2023
Neural Control of Parametric Solutions for High-dimensional Evolution PDEs
Nathan Gaby
X. Ye
Haomin Zhou
19
6
0
31 Jan 2023
Improving Reliability of Fine-tuning with Block-wise Optimisation
Basel Barakat
Qiang Huang
19
1
0
15 Jan 2023
Spectral Cross-Domain Neural Network with Soft-adaptive Threshold Spectral Enhancement
Che Liu
Sibo Cheng
Weiping Ding
Rossella Arcucci
42
9
0
10 Jan 2023