ResearchTrend.AI
Training Very Deep Networks

22 July 2015
R. Srivastava
Klaus Greff
Jürgen Schmidhuber

Papers citing "Training Very Deep Networks"

50 / 559 papers shown
Hadamax Encoding: Elevating Performance in Model-Free Atari
Jacob E. Kooi
Zhao Yang
Vincent François-Lavet
12
0
0
21 May 2025
Hadamard product in deep learning: Introduction, Advances and Challenges
Grigorios G. Chrysos
Yongtao Wu
Razvan Pascanu
Philip Torr
V. Cevher
AAML
98
1
0
17 Apr 2025
PNN: A Novel Progressive Neural Network for Fault Classification in Rotating Machinery under Small Dataset Constraint
Praveen Chopra
Himanshu Kumar
Sandeep Yadav
AI4CE
65
0
0
24 Mar 2025
CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition
Yuhang Wen
Mengyuan Liu
Songtao Wu
Beichen Ding
47
0
0
31 Dec 2024
Bio-xLSTM: Generative modeling, representation and in-context learning of biological and chemical sequences
Niklas Schmidinger
Lisa Schneckenreiter
Philipp Seidl
Johannes Schimunek
Pieter-Jan Hoedt
Johannes Brandstetter
Andreas Mayr
Sohvi Luukkonen
Sepp Hochreiter
Günter Klambauer
MedIm
65
4
0
06 Nov 2024
FACTS: A Factored State-Space Framework For World Modelling
Li Nanbo
Firas Laakom
Yucheng Xu
Wenyi Wang
Jürgen Schmidhuber
AI4TS
223
0
0
28 Oct 2024
A Seesaw Model Attack Algorithm for Distributed Learning
Kun Yang
Tianyi Luo
Yanjie Dong
Aohan Li
AAML
FedML
31
0
0
07 Oct 2024
VulCatch: Enhancing Binary Vulnerability Detection through CodeT5 Decompilation and KAN Advanced Feature Extraction
Abdulrahman Hamman Adama Chukkol
Senlin Luo
Kashif Sharif
Yunusa Haruna
Muhammad Muhammad Abdullahi
33
1
0
13 Aug 2024
Weakly Contrastive Learning via Batch Instance Discrimination and Feature Clustering for Small Sample SAR ATR
Yikui Zhai
Wenlve Zhou
Bing Sun
Jingwen Li
Qirui Ke
...
Junying Gan
Chaoyun Mai
R. D. Labati
Vincenzo Piuri
F. Scotti
35
19
0
07 Aug 2024
The Role of Temporal Hierarchy in Spiking Neural Networks
Filippo Moro
Pau Vilimelis Aceituno
Laura Kriener
Melika Payvand
AI4CE
45
3
0
26 Jul 2024
Highway Networks for Improved Surface Reconstruction: The Role of Residuals and Weight Updates
A. Noorizadegan
Y. C. Hon
D. Young
C. S. Chen
3DV
28
0
0
11 Jul 2024
Stable Weight Updating: A Key to Reliable PDE Solutions Using Deep Learning
A. Noorizadegan
R. Cavoretto
D. Young
C. S. Chen
36
7
0
10 Jul 2024
Strengthening Layer Interaction via Dynamic Layer Attention
Kaishen Wang
Xun Xia
Jian Liu
Zhang Yi
Tao He
25
3
0
19 Jun 2024
Neural Residual Diffusion Models for Deep Scalable Vision Generation
Zhiyuan Ma
Liangliang Zhao
Biqing Qi
Bowen Zhou
DiffM
64
2
0
19 Jun 2024
Depth Anything V2
Lihe Yang
Bingyi Kang
Zilong Huang
Zhen Zhao
Xiaogang Xu
Jiashi Feng
Hengshuang Zhao
DiffM
VLM
MDE
59
337
0
13 Jun 2024
Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning
Yuhui Wang
Qingyuan Wu
Weida Li
Dylan R. Ashley
Francesco Faccio
Chao Huang
Jürgen Schmidhuber
AI4CE
26
0
0
12 Jun 2024
Highway Value Iteration Networks
Yuhui Wang
Weida Li
Francesco Faccio
Qingyuan Wu
Jürgen Schmidhuber
45
2
0
05 Jun 2024
Generative adversarial learning with optimal input dimension and its adaptive generator architecture
Zhiyao Tan
Ling Zhou
Huazhen Lin
GAN
42
0
0
06 May 2024
Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey
Guoping Xu
Xiaxia Wang
Xinglong Wu
Xuesong Leng
Yongchao Xu
3DPC
43
8
0
02 May 2024
A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene
Wenbo Zhang
Yifan Zhang
Jianfeng Lin
Binqiang Huang
Jinlu Zhang
Wenhao Yu
VLM
49
2
0
17 Apr 2024
Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights
Moein Heidari
Reza Azad
Sina Ghorbani Kolahi
René Arimond
Leon Niggemeier
...
Afshin Bozorgpour
Ehsan Khodapanah Aghdam
A. Kazerouni
I. Hacihaliloglu
Dorit Merhof
51
7
0
28 Mar 2024
Cell Variational Information Bottleneck Network
Zhonghua Zhai
Chen Ju
Jinsong Lan
Shuai Xiao
32
0
0
22 Mar 2024
Unleashing Network Potentials for Semantic Scene Completion
Fengyun Wang
Qianru Sun
Dong-Ming Zhang
Jinhui Tang
32
4
0
12 Mar 2024
Uncertainty Quantification on Clinical Trial Outcome Prediction
Tianyi Chen
Yingzhou Lu
Nan Hao
Capucine Van Rechem
Jintai Chen
Tianfan Fu
30
21
0
07 Jan 2024
Gradient Flossing: Improving Gradient Descent through Dynamic Control of Jacobians
Rainer Engelken
34
5
0
28 Dec 2023
Spike No More: Stabilizing the Pre-training of Large Language Models
Sho Takase
Shun Kiyono
Sosuke Kobayashi
Jun Suzuki
20
14
0
28 Dec 2023
Parallel Trust-Region Approaches in Neural Network Training: Beyond Traditional Methods
Ken Trotti
Samuel A. Cruz Alegría
Alena Kopanicáková
Rolf Krause
26
0
0
21 Dec 2023
TCNCA: Temporal Convolution Network with Chunked Attention for Scalable Sequence Processing
Aleksandar Terzić
Michael Hersche
G. Karunaratne
Zixiao Huang
Abu Sebastian
Abbas Rahimi
AI4TS
22
1
0
09 Dec 2023
GloNets: Globally Connected Neural Networks
Antonio Di Cecco
C. Metta
M. Fantozzi
F. Morandin
Maurizio Parton
30
2
0
27 Nov 2023
SENetV2: Aggregated dense layer for channelwise and global representations
Mahendran Narayanan
17
22
0
17 Nov 2023
Improved weight initialization for deep and narrow feedforward neural network
Hyunwoo Lee
Yunho Kim
Seungyeop Yang
Hayoung Choi
ODL
30
3
0
07 Nov 2023
Not all layers are equally as important: Every Layer Counts BERT
Lucas Georges Gabriel Charpentier
David Samuel
20
15
0
03 Nov 2023
Power-Enhanced Residual Network for Function Approximation and Physics-Informed Inverse Problems
A. Noorizadegan
D. Young
Benny Y. C. Hon
C. S. Chen
PINN
22
7
0
24 Oct 2023
Make Deep Networks Shallow Again
Bernhard Bermeitinger
T. Hrycej
Siegfried Handschuh
25
0
0
15 Sep 2023
When to Learn What: Model-Adaptive Data Augmentation Curriculum
Chengkai Hou
Jieyu Zhang
Dinesh Manocha
37
15
0
09 Sep 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
36
20
0
27 Aug 2023
Squeeze aggregated excitation network
N. Mahendran
FAtt
8
1
0
25 Aug 2023
Persistent learning signals and working memory without continuous attractors
Il Memming Park
Ábel Ságodi
Piotr Sokól
23
8
0
24 Aug 2023
AAFACE: Attribute-aware Attentional Network for Face Recognition
Niloufar Alipour Talemi
Hossein Kashiani
Sahar Rahimi Malakshan
Mohammad Saeed Ebrahimi Saadabadi
Nima Najafzadeh
Mohammad Akyash
Nasser M. Nasrabadi
CVBM
18
4
0
14 Aug 2023
AVScan2Vec: Feature Learning on Antivirus Scan Data for Production-Scale Malware Corpora
R. Joyce
Tirth Patel
Charles K. Nicholas
Edward Raff
23
4
0
09 Jun 2023
Layer-adaptive Structured Pruning Guided by Latency
Siyuan Pan
Linna Zhang
Jie Zhang
Xiaoshuang Li
Liang Hou
Xiaobing Tu
37
0
0
23 May 2023
Towards More Transparent and Accurate Cancer Diagnosis with an Unsupervised CAE Approach
Zahra Tabatabaei
Adrián Colomer
Javier Oliver Moll
Valery Naranjo
MedIm
42
7
0
19 May 2023
Feed-Forward Optimization With Delayed Feedback for Neural Networks
Katharina Flügel
D. Coquelin
Marie Weiel
Charlotte Debus
Achim Streit
Markus Goetz
AI4CE
40
7
0
26 Apr 2023
Phantom Embeddings: Using Embedding Space for Model Regularization in Deep Neural Networks
Mofassir ul Islam Arif
M. Jameel
Josif Grabocka
Lars Schmidt-Thieme
21
0
0
14 Apr 2023
Variations of Squeeze and Excitation networks
N. Mahendran
19
1
0
11 Apr 2023
Efficient Neural Architecture Search for Emotion Recognition
Monu Verma
Murari Mandal
Satish Kumar Reddy
Yashwanth Reddy Meedimale
Santosh Kumar Vipparthi
CVBM
27
9
0
23 Mar 2023
Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning
Haoyu He
Jianfei Cai
Jing Zhang
Dacheng Tao
Bohan Zhuang
VPVLM
22
50
0
15 Mar 2023
Generalized Video Anomaly Event Detection: Systematic Taxonomy and Comparison of Deep Models
Yang Liu
Dingkang Yang
Yan Wang
Jing Liu
Jun Liu
Azzedine Boukerche
Peng Sun
Liang Song
41
82
0
10 Feb 2023
Neural Control of Parametric Solutions for High-dimensional Evolution PDEs
Nathan Gaby
X. Ye
Haomin Zhou
19
6
0
31 Jan 2023
Improving Reliability of Fine-tuning with Block-wise Optimisation
Basel Barakat
Qiang Huang
21
1
0
15 Jan 2023