ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1603.09382
  4. Cited By
Deep Networks with Stochastic Depth

Deep Networks with Stochastic Depth

30 March 2016
Gao Huang
Yu Sun
Zhuang Liu
Daniel Sedra
Kilian Q. Weinberger
ArXivPDFHTML

Papers citing "Deep Networks with Stochastic Depth"

50 / 337 papers shown
Title
ResIST: Layer-Wise Decomposition of ResNets for Distributed Training
ResIST: Layer-Wise Decomposition of ResNets for Distributed Training
Chen Dun
Cameron R. Wolfe
C. Jermaine
Anastasios Kyrillidis
16
21
0
02 Jul 2021
AutoFormer: Searching Transformers for Visual Recognition
AutoFormer: Searching Transformers for Visual Recognition
Minghao Chen
Houwen Peng
Jianlong Fu
Haibin Ling
ViT
36
259
0
01 Jul 2021
Attention Bottlenecks for Multimodal Fusion
Attention Bottlenecks for Multimodal Fusion
Arsha Nagrani
Shan Yang
Anurag Arnab
A. Jansen
Cordelia Schmid
Chen Sun
25
539
0
30 Jun 2021
Simple Training Strategies and Model Scaling for Object Detection
Simple Training Strategies and Model Scaling for Object Detection
Xianzhi Du
Barret Zoph
Wei-Chih Hung
Tsung-Yi Lin
ObjD
31
40
0
30 Jun 2021
Vision Permutator: A Permutable MLP-Like Architecture for Visual
  Recognition
Vision Permutator: A Permutable MLP-Like Architecture for Visual Recognition
Qibin Hou
Zihang Jiang
Li-xin Yuan
Mingg-Ming Cheng
Shuicheng Yan
Jiashi Feng
ViT
MLLM
24
205
0
23 Jun 2021
Recent Deep Semi-supervised Learning Approaches and Related Works
Recent Deep Semi-supervised Learning Approaches and Related Works
Gyeongho Kim
SSL
13
10
0
22 Jun 2021
GAIA: A Transfer Learning System of Object Detection that Fits Your
  Needs
GAIA: A Transfer Learning System of Object Detection that Fits Your Needs
Xingyuan Bu
Junran Peng
Junjie Yan
T. Tan
Zhaoxiang Zhang
ObjD
VLM
15
53
0
21 Jun 2021
Stateful ODE-Nets using Basis Function Expansions
Stateful ODE-Nets using Basis Function Expansions
A. Queiruga
N. Benjamin Erichson
Liam Hodgkinson
Michael W. Mahoney
19
16
0
21 Jun 2021
How to train your ViT? Data, Augmentation, and Regularization in Vision
  Transformers
How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers
Andreas Steiner
Alexander Kolesnikov
Xiaohua Zhai
Ross Wightman
Jakob Uszkoreit
Lucas Beyer
ViT
34
613
0
18 Jun 2021
DeepLab2: A TensorFlow Library for Deep Labeling
DeepLab2: A TensorFlow Library for Deep Labeling
Mark Weber
Huiyu Wang
Siyuan Qiao
Jun Xie
Maxwell D. Collins
...
Laura Leal-Taixe
Alan Yuille
Florian Schroff
Hartwig Adam
Liang-Chieh Chen
VLM
20
45
0
17 Jun 2021
Layer Folding: Neural Network Depth Reduction using Activation
  Linearization
Layer Folding: Neural Network Depth Reduction using Activation Linearization
Amir Ben Dror
Niv Zehngut
Avraham Raviv
E. Artyomov
Ran Vitek
R. Jevnisek
13
20
0
17 Jun 2021
BEiT: BERT Pre-Training of Image Transformers
BEiT: BERT Pre-Training of Image Transformers
Hangbo Bao
Li Dong
Songhao Piao
Furu Wei
ViT
12
2,742
0
15 Jun 2021
CAT: Cross Attention in Vision Transformer
CAT: Cross Attention in Vision Transformer
Hezheng Lin
Xingyi Cheng
Xiangyu Wu
Fan Yang
Dong Shen
Zhongyuan Wang
Qing Song
Wei Yuan
ViT
27
149
0
10 Jun 2021
CoAtNet: Marrying Convolution and Attention for All Data Sizes
CoAtNet: Marrying Convolution and Attention for All Data Sizes
Zihang Dai
Hanxiao Liu
Quoc V. Le
Mingxing Tan
ViT
22
1,167
0
09 Jun 2021
Robust Mutual Learning for Semi-supervised Semantic Segmentation
Robust Mutual Learning for Semi-supervised Semantic Segmentation
Pan Zhang
Bo Zhang
Ting Zhang
Dong Chen
Fang Wen
17
17
0
01 Jun 2021
ResT: An Efficient Transformer for Visual Recognition
ResT: An Efficient Transformer for Visual Recognition
Qing-Long Zhang
Yubin Yang
ViT
8
229
0
28 May 2021
Scaling Properties of Deep Residual Networks
Scaling Properties of Deep Residual Networks
A. Cohen
R. Cont
Alain Rossier
Renyuan Xu
11
18
0
25 May 2021
Pay Attention to MLPs
Pay Attention to MLPs
Hanxiao Liu
Zihang Dai
David R. So
Quoc V. Le
AI4CE
17
651
0
17 May 2021
Meta-Cal: Well-controlled Post-hoc Calibration by Ranking
Meta-Cal: Well-controlled Post-hoc Calibration by Ranking
Xingchen Ma
Matthew B. Blaschko
10
34
0
10 May 2021
Conformer: Local Features Coupling Global Representations for Visual
  Recognition
Conformer: Local Features Coupling Global Representations for Visual Recognition
Zhiliang Peng
Wei Huang
Shanzhi Gu
Lingxi Xie
Yaowei Wang
Jianbin Jiao
QiXiang Ye
ViT
13
527
0
09 May 2021
ResMLP: Feedforward networks for image classification with
  data-efficient training
ResMLP: Feedforward networks for image classification with data-efficient training
Hugo Touvron
Piotr Bojanowski
Mathilde Caron
Matthieu Cord
Alaaeldin El-Nouby
...
Gautier Izacard
Armand Joulin
Gabriel Synnaeve
Jakob Verbeek
Hervé Jégou
VLM
16
654
0
07 May 2021
MLP-Mixer: An all-MLP Architecture for Vision
MLP-Mixer: An all-MLP Architecture for Vision
Ilya O. Tolstikhin
N. Houlsby
Alexander Kolesnikov
Lucas Beyer
Xiaohua Zhai
...
Andreas Steiner
Daniel Keysers
Jakob Uszkoreit
Mario Lucic
Alexey Dosovitskiy
239
2,600
0
04 May 2021
Single-Training Collaborative Object Detectors Adaptive to Bandwidth and
  Computation
Single-Training Collaborative Object Detectors Adaptive to Bandwidth and Computation
Juliano S. Assine
José Cândido Silveira Santos Filho
Eduardo Valle
ObjD
42
8
0
03 May 2021
Vision Transformers with Patch Diversification
Vision Transformers with Patch Diversification
Chengyue Gong
Dilin Wang
Meng Li
Vikas Chandra
Qiang Liu
ViT
37
62
0
26 Apr 2021
Visformer: The Vision-friendly Transformer
Visformer: The Vision-friendly Transformer
Zhengsu Chen
Lingxi Xie
Jianwei Niu
Xuefeng Liu
Longhui Wei
Qi Tian
ViT
109
209
0
26 Apr 2021
Multiscale Vision Transformers
Multiscale Vision Transformers
Haoqi Fan
Bo Xiong
K. Mangalam
Yanghao Li
Zhicheng Yan
Jitendra Malik
Christoph Feichtenhofer
ViT
19
1,219
0
22 Apr 2021
All Tokens Matter: Token Labeling for Training Better Vision
  Transformers
All Tokens Matter: Token Labeling for Training Better Vision Transformers
Zihang Jiang
Qibin Hou
Li-xin Yuan
Daquan Zhou
Yujun Shi
Xiaojie Jin
Anran Wang
Jiashi Feng
ViT
12
203
0
22 Apr 2021
Towards Self-Adaptive Metric Learning On the Fly
Towards Self-Adaptive Metric Learning On the Fly
Y. Gao
Yifan Li
Swarup Chandra
Latifur Khan
B. Thuraisingham
11
20
0
03 Apr 2021
Going deeper with Image Transformers
Going deeper with Image Transformers
Hugo Touvron
Matthieu Cord
Alexandre Sablayrolles
Gabriel Synnaeve
Hervé Jégou
ViT
23
986
0
31 Mar 2021
ViViT: A Video Vision Transformer
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
30
2,085
0
29 Mar 2021
Scaling Local Self-Attention for Parameter Efficient Visual Backbones
Scaling Local Self-Attention for Parameter Efficient Visual Backbones
Ashish Vaswani
Prajit Ramachandran
A. Srinivas
Niki Parmar
Blake A. Hechtman
Jonathon Shlens
16
395
0
23 Mar 2021
3D Human Pose Estimation with Spatial and Temporal Transformers
3D Human Pose Estimation with Spatial and Temporal Transformers
Ce Zheng
Sijie Zhu
Matías Mendieta
Taojiannan Yang
C. L. P. Chen
Zhengming Ding
ViT
39
437
0
18 Mar 2021
Revisiting ResNets: Improved Training and Scaling Strategies
Revisiting ResNets: Improved Training and Scaling Strategies
Irwan Bello
W. Fedus
Xianzhi Du
E. D. Cubuk
A. Srinivas
Tsung-Yi Lin
Jonathon Shlens
Barret Zoph
27
297
0
13 Mar 2021
Experiments with Rich Regime Training for Deep Learning
Experiments with Rich Regime Training for Deep Learning
Xinyan Li
A. Banerjee
16
2
0
26 Feb 2021
CReST: A Class-Rebalancing Self-Training Framework for Imbalanced
  Semi-Supervised Learning
CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning
Chen Wei
Kihyuk Sohn
Clayton Mellina
Alan Yuille
Fan Yang
CLL
26
255
0
18 Feb 2021
LambdaNetworks: Modeling Long-Range Interactions Without Attention
LambdaNetworks: Modeling Long-Range Interactions Without Attention
Irwan Bello
260
179
0
17 Feb 2021
High-Performance Large-Scale Image Recognition Without Normalization
High-Performance Large-Scale Image Recognition Without Normalization
Andrew Brock
Soham De
Samuel L. Smith
Karen Simonyan
VLM
223
512
0
11 Feb 2021
Self-supervised driven consistency training for annotation efficient
  histopathology image analysis
Self-supervised driven consistency training for annotation efficient histopathology image analysis
C. Srinidhi
Seung Wook Kim
Fu-Der Chen
Anne L. Martel
SSL
11
109
0
07 Feb 2021
An Efficient Transformer Decoder with Compressed Sub-layers
An Efficient Transformer Decoder with Compressed Sub-layers
Yanyang Li
Ye Lin
Tong Xiao
Jingbo Zhu
17
29
0
03 Jan 2021
Learning Light-Weight Translation Models from Deep Transformer
Learning Light-Weight Translation Models from Deep Transformer
Bei Li
Ziyang Wang
Hui Liu
Quan Du
Tong Xiao
Chunliang Zhang
Jingbo Zhu
VLM
112
40
0
27 Dec 2020
Scaling Wide Residual Networks for Panoptic Segmentation
Scaling Wide Residual Networks for Panoptic Segmentation
Liang-Chieh Chen
Huiyu Wang
Siyuan Qiao
SSeg
14
47
0
23 Nov 2020
FP-NAS: Fast Probabilistic Neural Architecture Search
FP-NAS: Fast Probabilistic Neural Architecture Search
Zhicheng Yan
Xiaoliang Dai
Peizhao Zhang
Yuandong Tian
Bichen Wu
Matt Feiszli
11
23
0
22 Nov 2020
Learning Loss for Test-Time Augmentation
Learning Loss for Test-Time Augmentation
Ildoo Kim
Younghoon Kim
Sungwoong Kim
OOD
18
90
0
22 Oct 2020
Combining Ensembles and Data Augmentation can Harm your Calibration
Combining Ensembles and Data Augmentation can Harm your Calibration
Yeming Wen
Ghassen Jerfel
Rafael Muller
Michael W. Dusenberry
Jasper Snoek
Balaji Lakshminarayanan
Dustin Tran
UQCV
24
63
0
19 Oct 2020
Review and Comparison of Commonly Used Activation Functions for Deep
  Neural Networks
Review and Comparison of Commonly Used Activation Functions for Deep Neural Networks
Tomasz Szandała
54
274
0
15 Oct 2020
Densely Guided Knowledge Distillation using Multiple Teacher Assistants
Densely Guided Knowledge Distillation using Multiple Teacher Assistants
Wonchul Son
Jaemin Na
Junyong Choi
Wonjun Hwang
20
110
0
18 Sep 2020
Wasserstein Routed Capsule Networks
Wasserstein Routed Capsule Networks
Alexander Fuchs
Franz Pernkopf
14
7
0
22 Jul 2020
Diverse Ensembles Improve Calibration
Diverse Ensembles Improve Calibration
Asa Cooper Stickland
Iain Murray
UQCV
FedML
14
26
0
08 Jul 2020
Enabling On-Device CNN Training by Self-Supervised Instance Filtering
  and Error Map Pruning
Enabling On-Device CNN Training by Self-Supervised Instance Filtering and Error Map Pruning
Yawen Wu
Zhepeng Wang
Yiyu Shi
J. Hu
14
43
0
07 Jul 2020
Surrogate-assisted Particle Swarm Optimisation for Evolving
  Variable-length Transferable Blocks for Image Classification
Surrogate-assisted Particle Swarm Optimisation for Evolving Variable-length Transferable Blocks for Image Classification
Bin Wang
Bing Xue
Mengjie Zhang
9
53
0
03 Jul 2020
Previous
1234567
Next