Dropout: Explicit Forms and Capacity Control

International Conference on Machine Learning (ICML), 2020
6 March 2020
R. Arora
Peter L. Bartlett
Poorya Mianjy
Nathan Srebro

Papers citing "Dropout: Explicit Forms and Capacity Control"

31 / 31 papers shown
The Hidden Power of Normalization Layers in Neural Networks: Exponential Capacity Control
Khoat Than
163
0
0
02 Nov 2025
Overlap-Adaptive Regularization for Conditional Average Treatment Effect Estimation
Valentyn Melnychuk
Dennis Frauen
Jonas Schweisthal
Stefan Feuerriegel
CML
207
1
0
29 Sep 2025
LangDAug: Langevin Data Augmentation for Multi-Source Domain Generalization in Medical Image Segmentation
Piyush Tiwary
Kinjawl Bhattacharyya
Prathosh A.P.
MedIm
289
1
0
26 May 2025
Analytic theory of dropout regularization
Physical Review E (Phys. Rev. E), 2025
Francesco Mori
Francesca Mignacco
422
2
0
12 May 2025
Hadamard product in deep learning: Introduction, Advances and Challenges
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Grigorios G. Chrysos
Yongtao Wu
Razvan Pascanu
Philip Torr
Volkan Cevher
AAML
392
21
0
17 Apr 2025
An Overview of Low-Rank Structures in the Training and Adaptation of Large Models
Laura Balzano
Tianjiao Ding
B. Haeffele
Soo Min Kwon
Qing Qu
Peng Wang
Liang Luo
Can Yaras
OffRL
AI4CE
323
5
0
25 Mar 2025
Efficient Stagewise Pretraining via Progressive Subnetworks
Abhishek Panigrahi
Nikunj Saunshi
Kaifeng Lyu
Sobhan Miryoosefi
Sashank J. Reddi
Satyen Kale
Sanjiv Kumar
273
9
0
08 Feb 2024
Dropout Drops Double Descent
Japanese Journal of Statistics and Data Science (JSDS), 2023
Tian-Le Yang
J. Suzuki
329
1
0
25 May 2023
Reweighted Mixup for Subpopulation Shift
Zongbo Han
Zhipeng Liang
Fan Yang
Liu Liu
Lanqing Li
...
P. Zhao
Qinghua Hu
Bing Wu
Changqing Zhang
Jianhua Yao
257
4
0
09 Apr 2023
UMIX: Improving Importance Weighting for Subpopulation Shift via Uncertainty-Aware Mixup
Neural Information Processing Systems (NeurIPS), 2022
Zongbo Han
Zhipeng Liang
Fan Yang
Liu Liu
Lanqing Li
Yatao Bian
P. Zhao
Bing Wu
Changqing Zhang
Jianhua Yao
314
47
0
19 Sep 2022
A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function Perspective
Neural Information Processing Systems (NeurIPS), 2022
Chanwoo Park
Sangdoo Yun
Sanghyuk Chun
AAML
248
41
0
21 Aug 2022
Implicit regularization of dropout
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Zhongwang Zhang
Zhi-Qin John Xu
374
57
0
13 Jul 2022
Boosting Factorization Machines via Saliency-Guided Mixup
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Chenwang Wu
Defu Lian
Yong Ge
Min Zhou
Enhong Chen
Dacheng Tao
226
6
0
17 Jun 2022
Improving the Robustness and Generalization of Deep Neural Network with Confidence Threshold Reduction
Xiangyuan Yang
Jie Lin
Hanlin Zhang
Xinyu Yang
Peng Zhao
AAML
OOD
300
1
0
02 Jun 2022
Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers
International Conference on Machine Learning (ICML), 2022
R. Liu
Young Jin Kim
Alexandre Muzio
Hany Awadalla
MoE
182
30
0
28 May 2022
Towards Size-Independent Generalization Bounds for Deep Operator Nets
Pulkit Gopalani
Sayar Karmakar
Dibyakanti Kumar
Anirbit Mukherjee
AI4CE
277
6
0
23 May 2022
Exact Solutions of a Deep Linear Network
Neural Information Processing Systems (NeurIPS), 2022
Liu Ziyin
Botao Li
Xiangming Meng
ODL
689
26
0
10 Feb 2022
Stochastic Neural Networks with Infinite Width are Deterministic
Liu Ziyin
Hanlin Zhang
Xiangming Meng
Yuting Lu
Eric P. Xing
Masahito Ueda
327
3
0
30 Jan 2022
The Flip Side of the Reweighted Coin: Duality of Adaptive Dropout and Regularization
Neural Information Processing Systems (NeurIPS), 2021
Daniel LeJeune
Hamid Javadi
Richard G. Baraniuk
367
8
0
14 Jun 2021
What training reveals about neural network complexity
Neural Information Processing Systems (NeurIPS), 2021
Andreas Loukas
Marinos Poiitis
Stefanie Jegelka
312
12
0
08 Jun 2021
Meta-Learning with Fewer Tasks through Task Interpolation
International Conference on Learning Representations (ICLR), 2021
Huaxiu Yao
Linjun Zhang
Chelsea Finn
289
66
0
04 Jun 2021
Transformers are Deep Infinite-Dimensional Non-Mercer Binary Kernel Machines
Matthew A. Wright
Joseph E. Gonzalez
282
24
0
02 Jun 2021
Generalization of GANs and overparameterized models under Lipschitz continuity
Khoat Than
Nghia D. Vu
AI4CE
287
2
0
06 Apr 2021
Noisy Recurrent Neural Networks
Neural Information Processing Systems (NeurIPS), 2021
Soon Hoe Lim
N. Benjamin Erichson
Liam Hodgkinson
Michael W. Mahoney
399
68
0
09 Feb 2021
On Convergence and Generalization of Dropout Training
Neural Information Processing Systems (NeurIPS), 2020
Poorya Mianjy
R. Arora
295
33
0
23 Oct 2020
How Does Mixup Help With Robustness and Generalization?
International Conference on Learning Representations (ICLR), 2020
Linjun Zhang
Zhun Deng
Kenji Kawaguchi
Amirata Ghorbani
James Zou
AAML
658
291
0
09 Oct 2020
Explicit Regularisation in Gaussian Noise Injections
Neural Information Processing Systems (NeurIPS), 2020
A. Camuto
M. Willetts
Umut Simsekli
Stephen J. Roberts
Chris Holmes
560
79
0
14 Jul 2020
Shape Matters: Understanding the Implicit Bias of the Noise Covariance
Jeff Z. HaoChen
Colin Wei
Jason D. Lee
Tengyu Ma
685
114
0
15 Jun 2020
Implicit Regularization in Deep Learning May Not Be Explainable by Norms
Noam Razin
Nadav Cohen
375
168
0
13 May 2020
The Implicit and Explicit Regularization Effects of Dropout
International Conference on Machine Learning (ICML), 2020
Colin Wei
Sham Kakade
Tengyu Ma
467
129
0
28 Feb 2020
Implicit Regularization and Convergence for Weight Normalization
Neural Information Processing Systems (NeurIPS), 2019
Xiaoxia Wu
Guang Cheng
Zhaolin Ren
Shanshan Wu
Zhiyuan Li
Suriya Gunasekar
Rachel A. Ward
Qiang Liu
617
26
0
18 Nov 2019