ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.06220
  4. Cited By
Artificial Neural Variability for Deep Learning: On Overfitting, Noise
  Memorization, and Catastrophic Forgetting
v1v2v3 (latest)

Artificial Neural Variability for Deep Learning: On Overfitting, Noise Memorization, and Catastrophic Forgetting

Neural Computation (Neural Comput.), 2020
12 November 2020
Zeke Xie
Fengxiang He
Shaopeng Fu
Issei Sato
Dacheng Tao
Masashi Sugiyama
ArXiv (abs)PDFHTMLGithub (33★)

Papers citing "Artificial Neural Variability for Deep Learning: On Overfitting, Noise Memorization, and Catastrophic Forgetting"

25 / 25 papers shown
On-the-fly Modulation for Balanced Multimodal Learning
On-the-fly Modulation for Balanced Multimodal LearningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Yake Wei
D. Hu
Henghui Du
Ji-Rong Wen
240
31
0
15 Oct 2024
Provably Robust Pre-Trained Ensembles for Biomarker-Based Cancer Classification
Provably Robust Pre-Trained Ensembles for Biomarker-Based Cancer Classification
Chongmin Lee
Jihie Kim
150
0
0
14 Jun 2024
Towards Practical Tool Usage for Continually Learning LLMs
Towards Practical Tool Usage for Continually Learning LLMs
Jerry Huang
Prasanna Parthasarathi
Mehdi Rezagholizadeh
Sarath Chandar
CLLKELM
210
8
0
14 Apr 2024
Neural Field Classifiers via Target Encoding and Classification Loss
Neural Field Classifiers via Target Encoding and Classification Loss
Xindi Yang
Zeke Xie
Xiong Zhou
Boyu Liu
Buhua Liu
Yi Liu
Haoran Wang
Yunfeng Cai
Mingming Sun
182
0
0
02 Mar 2024
Active Neural Mapping
Active Neural MappingIEEE International Conference on Computer Vision (ICCV), 2023
Zike Yan
Haoxiang Yang
H. Zha
349
34
0
30 Aug 2023
S3IM: Stochastic Structural SIMilarity and Its Unreasonable
  Effectiveness for Neural Fields
S3IM: Stochastic Structural SIMilarity and Its Unreasonable Effectiveness for Neural FieldsIEEE International Conference on Computer Vision (ICCV), 2023
Zeke Xie
Xindi Yang
Yujie Yang
Qingyan Sun
Yi Jiang
Haoran Wang
Yunfeng Cai
Mingming Sun
175
50
0
14 Aug 2023
Feature Noise Boosts DNN Generalization under Label Noise
Feature Noise Boosts DNN Generalization under Label NoiseIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Lu Zeng
Xuan Chen
Xiaoshuang Shi
Mengqi Li
MLTNoLa
182
6
0
03 Aug 2023
On the Overlooked Structure of Stochastic Gradients
On the Overlooked Structure of Stochastic GradientsNeural Information Processing Systems (NeurIPS), 2022
Zeke Xie
Qian-Yuan Tang
Mingming Sun
P. Li
297
10
0
05 Dec 2022
SplitNet: Learnable Clean-Noisy Label Splitting for Learning with Noisy
  Labels
SplitNet: Learnable Clean-Noisy Label Splitting for Learning with Noisy LabelsInternational Journal of Computer Vision (IJCV), 2022
Daehwan Kim
Kwang-seok Ryoo
Hansang Cho
Seung Wook Kim
NoLa
266
9
0
20 Nov 2022
Research Trends and Applications of Data Augmentation Algorithms
Research Trends and Applications of Data Augmentation Algorithms
João Fonseca
F. Bação
222
7
0
18 Jul 2022
Sparse Double Descent: Where Network Pruning Aggravates Overfitting
Sparse Double Descent: Where Network Pruning Aggravates OverfittingInternational Conference on Machine Learning (ICML), 2022
Zhengqi He
Zeke Xie
Quanzhi Zhu
Zengchang Qin
255
34
0
17 Jun 2022
Analyzing Lottery Ticket Hypothesis from PAC-Bayesian Theory Perspective
Analyzing Lottery Ticket Hypothesis from PAC-Bayesian Theory PerspectiveNeural Information Processing Systems (NeurIPS), 2022
Keitaro Sakamoto
Issei Sato
233
10
0
15 May 2022
Lifelong Ensemble Learning based on Multiple Representations for
  Few-Shot Object Recognition
Lifelong Ensemble Learning based on Multiple Representations for Few-Shot Object Recognition
Hamidreza Kasaei
Songsong Xiong
228
15
0
04 May 2022
Goldilocks-curriculum Domain Randomization and Fractal Perlin Noise with
  Application to Sim2Real Pneumonia Lesion Detection
Goldilocks-curriculum Domain Randomization and Fractal Perlin Noise with Application to Sim2Real Pneumonia Lesion Detection
Takahiro Suzuki
S. Hanaoka
Issei Sato
OODMedIm
189
1
0
29 Apr 2022
Deep learning, stochastic gradient descent and diffusion maps
Deep learning, stochastic gradient descent and diffusion mapsJournal of Computational Mathematics and Data Science (JCMDS), 2022
Carmina Fjellström
Kaj Nyström
DiffM
232
19
0
04 Apr 2022
Balanced Multimodal Learning via On-the-fly Gradient Modulation
Balanced Multimodal Learning via On-the-fly Gradient ModulationComputer Vision and Pattern Recognition (CVPR), 2022
Xiaokang Peng
Yake Wei
Andong Deng
Dong Wang
Di Hu
320
343
0
29 Mar 2022
Learning with Noisy Labels Revisited: A Study Using Real-World Human
  Annotations
Learning with Noisy Labels Revisited: A Study Using Real-World Human AnnotationsInternational Conference on Learning Representations (ICLR), 2021
Jiaheng Wei
Zhaowei Zhu
Weiran Wang
Tongliang Liu
Gang Niu
Yang Liu
NoLa
385
312
0
22 Oct 2021
Review of Kernel Learning for Intra-Hour Solar Forecasting with Infrared
  Sky Images and Cloud Dynamic Feature Extraction
Review of Kernel Learning for Intra-Hour Solar Forecasting with Infrared Sky Images and Cloud Dynamic Feature ExtractionRenewable & Sustainable Energy Reviews (RSER), 2021
Guillermo Terrén-Serrano
Manel Martínez-Ramón
83
22
0
11 Oct 2021
On the Generalization of Models Trained with SGD: Information-Theoretic
  Bounds and Implications
On the Generalization of Models Trained with SGD: Information-Theoretic Bounds and Implications
Ziqiao Wang
Yongyi Mao
FedMLMLT
306
32
0
07 Oct 2021
The Role of Bio-Inspired Modularity in General Learning
The Role of Bio-Inspired Modularity in General LearningArtificial General Intelligence (AGI), 2021
Rachel A. StClair
W. Hahn
Elan Barenholtz
CLL
169
0
0
23 Sep 2021
TAG: Task-based Accumulated Gradients for Lifelong learning
TAG: Task-based Accumulated Gradients for Lifelong learning
Pranshu Malviya
B. Ravindran
Sarath Chandar
CLL
237
6
0
11 May 2021
Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to
  Improve Generalization
Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve GeneralizationInternational Conference on Machine Learning (ICML), 2021
Zeke Xie
Li-xin Yuan
Zhanxing Zhu
Masashi Sugiyama
240
34
0
31 Mar 2021
On the Overlooked Pitfalls of Weight Decay and How to Mitigate Them: A
  Gradient-Norm Perspective
On the Overlooked Pitfalls of Weight Decay and How to Mitigate Them: A Gradient-Norm PerspectiveNeural Information Processing Systems (NeurIPS), 2020
Zeke Xie
Zhiqiang Xu
Jingzhao Zhang
Issei Sato
Masashi Sugiyama
455
32
0
23 Nov 2020
A Theoretical Analysis of Catastrophic Forgetting through the NTK
  Overlap Matrix
A Theoretical Analysis of Catastrophic Forgetting through the NTK Overlap Matrix
T. Doan
Mehdi Abbana Bennani
Bogdan Mazoure
Guillaume Rabusseau
Pierre Alquier
CLL
457
101
0
07 Oct 2020
Adaptive Inertia: Disentangling the Effects of Adaptive Learning Rate
  and Momentum
Adaptive Inertia: Disentangling the Effects of Adaptive Learning Rate and Momentum
Zeke Xie
Xinrui Wang
Huishuai Zhang
Issei Sato
Masashi Sugiyama
ODL
656
59
0
29 Jun 2020
1
Page 1 of 1