v1v2v3 (latest)

Artificial Neural Variability for Deep Learning: On Overfitting, Noise Memorization, and Catastrophic Forgetting

Neural Computation (Neural Comput.), 2020

12 November 2020

Zeke Xie

ArXiv (abs)PDF HTML Github (33★)

Papers citing "Artificial Neural Variability for Deep Learning: On Overfitting, Noise Memorization, and Catastrophic Forgetting"

25 / 25 papers shown

On-the-fly Modulation for Balanced Multimodal LearningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

240

15 Oct 2024

Provably Robust Pre-Trained Ensembles for Biomarker-Based Cancer Classification

Chongmin Lee

Jihie Kim

150

14 Jun 2024

Towards Practical Tool Usage for Continually Learning LLMs

Jerry Huang

Prasanna Parthasarathi

Mehdi Rezagholizadeh

Sarath Chandar

CLL KELM

210

14 Apr 2024

Neural Field Classifiers via Target Encoding and Classification Loss

Zeke Xie

182

02 Mar 2024

Active Neural MappingIEEE International Conference on Computer Vision (ICCV), 2023

Zike Yan

Haoxiang Yang

H. Zha

349

30 Aug 2023

S3IM: Stochastic Structural SIMilarity and Its Unreasonable Effectiveness for Neural FieldsIEEE International Conference on Computer Vision (ICCV), 2023

Zeke Xie

175

14 Aug 2023

Feature Noise Boosts DNN Generalization under Label NoiseIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023

182

03 Aug 2023

On the Overlooked Structure of Stochastic GradientsNeural Information Processing Systems (NeurIPS), 2022

Zeke Xie

Qian-Yuan Tang

Mingming Sun

P. Li

297

05 Dec 2022

SplitNet: Learnable Clean-Noisy Label Splitting for Learning with Noisy LabelsInternational Journal of Computer Vision (IJCV), 2022

266

20 Nov 2022

Research Trends and Applications of Data Augmentation Algorithms

João Fonseca

F. Bação

222

18 Jul 2022

Sparse Double Descent: Where Network Pruning Aggravates OverfittingInternational Conference on Machine Learning (ICML), 2022

Zhengqi He

Zeke Xie

Quanzhi Zhu

Zengchang Qin

255

17 Jun 2022

Analyzing Lottery Ticket Hypothesis from PAC-Bayesian Theory PerspectiveNeural Information Processing Systems (NeurIPS), 2022

Keitaro Sakamoto

Issei Sato

233

15 May 2022

Lifelong Ensemble Learning based on Multiple Representations for Few-Shot Object Recognition

Hamidreza Kasaei

Songsong Xiong

228

04 May 2022

Goldilocks-curriculum Domain Randomization and Fractal Perlin Noise with Application to Sim2Real Pneumonia Lesion Detection

189

29 Apr 2022

Deep learning, stochastic gradient descent and diffusion mapsJournal of Computational Mathematics and Data Science (JCMDS), 2022

Carmina Fjellström

Kaj Nyström

DiffM

232

04 Apr 2022

Balanced Multimodal Learning via On-the-fly Gradient ModulationComputer Vision and Pattern Recognition (CVPR), 2022

320

343

29 Mar 2022

Learning with Noisy Labels Revisited: A Study Using Real-World Human AnnotationsInternational Conference on Learning Representations (ICLR), 2021

Zhaowei Zhu

Yang Liu

385

312

22 Oct 2021

Review of Kernel Learning for Intra-Hour Solar Forecasting with Infrared Sky Images and Cloud Dynamic Feature ExtractionRenewable & Sustainable Energy Reviews (RSER), 2021

Guillermo Terrén-Serrano

Manel Martínez-Ramón

11 Oct 2021

On the Generalization of Models Trained with SGD: Information-Theoretic Bounds and Implications

Ziqiao Wang

Yongyi Mao

FedML MLT

306

07 Oct 2021

The Role of Bio-Inspired Modularity in General LearningArtificial General Intelligence (AGI), 2021

169

23 Sep 2021

TAG: Task-based Accumulated Gradients for Lifelong learning

237

11 May 2021

Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve GeneralizationInternational Conference on Machine Learning (ICML), 2021

Zeke Xie

Li-xin Yuan

Zhanxing Zhu

Masashi Sugiyama

240

31 Mar 2021

On the Overlooked Pitfalls of Weight Decay and How to Mitigate Them: A Gradient-Norm PerspectiveNeural Information Processing Systems (NeurIPS), 2020

Zeke Xie

Zhiqiang Xu

Jingzhao Zhang

Issei Sato

Masashi Sugiyama

455

23 Nov 2020

A Theoretical Analysis of Catastrophic Forgetting through the NTK Overlap Matrix

457

101

07 Oct 2020

Adaptive Inertia: Disentangling the Effects of Adaptive Learning Rate and Momentum

Zeke Xie

656

29 Jun 2020