Multiple Descent: Design Your Own Generalization Curve

3 August 2020
Lin Chen, Yifei Min, M. Belkin, Amin Karbasi
arXiv: 2008.01036

Papers citing "Multiple Descent: Design Your Own Generalization Curve"

45 papers
A dynamic view of some anomalous phenomena in SGD
Vivek Shripad Borkar
03 May 2025
Investigating the Impact of Model Complexity in Large Language Models
Jing Luo, Huiyuan Wang, Weiran Huang
01 Oct 2024
Understanding the Double Descent Phenomenon in Deep Learning
Marc Lafon, Alexandre Thomas
15 Mar 2024
A PAC-Bayesian Perspective on the Interpolating Information Criterion
Liam Hodgkinson, Christopher van der Heide, Roberto Salomone, Fred Roosta, Michael W. Mahoney
13 Nov 2023
Transgressing the boundaries: towards a rigorous understanding of deep learning and its (non-)robustness
C. Hartmann, Lorenz Richter
05 Jul 2023
Predicting Grokking Long Before it Happens: A look into the loss landscape of models which grok
Pascal Junior Tikeng Notsawo, Hattie Zhou, Mohammad Pezeshki, Irina Rish, G. Dumas
23 Jun 2023
Dropout Drops Double Descent
Japanese Journal of Statistics and Data Science (JSDS), 2023
Tianbao Yang, J. Suzuki
25 May 2023
Least Squares Regression Can Exhibit Under-Parameterized Double Descent
Neural Information Processing Systems (NeurIPS), 2023
Xinyue Li, Rishi Sonthalia
24 May 2023
Double Descent Demystified: Identifying, Interpreting & Ablating the Sources of a Deep Learning Puzzle
Rylan Schaeffer, Mikail Khona, Zachary Robertson, Akhilan Boopathy, Kateryna Pistunova, J. Rocks, Ila Rani Fiete, Oluwasanmi Koyejo
24 Mar 2023
Extrapolated cross-validation for randomized ensembles
Journal of Computational and Graphical Statistics (JCGS), 2023
Jin-Hong Du, Pratik V. Patil, Kathryn Roeder, Arun K. Kuchibhotla
27 Feb 2023
Finding Regularized Competitive Equilibria of Heterogeneous Agent Macroeconomic Models with Reinforcement Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Ruitu Xu, Yifei Min, Tianhao Wang, Zhaoran Wang, Michael I. Jordan, Zhuoran Yang
24 Feb 2023
Deep networks for system identification: a Survey
G. Pillonetto, Aleksandr Aravkin, Daniel Gedon, L. Ljung, Antônio H. Ribeiro, Thomas B. Schön
30 Jan 2023
Homophily modulates double descent generalization in graph convolution networks
Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2022
Chengzhi Shi, Liming Pan, Hong Hu, Ivan Dokmanić
26 Dec 2022
Gradient flow in the gaussian covariate model: exact solution of learning curves and multiple descent structures
Antoine Bodin, N. Macris
13 Dec 2022
Regularization Trade-offs with Fake Features
European Signal Processing Conference (EUSIPCO), 2022
Martin Hellkvist, Ayça Özçelikkale, Anders Ahlén
01 Dec 2022
A Non-Asymptotic Moreau Envelope Theory for High-Dimensional Generalized Linear Models
Neural Information Processing Systems (NeurIPS), 2022
Lijia Zhou, Frederic Koehler, Pragya Sur, Danica J. Sutherland, Nathan Srebro
21 Oct 2022
Second-order regression models exhibit progressive sharpening to the edge of stability
International Conference on Machine Learning (ICML), 2022
Atish Agarwala, Fabian Pedregosa, Jeffrey Pennington
10 Oct 2022
Omnigrok: Grokking Beyond Algorithmic Data
International Conference on Learning Representations (ICLR), 2022
Ziming Liu, Eric J. Michaud, Max Tegmark
03 Oct 2022
Multiple Descent in the Multiple Random Feature Model
Journal of Machine Learning Research (JMLR), 2022
Xuran Meng, Jianfeng Yao, Yuan Cao
21 Aug 2022
Information bottleneck theory of high-dimensional regression: relevancy, efficiency and optimality
Neural Information Processing Systems (NeurIPS), 2022
Wave Ngampruetikorn, David J. Schwab
08 Aug 2022
The BUTTER Zone: An Empirical Study of Training Dynamics in Fully Connected Neural Networks
Charles Edison Tripp, J. Perr-Sauer, L. Hayne, M. Lunacek, Jamil Gafur
25 Jul 2022
Improving Students' Academic Performance with AI and Semantic Technologies
Yi-Hua Cheng
02 May 2022
Cascaded Gaps: Towards Gap-Dependent Regret for Risk-Sensitive Reinforcement Learning
Yingjie Fei, Ruitu Xu
07 Mar 2022
Benefit of Interpolation in Nearest Neighbor Algorithms
SIAM Journal on Mathematics of Data Science (SIMODS), 2019
Yue Xing, Qifan Song, Guang Cheng
23 Feb 2022
On Optimal Early Stopping: Over-informative versus Under-informative Parametrization
Ruoqi Shen, Liyao (Mars) Gao, Yi-An Ma
20 Feb 2022
Fluctuations, Bias, Variance & Ensemble of Learners: Exact Asymptotics for Convex Losses in High-Dimension
International Conference on Machine Learning (ICML), 2022
Bruno Loureiro, Cédric Gerbelot, Maria Refinetti, G. Sicuro, Florent Krzakala
31 Jan 2022
With Greater Distance Comes Worse Performance: On the Perspective of Layer Utilization and Model Generalization
James Wang, Cheng Yang
28 Jan 2022
Multi-scale Feature Learning Dynamics: Insights for Double Descent
International Conference on Machine Learning (ICML), 2021
Mohammad Pezeshki, Amartya Mitra, Yoshua Bengio, Guillaume Lajoie
06 Dec 2021
Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning
Yingjie Fei, Zhuoran Yang, Yudong Chen, Zhaoran Wang
06 Nov 2021
On the Double Descent of Random Features Models Trained with SGD
Fanghui Liu, Johan A. K. Suykens, Volkan Cevher
13 Oct 2021
Taxonomizing local versus global structure in neural network loss landscapes
Neural Information Processing Systems (NeurIPS), 2021
Yaoqing Yang, Liam Hodgkinson, Ryan Theisen, Joe Zou, Joseph E. Gonzalez, Kannan Ramchandran, Michael W. Mahoney
23 Jul 2021
Mitigating deep double descent by concatenating inputs
John Chen, Qihan Wang, Anastasios Kyrillidis
02 Jul 2021
Towards an Understanding of Benign Overfitting in Neural Networks
Zhu Li, Zhi Zhou, Arthur Gretton
06 Jun 2021
Out-of-Distribution Generalization in Kernel Regression
Neural Information Processing Systems (NeurIPS), 2021
Abdulkadir Canatar, Blake Bordelon, Cengiz Pehlevan
04 Jun 2021
The Shape of Learning Curves: a Review
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
T. Viering, Marco Loog
19 Mar 2021
On the interplay between data structure and loss function in classification problems
Neural Information Processing Systems (NeurIPS), 2021
Stéphane d'Ascoli, Marylou Gabrié, Levent Sagun, Giulio Biroli
09 Mar 2021
Risk-Monotonicity in Statistical Learning
Neural Information Processing Systems (NeurIPS), 2020
Zakaria Mhammedi
28 Nov 2020
What causes the test error? Going beyond bias-variance via ANOVA
Journal of Machine Learning Research (JMLR), 2020
Licong Lin, Guang Cheng
11 Oct 2020
On the Universality of the Double Descent Peak in Ridgeless Regression
International Conference on Learning Representations (ICLR), 2020
David Holzmüller
05 Oct 2020
Spectral Bias and Task-Model Alignment Explain Generalization in Kernel Regression and Infinitely Wide Neural Networks
Nature Communications (Nat Commun), 2020
Abdulkadir Canatar, Blake Bordelon, Cengiz Pehlevan
23 Jun 2020
Asymptotics of Ridge(less) Regression under General Source Condition
International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Dominic Richards, Jaouad Mourtada, Lorenzo Rosasco
11 Jun 2020
Triple descent and the two kinds of overfitting: Where & why do they appear?
Stéphane d'Ascoli, Levent Sagun, Giulio Biroli
05 Jun 2020
Random Features for Kernel Approximation: A Survey on Algorithms, Theory, and Beyond
Fanghui Liu, Xiaolin Huang, Yudong Chen, Johan A. K. Suykens
23 Apr 2020
The Curious Case of Adversarially Robust Models: More Data Can Help, Double Descend, or Hurt Generalization
Conference on Uncertainty in Artificial Intelligence (UAI), 2020
Yifei Min, Lin Chen, Amin Karbasi
25 Feb 2020
Empirical Risk Minimization in the Interpolating Regime with Application to Neural Network Learning
Machine-mediated learning (ML), 2019
Nicole Mücke, Ingo Steinwart
25 May 2019