Reconciling modern machine learning practice and the bias-variance trade-off

28 December 2018

Papers citing "Reconciling modern machine learning practice and the bias-variance trade-off"

50 / 945 papers shown

Bayesian Interpolation with Deep Linear Networks

Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2022

Boris Hanin

Alexander Zlokapa

424

29 Dec 2022

Problem-Dependent Power of Quantum Neural Networks on Multi-Class Classification

Physical Review Letters (PRL), 2022

425

29 Dec 2022

On Implicit Bias in Overparameterized Bilevel Optimization

International Conference on Machine Learning (ICML), 2022

Paul Vicol

253

28 Dec 2022

Homophily modulates double descent generalization in graph convolution networks

Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2022

397

26 Dec 2022

The Quantum Path Kernel: a Generalized Quantum Neural Tangent Kernel for Deep Quantum Machine Learning

IEEE Transactions on Quantum Engineering (IEEE Trans. Quantum Eng.), 2022

Massimiliano Incudini

269

22 Dec 2022

Reproducible scaling laws for contrastive language-image learning

Computer Vision and Pattern Recognition (CVPR), 2022

496

1,161

14 Dec 2022

Gradient flow in the gaussian covariate model: exact solution of learning curves and multiple descent structures

Antoine Bodin

N. Macris

208

13 Dec 2022

Reliable extrapolation of deep neural operators informed by physics or sparse observations

Social Science Research Network (SSRN), 2022

260

127

13 Dec 2022

Tight bounds for maximum $\ell_1$-margin classifiers

291

07 Dec 2022

Improved Convergence Guarantees for Shallow Neural Networks

A. Razborov

ODL

217

05 Dec 2022

High Dimensional Binary Classification under Label Shift: Phase Transition and Regularization

Sampling Theory, Signal Processing, and Data Analysis (SampTA), 2022

315

01 Dec 2022

Regularization Trade-offs with Fake Features

European Signal Processing Conference (EUSIPCO), 2022

Martin Hellkvist

Ayça Özçelikkale

Anders Ahlén

346

01 Dec 2022

Task Discovery: Finding the Tasks that Neural Networks Generalize on

Neural Information Processing Systems (NeurIPS), 2022

378

01 Dec 2022

Nonlinear Advantage: Trained Networks Might Not Be As Complex as You Think

International Conference on Machine Learning (ICML), 2022

Christian H. X. Ali Mehmeti-Göpel

Jan Disselhoff

184

30 Nov 2022

Why Neural Networks Work

Intelligent Systems with Applications (ISA), 2022

Sayan Mukherjee

Bernardo A. Huberman

124

26 Nov 2022

The Vanishing Decision Boundary Complexity and the Strong First Component

Hengshuai Yao

UQCV

166

25 Nov 2022

The smooth output assumption, and why deep networks are better than wide ones

Luis Sa-Couto

J. M. Ramos

Andreas Wichert

103

25 Nov 2022

A Survey of Learning Curves with Bad Behavior: or How More Data Need Not Lead to Better Performance

Marco Loog

T. Viering

185

25 Nov 2022

Frozen Overparameterization: A Double Descent Perspective on Transfer Learning of Deep Neural Networks

Yehuda Dar

Lorenzo Luzi

Richard G. Baraniuk

AI4CE

183

20 Nov 2022

Understanding the double descent curve in Machine Learning

131

18 Nov 2022

Emergence of Concepts in DNNs?

Tim Räz

11 Nov 2022

Do highly over-parameterized neural networks generalize since bad solutions are rare?

IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022

Julius Martinetz

T. Martinetz

427

07 Nov 2022

Reward-Predictive Clustering

Lucas Lehnert

M. Frank

Michael L. Littman

OffRL

206

07 Nov 2022

Instance-Dependent Generalization Bounds via Optimal Transport

Journal of machine learning research (JMLR), 2022

500

02 Nov 2022

Transfer Learning with Kernel Methods

Nature Communications (Nat Commun), 2022

Adityanarayanan Radhakrishnan

Max Ruiz Luyten

Neha Prasad

Caroline Uhler

153

01 Nov 2022

Globally Gated Deep Linear Networks

Neural Information Processing Systems (NeurIPS), 2022

Qianyi Li

H. Sompolinsky

AI4CE

251

31 Oct 2022

A Law of Data Separation in Deep Learning

Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2022

Hangfeng He

Weijie J. Su

OOD

344

31 Oct 2022

A Solvable Model of Neural Scaling Laws

A. Maloney

Daniel A. Roberts

J. Sully

261

30 Oct 2022

Grokking phase transitions in learning local rules with gradient descent

Journal of machine learning research (JMLR), 2022

Bojan Žunkovič

E. Ilievski

275

26 Oct 2022

Learning Ability of Interpolating Deep Convolutional Neural Networks

Social Science Research Network (SSRN), 2022

Tiancong Zhou

X. Huo

AI4CE

183

25 Oct 2022

Deep Neural Networks as the Semi-classical Limit of Topological Quantum Neural Networks: The problem of generalisation

115

25 Oct 2022

Pruning's Effect on Generalization Through the Lens of Training and Regularization

Neural Information Processing Systems (NeurIPS), 2022

Gintare Karolina Dziugaite

236

25 Oct 2022

On double-descent in uncertainty quantification in overparametrized models

International Conference on Artificial Intelligence and Statistics (AISTATS), 2022

Lenka Zdeborová

462

23 Oct 2022

Monotonicity and Double Descent in Uncertainty Estimation with Gaussian Processes

International Conference on Machine Learning (ICML), 2022

Liam Hodgkinson

Christopher van der Heide

Fred Roosta

Michael W. Mahoney

UQCV

257

14 Oct 2022

Identification of quantum entanglement with Siamese convolutional neural networks and semi-supervised learning

Physical Review Applied (Phys. Rev. Appl.), 2022

J. Pawłowski

Mateusz Krawczyk

243

13 Oct 2022

The good, the bad and the ugly sides of data augmentation: An implicit spectral regularization perspective

Journal of machine learning research (JMLR), 2022

381

10 Oct 2022

Second-order regression models exhibit progressive sharpening to the edge of stability

International Conference on Machine Learning (ICML), 2022

Atish Agarwala

Fabian Pedregosa

Jeffrey Pennington

252

10 Oct 2022

Goal Misgeneralization: Why Correct Specifications Aren't Enough For Correct Goals

428

105

04 Oct 2022

Block-wise Training of Residual Networks via the Minimizing Movement Scheme

214

03 Oct 2022

Ten Years after ImageNet: A 360° Perspective on AI

01 Oct 2022

On the Impossible Safety of Large AI Models

359

30 Sep 2022

Why neural networks find simple solutions: the many regularizers of geometric complexity

Neural Information Processing Systems (NeurIPS), 2022

351

27 Sep 2022

In-context Learning and Induction Heads

...

607

700

24 Sep 2022

Deep Double Descent via Smooth Interpolation

611

21 Sep 2022

Deep Linear Networks can Benignly Overfit when Shallow Ones Do

Journal of machine learning research (JMLR), 2022

Niladri S. Chatterji

Philip M. Long

236

19 Sep 2022

Lazy vs hasty: linearization in deep networks impacts learning schedule based on example difficulty

Thomas George

Guillaume Lajoie

A. Baratin

193

19 Sep 2022

Importance Tempering: Group Robustness for Overparameterized Models

270

19 Sep 2022

Random Fourier Features for Asymmetric Kernels

Machine-mediated learning (ML), 2022

Ming-qian He

Fan He

Fanghui Liu

Xiaolin Huang

231

18 Sep 2022

Generalization in Neural Networks: A Broad Survey

Neurocomputing (Neurocomputing), 2022

Chris Rohlfs

OOD AI4CE

279

04 Sep 2022

Towards Understanding the Overfitting Phenomenon of Deep Click-Through Rate Prediction Models

International Conference on Information and Knowledge Management (CIKM), 2022

199

04 Sep 2022