Reconciling modern machine learning practice and the bias-variance trade-off

28 December 2018

Papers citing "Reconciling modern machine learning practice and the bias-variance trade-off"

50 / 945 papers shown

Transgressing the boundaries: towards a rigorous understanding of deep learning and its (non-)robustness

C. Hartmann

Lorenz Richter

AAML

206

05 Jul 2023

Abide by the Law and Follow the Flow: Conservation Laws for Gradient Flows

Neural Information Processing Systems (NeurIPS), 2023

Sibylle Marcotte

Rémi Gribonval

Gabriel Peyré

327

30 Jun 2023

A Quantitative Functional Central Limit Theorem for Shallow Neural Networks

Modern Stochastics: Theory and Applications (MSTA), 2023

299

29 Jun 2023

Solving Kernel Ridge Regression with Gradient-Based Optimization Methods

Electronic Journal of Statistics (EJS), 2023

Oskar Allerbo

269

29 Jun 2023

Semantic Segmentation of Porosity in 4D Spatio-Temporal X-ray μCT of Titanium Coated Ni wires using Deep Learning

Pradyumna Elavarthi

Arun J. Bhattacharjee

A. P. Y. Puente

Anca L. Ralescu

143

24 Jun 2023

A Unified Approach to Controlling Implicit Regularization via Mirror Descent

Journal of Machine Learning Research (JMLR), 2023

255

24 Jun 2023

Efficient Online Processing with Deep Neural Networks

Lukas Hedegaard

209

23 Jun 2023

Quantifying lottery tickets under label noise: accuracy, calibration, and complexity

Conference on Uncertainty in Artificial Intelligence (UAI), 2023

236

21 Jun 2023

Deep Fusion: Efficient Network Training via Pre-trained Initializations

International Conference on Machine Learning (ICML), 2023

509

20 Jun 2023

Sampling from Gaussian Process Posteriors using Stochastic Gradient Descent

Neural Information Processing Systems (NeurIPS), 2023

J. Lin

Javier Antorán

Shreyas Padhy

David Janz

José Miguel Hernández-Lobato

Alexander Terenin

279

20 Jun 2023

Eight challenges in developing theory of intelligence

Haiping Huang

286

20 Jun 2023

Can predictive models be used for causal inference?

Maximilian Pichler

F. Hartig

OOD CML

216

18 Jun 2023

Training shallow ReLU networks on noisy data using hinge loss: when do we overfit and is it benign?

Neural Information Processing Systems (NeurIPS), 2023

219

16 Jun 2023

Nonparametric regression using over-parameterized shallow ReLU neural networks

Journal of Machine Learning Research (JMLR), 2023

Yunfei Yang

Ding-Xuan Zhou

347

14 Jun 2023

Progressive Class-Wise Attention (PCA) Approach for Diagnosing Skin Lesions

Asim Naveed

Syed S. Naqvi

Tariq Mahmood Khan

Imran Razzak

142

11 Jun 2023

Learning a Neuron by a Shallow ReLU Network: Dynamics and Implicit Bias for Correlated Inputs

Neural Information Processing Systems (NeurIPS), 2023

261

10 Jun 2023

Gibbs-Based Information Criteria and the Over-Parameterized Regime

International Conference on Artificial Intelligence and Statistics (AISTATS), 2023

Haobo Chen

Yuheng Bu

Greg Wornell

325

08 Jun 2023

Maximally Machine-Learnable Portfolios

Social Science Research Network (SSRN), 2023

Philippe Goulet Coulombe

Maximilian Göbel

248

08 Jun 2023

Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection Capability

International Conference on Machine Learning (ICML), 2023

Jiangchao Yao

Bo Han

208

06 Jun 2023

Unraveling Projection Heads in Contrastive Learning: Insights from Expansion and Shrinkage

Yu Gui

Cong Ma

Yiqiao Zhong

205

06 Jun 2023

Aiming towards the minimizers: fast convergence of SGD for overparametrized problems

Neural Information Processing Systems (NeurIPS), 2023

181

05 Jun 2023

TMI! Finetuned Models Leak Private Information from their Pretraining Data

Proceedings on Privacy Enhancing Technologies (PoPETs), 2023

296

01 Jun 2023

The Law of Parsimony in Gradient Descent for Learning Deep Linear Networks

Qing Qu

294

01 Jun 2023

Traffic Prediction using Artificial Intelligence: Review of Recent Advances and Emerging Opportunities

Transportation Research Part C: Emerging Technologies (TRC), 2022

Maryam Shaygan

Collin Meese

Wanxin Li

Xiaoliang (George) Zhao

Mark M. Nejad

267

174

31 May 2023

Multi-Epoch Learning for Deep Click-Through Rate Prediction Models

Jian Liang

167

31 May 2023

Generalized equivalences between subsampling and ridge regularization

Neural Information Processing Systems (NeurIPS), 2023

Pratik V. Patil

Jin-Hong Du

260

29 May 2023

Optimization's Neglected Normative Commitments

Conference on Fairness, Accountability and Transparency (FAccT), 2023

218

27 May 2023

Learning Capacity: A Measure of the Effective Dimensionality of a Model

Daiwei Chen

Wei-Di Chang

Pratik Chaudhari

156

27 May 2023

Dropout Drops Double Descent

Japanese Journal of Statistics and Data Science (JSDS), 2023

Tianbao Yang

J. Suzuki

262

25 May 2023

Double Descent of Discrepancy: A Task-, Data-, and Model-Agnostic Phenomenon

Yi-Xiao Luo

Bin Dong

174

25 May 2023

From Tempered to Benign Overfitting in ReLU Neural Networks

Neural Information Processing Systems (NeurIPS), 2023

Guy Kornowski

Gilad Yehudai

Ohad Shamir

261

24 May 2023

Least Squares Regression Can Exhibit Under-Parameterized Double Descent

Neural Information Processing Systems (NeurIPS), 2023

Xinyue Li

Rishi Sonthalia

331

24 May 2023

A 4D Hybrid Algorithm to Scale Parallel Training to Thousands of GPUs

240

22 May 2023

Prediction Risk and Estimation Risk of the Ridgeless Least Squares Estimator under General Assumptions on Regression Errors

International Conference on Learning Representations (ICLR), 2023

Sungyoon Lee

S. Lee

168

22 May 2023

When are ensembles really effective?

Neural Information Processing Systems (NeurIPS), 2023

215

21 May 2023

Towards understanding neural collapse in supervised contrastive learning with the information bottleneck method

Siwei Wang

S. Palmer

255

19 May 2023

Exploring the Complexity of Deep Neural Networks through Functional Equivalence

International Conference on Machine Learning (ICML), 2023

Guohao Shen

374

19 May 2023

On the ISS Property of the Gradient Flow for Single Hidden-Layer Neural Networks with Linear Activations

A. C. B. D. Oliveira

Milad Siami

Eduardo Sontag

197

17 May 2023

Understanding and Improving Model Averaging in Federated Learning on Heterogeneous Data

IEEE Transactions on Mobile Computing (IEEE TMC), 2023

389

13 May 2023

Reinterpreting causal discovery as the task of predicting unobserved joint statistics

337

11 May 2023

Target-Side Augmentation for Document-Level Machine Translation

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Guangsheng Bao

Zhiyang Teng

Yue Zhang

269

08 May 2023

The Training Process of Many Deep Networks Explores the Same Low-Dimensional Manifold

Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2023

261

02 May 2023

Is deep learning a useful tool for the pure mathematician?

Bulletin of the American Mathematical Society (BAMS), 2023

G. Williamson

FedML

179

25 Apr 2023

Learning Trajectories are Generalization Indicators

Neural Information Processing Systems (NeurIPS), 2023

445

25 Apr 2023

DeepReShape: Redesigning Neural Networks for Efficient Private Inference

N. Jha

Brandon Reagen

353

20 Apr 2023

Sparsity in neural networks can improve their privacy

239

20 Apr 2023

Approximation and interpolation of deep neural networks

Vlad Constantinescu

Ionel Popescu

109

20 Apr 2023

Generalization and Estimation Error Bounds for Model-based Neural Networks

International Conference on Learning Representations (ICLR), 2023

131

19 Apr 2023

AdapterGNN: Parameter-Efficient Fine-Tuning Improves Generalization in GNNs

AAAI Conference on Artificial Intelligence (AAAI), 2023

Shengrui Li

Xueting Han

Jing Bai

AI4CE

169

19 Apr 2023

Prediction-Oriented Bayesian Active Learning

International Conference on Artificial Intelligence and Statistics (AISTATS), 2023

Freddie Bickford-Smith

231

17 Apr 2023