Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1812.11118
Cited By
v1
v2 (latest)
Reconciling modern machine learning practice and the bias-variance trade-off
28 December 2018
M. Belkin
Daniel J. Hsu
Siyuan Ma
Soumik Mandal
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Reconciling modern machine learning practice and the bias-variance trade-off"
50 / 945 papers shown
Transgressing the boundaries: towards a rigorous understanding of deep learning and its (non-)robustness
C. Hartmann
Lorenz Richter
AAML
206
2
0
05 Jul 2023
Abide by the Law and Follow the Flow: Conservation Laws for Gradient Flows
Neural Information Processing Systems (NeurIPS), 2023
Sibylle Marcotte
Rémi Gribonval
Gabriel Peyré
327
27
0
30 Jun 2023
A Quantitative Functional Central Limit Theorem for Shallow Neural Networks
Modern Stochastics: Theory and Applications (MSTA), 2023
Valentina Cammarota
Domenico Marinucci
M. Salvi
Stefano Vigogna
299
14
0
29 Jun 2023
Solving Kernel Ridge Regression with Gradient-Based Optimization Methods
Electronic Journal of Statistics (EJS), 2023
Oskar Allerbo
269
1
0
29 Jun 2023
Semantic Segmentation of Porosity in 4D Spatio-Temporal X-ray μCT of Titanium Coated Ni wires using Deep Learning
Pradyumna Elavarthi
Arun J. Bhattacharjee
A. P. Y. Puente
Anca L. Ralescu
143
0
0
24 Jun 2023
A Unified Approach to Controlling Implicit Regularization via Mirror Descent
Journal of machine learning research (JMLR), 2023
Haoyuan Sun
Khashayar Gatmiry
Kwangjun Ahn
Navid Azizan
AI4CE
255
16
0
24 Jun 2023
Efficient Online Processing with Deep Neural Networks
Lukas Hedegaard
209
0
0
23 Jun 2023
Quantifying lottery tickets under label noise: accuracy, calibration, and complexity
Conference on Uncertainty in Artificial Intelligence (UAI), 2023
V. Arora
Daniele Irto
Sebastian Goldt
G. Sanguinetti
236
2
0
21 Jun 2023
Deep Fusion: Efficient Network Training via Pre-trained Initializations
International Conference on Machine Learning (ICML), 2023
Hanna Mazzawi
X. Gonzalvo
Michael Wunder
Sammy Jerome
Benoit Dherin
AI4CE
509
3
0
20 Jun 2023
Sampling from Gaussian Process Posteriors using Stochastic Gradient Descent
Neural Information Processing Systems (NeurIPS), 2023
J. Lin
Javier Antorán
Shreyas Padhy
David Janz
José Miguel Hernández-Lobato
Alexander Terenin
279
28
0
20 Jun 2023
Eight challenges in developing theory of intelligence
Haiping Huang
286
12
0
20 Jun 2023
Can predictive models be used for causal inference?
Maximilian Pichler
F. Hartig
OOD
CML
216
6
0
18 Jun 2023
Training shallow ReLU networks on noisy data using hinge loss: when do we overfit and is it benign?
Neural Information Processing Systems (NeurIPS), 2023
Erin E. George
Michael Murray
W. Swartworth
Deanna Needell
MLT
219
8
0
16 Jun 2023
Nonparametric regression using over-parameterized shallow ReLU neural networks
Journal of machine learning research (JMLR), 2023
Yunfei Yang
Ding-Xuan Zhou
347
15
0
14 Jun 2023
Progressive Class-Wise Attention (PCA) Approach for Diagnosing Skin Lesions
Asim Naveed
Syed S. Naqvi
Tariq Mahmood Khan
Imran Razzak
142
1
0
11 Jun 2023
Learning a Neuron by a Shallow ReLU Network: Dynamics and Implicit Bias for Correlated Inputs
Neural Information Processing Systems (NeurIPS), 2023
D. Chistikov
Matthias Englert
R. Lazic
MLT
261
15
0
10 Jun 2023
Gibbs-Based Information Criteria and the Over-Parameterized Regime
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Haobo Chen
Yuheng Bu
Greg Wornell
325
1
0
08 Jun 2023
Maximally Machine-Learnable Portfolios
Social Science Research Network (SSRN), 2023
Philippe Goulet Coulombe
Maximilian Göbel
248
4
0
08 Jun 2023
Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection Capability
International Conference on Machine Learning (ICML), 2023
Jianing Zhu
Hengzhuang Li
Jiangchao Yao
Tongliang Liu
Jianliang Xu
Bo Han
OODD
208
19
0
06 Jun 2023
Unraveling Projection Heads in Contrastive Learning: Insights from Expansion and Shrinkage
Yu Gui
Cong Ma
Yiqiao Zhong
205
9
0
06 Jun 2023
Aiming towards the minimizers: fast convergence of SGD for overparametrized problems
Neural Information Processing Systems (NeurIPS), 2023
Chaoyue Liu
Dmitriy Drusvyatskiy
M. Belkin
Damek Davis
Yi-An Ma
ODL
181
20
0
05 Jun 2023
TMI! Finetuned Models Leak Private Information from their Pretraining Data
Proceedings on Privacy Enhancing Technologies (PoPETs), 2023
John Abascal
Stanley Wu
Alina Oprea
Jonathan R. Ullman
296
22
0
01 Jun 2023
The Law of Parsimony in Gradient Descent for Learning Deep Linear Networks
Can Yaras
Peng Wang
Wei Hu
Zhihui Zhu
Laura Balzano
Qing Qu
294
20
0
01 Jun 2023
Traffic Prediction using Artificial Intelligence: Review of Recent Advances and Emerging Opportunities
Transportation Research Part C: Emerging Technologies (TRC), 2022
Maryam Shaygan
Collin Meese
Wanxin Li
Xiaoliang (George) Zhao
Mark M. Nejad
267
174
0
31 May 2023
Multi-Epoch Learning for Deep Click-Through Rate Prediction Models
Zhaocheng Liu
Zhongxiang Fan
Jian Liang
Dongying Kong
Han Li
167
3
0
31 May 2023
Generalized equivalences between subsampling and ridge regularization
Neural Information Processing Systems (NeurIPS), 2023
Pratik V. Patil
Jin-Hong Du
260
6
0
29 May 2023
Optimization's Neglected Normative Commitments
Conference on Fairness, Accountability and Transparency (FAccT), 2023
Benjamin Laufer
T. Gilbert
Helen Nissenbaum
OffRL
218
8
0
27 May 2023
Learning Capacity: A Measure of the Effective Dimensionality of a Model
Daiwei Chen
Wei-Di Chang
Pratik Chaudhari
156
6
0
27 May 2023
Dropout Drops Double Descent
Japanese Journal of Statistics and Data Science (JSDS), 2023
Tianbao Yang
J. Suzuki
262
1
0
25 May 2023
Double Descent of Discrepancy: A Task-, Data-, and Model-Agnostic Phenomenon
Yi-Xiao Luo
Bin Dong
174
0
0
25 May 2023
From Tempered to Benign Overfitting in ReLU Neural Networks
Neural Information Processing Systems (NeurIPS), 2023
Guy Kornowski
Gilad Yehudai
Ohad Shamir
261
12
0
24 May 2023
Least Squares Regression Can Exhibit Under-Parameterized Double Descent
Neural Information Processing Systems (NeurIPS), 2023
Xinyue Li
Rishi Sonthalia
331
5
0
24 May 2023
A 4D Hybrid Algorithm to Scale Parallel Training to Thousands of GPUs
Siddharth Singh
Prajwal Singhania
Aditya K. Ranjan
Zack Sating
A. Bhatele
240
6
0
22 May 2023
Prediction Risk and Estimation Risk of the Ridgeless Least Squares Estimator under General Assumptions on Regression Errors
International Conference on Learning Representations (ICLR), 2023
Sungyoon Lee
S. Lee
168
0
0
22 May 2023
When are ensembles really effective?
Neural Information Processing Systems (NeurIPS), 2023
Ryan Theisen
Hyunsuk Kim
Yaoqing Yang
Liam Hodgkinson
Michael W. Mahoney
FedML
UQCV
215
24
0
21 May 2023
Towards understanding neural collapse in supervised contrastive learning with the information bottleneck method
Siwei Wang
S. Palmer
255
4
0
19 May 2023
Exploring the Complexity of Deep Neural Networks through Functional Equivalence
International Conference on Machine Learning (ICML), 2023
Guohao Shen
374
6
0
19 May 2023
On the ISS Property of the Gradient Flow for Single Hidden-Layer Neural Networks with Linear Activations
A. C. B. D. Oliveira
Milad Siami
Eduardo Sontag
197
2
0
17 May 2023
Understanding and Improving Model Averaging in Federated Learning on Heterogeneous Data
IEEE Transactions on Mobile Computing (IEEE TMC), 2023
Tailin Zhou
Zehong Lin
Jinchao Zhang
Danny H. K. Tsang
MoMe
FedML
389
22
0
13 May 2023
Reinterpreting causal discovery as the task of predicting unobserved joint statistics
Dominik Janzing
P. M. Faller
L. C. Vankadara
CML
337
4
0
11 May 2023
Target-Side Augmentation for Document-Level Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Guangsheng Bao
Zhiyang Teng
Yue Zhang
269
12
0
08 May 2023
The Training Process of Many Deep Networks Explores the Same Low-Dimensional Manifold
Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2023
Jialin Mao
Itay Griniasty
H. Teoh
Rahul Ramesh
Rubing Yang
Mark K. Transtrum
James P. Sethna
Pratik Chaudhari
3DPC
261
21
0
02 May 2023
Is deep learning a useful tool for the pure mathematician?
Bulletin of the American Mathematical Society (BAMS), 2023
G. Williamson
FedML
179
18
0
25 Apr 2023
Learning Trajectories are Generalization Indicators
Neural Information Processing Systems (NeurIPS), 2023
Jingwen Fu
Zhizheng Zhang
Dacheng Yin
Yan Lu
Nanning Zheng
AI4CE
445
5
0
25 Apr 2023
DeepReShape: Redesigning Neural Networks for Efficient Private Inference
N. Jha
Brandon Reagen
353
15
0
20 Apr 2023
Sparsity in neural networks can improve their privacy
Antoine Gonon
Léon Zheng
Clément Lalanne
Quoc-Tung Le
Guillaume Lauga
Can Pouliquen
239
2
0
20 Apr 2023
Approximation and interpolation of deep neural networks
Vlad Constantinescu
Ionel Popescu
109
2
0
20 Apr 2023
Generalization and Estimation Error Bounds for Model-based Neural Networks
International Conference on Learning Representations (ICLR), 2023
Avner Shultzman
Eyar Azar
M. Rodrigues
Yonina C. Eldar
131
10
0
19 Apr 2023
AdapterGNN: Parameter-Efficient Fine-Tuning Improves Generalization in GNNs
AAAI Conference on Artificial Intelligence (AAAI), 2023
Shengrui Li
Xueting Han
Jing Bai
AI4CE
169
21
0
19 Apr 2023
Prediction-Oriented Bayesian Active Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Freddie Bickford-Smith
Andreas Kirsch
Sebastian Farquhar
Y. Gal
Adam Foster
Tom Rainforth
231
53
0
17 Apr 2023
Previous
1
2
3
...
6
7
8
...
17
18
19
Next