Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1812.11118
Cited By
v1
v2 (latest)
Reconciling modern machine learning practice and the bias-variance trade-off
28 December 2018
M. Belkin
Daniel J. Hsu
Siyuan Ma
Soumik Mandal
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Reconciling modern machine learning practice and the bias-variance trade-off"
50 / 942 papers shown
Title
Parameter Reduction Improves Vision Transformers: A Comparative Study of Sharing and Width Reduction
Anantha Padmanaban Krishna Kumar
ViT
24
0
0
30 Nov 2025
On the Effect of Regularization on Nonparametric Mean-Variance Regression
Eliot Wong-Toi
Alex Boyd
Vincent Fortuin
Stephan Mandt
12
0
0
27 Nov 2025
SX-GeoTree: Self-eXplaining Geospatial Regression Tree Incorporating the Spatial Similarity of Feature Attributions
Chaogui Kang
Lijian Luo
Qingfeng Guan
Yu Liu
80
0
0
25 Nov 2025
Lower Bias, Higher Welfare: How Creator Competition Reshapes Bias-Variance Tradeoff in Recommendation Platforms?
Kang Wang
Renzhe Xu
Bo Li
56
0
0
25 Nov 2025
Analog Physical Systems Can Exhibit Double Descent
Sam Dillavou
Jason W Rocks
J. F. Wycoff
A. Liu
D. Durian
52
0
0
21 Nov 2025
DNNs, Dataset Statistics, and Correlation Functions
Robert W. Batterman
James F. Woodward
8
0
0
18 Nov 2025
Benign Overfitting in Linear Classifiers with a Bias Term
Yuta Kondo
AI4CE
196
0
0
16 Nov 2025
Source-Optimal Training is Transfer-Suboptimal
C. Evans Hedges
84
0
0
11 Nov 2025
Analyzing the Power of Chain of Thought through Memorization Capabilities
Lijia Yu
Xiao-Shan Gao
Lijun Zhang
LRM
ELM
200
0
0
03 Nov 2025
EL-MIA: Quantifying Membership Inference Risks of Sensitive Entities in LLMs
Ali Satvaty
Suzan Verberne
Fatih Turkmen
MIALM
263
0
0
31 Oct 2025
How Data Mixing Shapes In-Context Learning: Asymptotic Equivalence for Transformers with MLPs
Samet Demir
Zafer Dogan
89
0
0
29 Oct 2025
Aggregation Hides Out-of-Distribution Generalization Failures from Spurious Correlations
Olawale Salaudeen
Haoran Zhang
Mingyu Lu
Sara Beery
Marzyeh Ghassemi
OODD
309
0
0
28 Oct 2025
Convergence of Stochastic Gradient Langevin Dynamics in the Lazy Training Regime
Noah Oberweis
Semih Cayci
189
0
0
24 Oct 2025
A Unified Perspective on Optimization in Machine Learning and Neuroscience: From Gradient Descent to Neural Adaptation
Jesus Garcia Fernandez
Nasir Ahmad
Marcel van Gerven
AI4CE
225
0
0
21 Oct 2025
Position: Many generalization measures for deep learning are fragile
Shuofeng Zhang
A. Louis
AAML
214
0
0
21 Oct 2025
On the Impossibility of Retrain Equivalence in Machine Unlearning
Jiatong Yu
Yinghui He
Anirudh Goyal
Sanjeev Arora
MU
270
0
0
18 Oct 2025
Memorizing Long-tail Data Can Help Generalization Through Composition
Mo Zhou
Haoyang Ma
Rong Ge
TDI
321
0
0
18 Oct 2025
Transfer Learning for Benign Overfitting in High-Dimensional Linear Regression
Yeichan Kim
Ilmun Kim
Seyoung Park
124
0
0
17 Oct 2025
A novel Information-Driven Strategy for Optimal Regression Assessment
Benjamín Castro
Camilo Ramírez
Sebastián Espinosa
Jorge F. Silva
Marcos E. Orchard
Heraldo Rozas
60
0
0
16 Oct 2025
Optimal Regularization for Performative Learning
Edwige Cyffers
Alireza Mirrokni
Marco Mondelli
100
0
0
14 Oct 2025
Redundancy as a Structural Information Principle for Learning and Generalization
Yuda Bi
Ying Zhu
Vince D. Calhoun
100
0
0
13 Oct 2025
Provable Watermarking for Data Poisoning Attacks
Yifan Zhu
Lijia Yu
Xiao-Shan Gao
AAML
127
0
0
10 Oct 2025
High-dimensional Analysis of Synthetic Data Selection
Parham Rezaei
Filip Kovačević
Francesco Locatello
Marco Mondelli
157
0
0
09 Oct 2025
Deep Neural Networks Inspired by Differential Equations
Y. Liu
Lianfang Wang
Kuilin Qin
Qinghua Zhang
Faqiang Wang
Li-min Cui
Jun Liu
Yuping Duan
T. Zeng
AI4TS
AI4CE
174
0
0
09 Oct 2025
The Effect of Label Noise on the Information Content of Neural Representations
Ali Hussaini Umar
Franky Kevin Nando Tezoh
Jean Barbier
Santiago Acevedo
Alessandro Laio
SSL
NoLa
190
0
0
07 Oct 2025
Spectral Thresholds for Identifiability and Stability:Finite-Sample Phase Transitions in High-Dimensional Learning
William Hao-Cheng Huang
56
0
0
04 Oct 2025
On residual network depth
Benoit Dherin
Michael Munn
MDE
225
0
0
03 Oct 2025
Non-Vacuous Generalization Bounds: Can Rescaling Invariances Help?
Damien Rouchouse
Antoine Gonon
Rémi Gribonval
Benjamin Guedj
108
0
0
30 Sep 2025
Specialization after Generalization: Towards Understanding Test-Time Training in Foundation Models
Jonas Hübotter
Patrik Wolf
Alexander Shevchenko
Dennis Jüni
Andreas Krause
Gil Kur
196
0
0
29 Sep 2025
Double Descent as a Lens for Sample Efficiency in Autoregressive vs. Discrete Diffusion Models
Ahmad Fraij
Sam Dauncey
DiffM
84
0
0
29 Sep 2025
Neural Feature Geometry Evolves as Discrete Ricci Flow
Moritz Hehl
Max von Renesse
Melanie Weber
AI4CE
95
0
0
26 Sep 2025
Incorporating priors in learning: a random matrix study under a teacher-student framework
Malik Tiomoko
Ekkehard Schnoor
84
0
0
26 Sep 2025
A Law of Data Reconstruction for Random Features (and Beyond)
Leonardo Iurada
Simone Bombari
Tatiana Tommasi
Marco Mondelli
116
0
0
26 Sep 2025
Closed-form
ℓ
r
\ell_r
ℓ
r
norm scaling with data for overparameterized linear regression and diagonal linear networks under
ℓ
p
\ell_p
ℓ
p
bias
Shuofeng Zhang
A. Louis
149
0
0
25 Sep 2025
Understanding and Enhancing Mask-Based Pretraining towards Universal Representations
Mingze Dong
Leda Wang
Yuval Kluger
SSL
113
0
0
25 Sep 2025
Pre-training under infinite compute
Konwoo Kim
Suhas Kotha
Abigail Z. Jacobs
Tatsunori Hashimoto
200
1
0
18 Sep 2025
Inspired by machine learning optimization: can gradient-based optimizers solve cycle skipping in full waveform inversion given sufficient iterations?
Xinru Mu
Omar M. Saad
Shaowen Wang
Tariq Alkhalifah
65
0
0
18 Sep 2025
Deep learning and abstractive summarisation for radiological reports: an empirical study for adapting the PEGASUS models' family with scarce data
Claudio Benzoni
Martina Langhals
Martin Boeker
Luise Modersohn
Máté E. Maros
MedIm
72
0
0
18 Sep 2025
Data coarse graining can improve model performance
Alex Nguyen
D. Schwab
Vudtiwat Ngampruetikorn
72
0
0
18 Sep 2025
RAPTOR: A Foundation Policy for Quadrotor Control
Jonas Eschmann
Dario Albani
Giuseppe Loianno
138
1
0
15 Sep 2025
An entropy formula for the Deep Linear Network
Govind Menon
Tianmin Yu
81
2
0
11 Sep 2025
Evaluating the Efficiency of Latent Spaces via the Coupling-Matrix
Mehmet Can Yavuz
Berrin Yanikoglu
68
2
0
08 Sep 2025
On Aligning Prediction Models with Clinical Experiential Learning: A Prostate Cancer Case Study
Jacqueline Jil Vallon
William Overman
Wanqiao Xu
Neil Panjwani
Xi Ling
...
Geoffrey Sonn
Sandy Srinivas
E. Pollom
Mark K. Buyyounouski
Mohsen Bayati
96
1
0
04 Sep 2025
Discrete Functional Geometry of ReLU Networks via ReLU Transition Graphs
Sahil Rajesh Dhayalkar
126
0
0
03 Sep 2025
Eigenvalue distribution of the Neural Tangent Kernel in the quadratic scaling
Lucas Benigni
Elliot Paquette
81
1
0
27 Aug 2025
GRADSTOP: Early Stopping of Gradient Descent via Posterior Sampling
Arash Jamshidi
Lauri Seppäläinen
Katsiaryna Haitsiukevich
Hoang Phuc Hau Luu
Anton Björklund
Kai Puolamäki
BDL
125
0
0
26 Aug 2025
On the Edge of Memorization in Diffusion Models
Sam Buchanan
Druv Pai
Yi-An Ma
Valentin De Bortoli
TDI
236
3
0
25 Aug 2025
From Prediction to Simulation: AlphaFold 3 as a Differentiable Framework for Structural Biology
Alireza Abbaszadeh
Armita Shahlaee
AI4CE
56
1
0
25 Aug 2025
Convergence and Generalization of Anti-Regularization for Parametric Models
Dongseok Kim
Wonjun Jeong
Gisung Oh
189
0
0
24 Aug 2025
On the Collapse Errors Induced by the Deterministic Sampler for Diffusion Models
Yi Zhang
Zhenyu Liao
Jingfeng Wu
Difan Zou
DiffM
165
1
0
22 Aug 2025
1
2
3
4
...
17
18
19
Next