Reconciling modern machine learning practice and the bias-variance trade-off

28 December 2018
M. Belkin
Daniel J. Hsu
Siyuan Ma
Soumik Mandal
arXiv: 1812.11118

Papers citing "Reconciling modern machine learning practice and the bias-variance trade-off"

50 / 938 papers shown
Title
Generalizability of Neural Networks Minimizing Empirical Risk Based on Expressive Ability
Lijia Yu
Yibo Miao
Yifan Zhu
Xiao-Shan Gao
Lijun Zhang
224
0
0
06 Mar 2025
On the Relationship Between Double Descent of CNNs and Shape/Texture Bias Under Learning Process
International Conference on Pattern Recognition (ICPR), 2025
Shun Iwase
Shuya Takahashi
Nakamasa Inoue
Rio Yokota
Ryo Nakamura
Hirokatsu Kataoka
226
0
0
04 Mar 2025
Scaling Law Phenomena Across Regression Paradigms: Multiple and Kernel Approaches
Yifang Chen
Xuyang Guo
Xiaoyu Li
Yingyu Liang
Zhenmei Shi
Zhao Song
238
3
0
03 Mar 2025
All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning
Gokul Swamy
Sanjiban Choudhury
Wen Sun
Zhiwei Steven Wu
J. Andrew Bagnell
OffRL
331
41
0
03 Mar 2025
A Near Complete Nonasymptotic Generalization Theory For Multilayer Neural Networks: Beyond the Bias-Variance Tradeoff
Hao Yu
Xiangyang Ji
AI4CE
181
0
0
03 Mar 2025
Deep Learning is Not So Mysterious or Different
Andrew Gordon Wilson
273
21
0
03 Mar 2025
Defining bias in AI-systems: Biased models are fair models
Chiara Lindloff
Ingo Siegert
FaML
143
0
0
25 Feb 2025
From Small to Large Language Models: Revisiting the Federalist Papers
So Won Jeong
Veronika Rockova
323
2
0
25 Feb 2025
Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks
Rylan Schaeffer
Punit Singh Koura
Binh Tang
R. Subramanian
Aaditya K. Singh
...
Vedanuj Goswami
Sergey Edunov
Dieuwke Hupkes
Sanmi Koyejo
Sharan Narang
ALM
281
2
0
24 Feb 2025
Understanding Generalization in Transformers: Error Bounds and Training Dynamics Under Benign and Harmful Overfitting
Yingying Zhang
Zhikai Wu
Jian Li
Wenshu Fan
MLT
AI4CE
163
1
0
18 Feb 2025
Discovering the influence of personal features in psychological processes using Artificial Intelligence techniques: the case of COVID19 lockdown in Spain
Blanca Mellor-Marsa
Alfredo Guitian
Andrew Coney
Berta Padilla
Alberto Nogales
111
0
0
18 Feb 2025
Early Stopping Against Label Noise Without Validation Data
International Conference on Learning Representations (ICLR), 2025
Suqin Yuan
Lei Feng
Tongliang Liu
NoLa
520
30
0
11 Feb 2025
The late-stage training dynamics of (stochastic) subgradient descent on homogeneous neural networks
Annual Conference Computational Learning Theory (COLT), 2025
Sholom Schechtman
Nicolas Schreuder
949
0
0
08 Feb 2025
Spurious Correlations in High Dimensional Regression: The Roles of Regularization, Simplicity Bias and Over-Parameterization
Simone Bombari
Marco Mondelli
610
5
0
03 Feb 2025
Efficient Semi-Supervised Adversarial Training via Latent Clustering-Based Data Reduction
Somrita Ghosh
Yuelin Xu
Xiao Zhang
OOD
AAML
239
0
0
15 Jan 2025
DEHYDRATOR: Enhancing Provenance Graph Storage via Hierarchical Encoding and Sequence Generation
IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2024
J. Ying
Tiantian Zhu
Mingqi Lv
Tieming Chen
97
0
0
03 Jan 2025
Functional Risk Minimization
Ferran Alet
Clement Gehring
Tomás Lozano-Pérez
Kenji Kawaguchi
Joshua B. Tenenbaum
Leslie Pack Kaelbling
OffRL
213
0
0
31 Dec 2024
The Pitfalls of Memorization: When Memorization Hurts Generalization
International Conference on Learning Representations (ICLR), 2024
Reza Bayat
Mohammad Pezeshki
Elvis Dohmatob
David Lopez-Paz
Pascal Vincent
OOD
291
15
0
10 Dec 2024
Analysis of High-dimensional Gaussian Labeled-unlabeled Mixture Model via Message-passing Algorithm
Journal of Statistical Mechanics: Theory and Experiment (JSTAT), 2024
Xiaosi Gu
Tomoyuki Obuchi
378
0
0
29 Nov 2024
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
Akhiad Bercovich
Tomer Ronen
Talor Abramovich
Nir Ailon
Nave Assaf
...
Ido Shahaf
Oren Tropp
Omer Ullman Argov
Ran Zilberstein
Ran El-Yaniv
663
8
0
28 Nov 2024
Convolutional Neural Networks Do Work with Pre-Defined Filters
IEEE International Joint Conference on Neural Network (IJCNN), 2023
C. Linse
Erhardt Barth
T. Martinetz
248
5
0
27 Nov 2024
Fast training of large kernel models with delayed projections
Amirhesam Abedsoltan
Siyuan Ma
Parthe Pandit
Mikhail Belkin
295
1
0
25 Nov 2024
Accelerated zero-order SGD under high-order smoothness and overparameterized regime
Nelineinaya Dinamika (ND), 2024
Georgii Bychkov
D. Dvinskikh
Anastasia Antsiferova
Alexander Gasnikov
Aleksandr Lobanov
219
1
0
21 Nov 2024
Is network fragmentation a useful complexity measure?
Coenraad Mouton
Randle Rabe
Daniël G. Haasbroek
Marthinus W. Theunissen
Hermanus L. Potgieter
Marelie Hattingh Davel
747
0
0
07 Nov 2024
Double Descent Meets Out-of-Distribution Detection: Theoretical Insights and Empirical Analysis on the role of model complexity
Mouin Ben Ammar
David Brellmann
Arturo Mendoza
Antoine Manzanera
Gianni Franchi
OODD
264
0
0
04 Nov 2024
Theoretical characterisation of the Gauss-Newton conditioning in Neural Networks
Neural Information Processing Systems (NeurIPS), 2024
Jim Zhao
Sidak Pal Singh
Aurelien Lucchi
AI4CE
403
1
0
04 Nov 2024
Generalizability of Memorization Neural Networks
Lijia Yu
Xiao-Shan Gao
Lijun Zhang
Yibo Miao
207
1
0
01 Nov 2024
How many classifiers do we need?
Neural Information Processing Systems (NeurIPS), 2024
Hyunsuk Kim
Liam Hodgkinson
Ryan Theisen
Michael W. Mahoney
243
0
0
01 Nov 2024
Efficient Model Compression for Bayesian Neural Networks
Diptarka Saha
Zihe Liu
Feng Liang
BDL
165
0
0
01 Nov 2024
Bilinear Sequence Regression: A Model for Learning from Long Sequences of High-dimensional Tokens
Physical Review X (PRX), 2024
Vittorio Erba
Emanuele Troiani
Luca Biggio
Antoine Maillard
Lenka Zdeborová
425
2
0
24 Oct 2024
High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws
International Conference on Learning Representations (ICLR), 2024
M. E. Ildiz
Halil Alperen Gozeten
Ege Onur Taga
Marco Mondelli
Samet Oymak
401
13
0
24 Oct 2024
Enhancing Generalization in Convolutional Neural Networks through Regularization with Edge and Line Features
International Conference on Artificial Neural Networks (ICANN), 2024
C. Linse
Beatrice Brückner
Thomas Martinetz
117
0
0
22 Oct 2024
Rethinking generalization of classifiers in separable classes scenarios and over-parameterized regimes
IEEE International Joint Conference on Neural Network (IJCNN), 2024
Julius Martinetz
C. Linse
Thomas Martinetz
269
0
0
22 Oct 2024
Theoretical Limitations of Ensembles in the Age of Overparameterization
Niclas Dern
John P. Cunningham
Geoff Pleiss
BDL
UQCV
271
2
0
21 Oct 2024
A Lipschitz spaces view of infinitely wide shallow neural networks
Francesca Bartolucci
Marcello Carioni
José A. Iglesias
Yury Korolev
Emanuele Naldi
Stefano Vigogna
282
2
0
18 Oct 2024
Analyzing Deep Transformer Models for Time Series Forecasting via Manifold Learning
Ilya Kaufman
Omri Azencot
AI4TS
169
4
0
17 Oct 2024
The Fair Language Model Paradox
Andrea Pinto
Tomer Galanti
Randall Balestriero
224
2
0
15 Oct 2024
Building a Multivariate Time Series Benchmarking Datasets Inspired by Natural Language Processing (NLP)
Mohammad Asif Ibna Mustafa
Ferdinand Heinrich
AI4TS
230
0
0
14 Oct 2024
On Goodhart's law, with an application to value alignment
El-Mahdi El-Mhamdi
Lê-Nguyên Hoang
99
4
0
12 Oct 2024
Features are fate: a theory of transfer learning in high-dimensional regression
Javan Tahir
Surya Ganguli
Grant M. Rotskoff
300
5
0
10 Oct 2024
Defending Membership Inference Attacks via Privacy-aware Sparsity Tuning
Qiang Hu
Hengxiang Zhang
Jianguo Huang
270
2
0
09 Oct 2024
Understanding Model Ensemble in Transferable Adversarial Attack
Wei Yao
Zeliang Zhang
Huayi Tang
Yong Liu
314
4
0
09 Oct 2024
Extended convexity and smoothness and their applications in deep learning
Binchuan Qi
Wei Gong
Li Li
322
0
0
08 Oct 2024
Simplicity bias and optimization threshold in two-layer ReLU networks
Etienne Boursier
Nicolas Flammarion
266
5
0
03 Oct 2024
Investigating the Impact of Model Complexity in Large Language Models
Jing Luo
Huiyuan Wang
Weiran Huang
157
0
0
01 Oct 2024
Random Features Outperform Linear Models: Effect of Strong Input-Label Correlation in Spiked Covariance Data
Samet Demir
Zafer Dogan
191
4
0
30 Sep 2024
Classical Statistical (In-Sample) Intuitions Don't Generalize Well: A Note on Bias-Variance Tradeoffs, Overfitting and Moving from Fixed to Random Designs
Alicia Curth
160
6
0
27 Sep 2024
The poison of dimensionality
Lê-Nguyên Hoang
225
3
0
25 Sep 2024
Generative Pre-trained Ranking Model with Over-parameterization at Web-Scale (Extended Abstract)
Yuchen Li
Haoyi Xiong
Linghe Kong
Jiang Bian
Shuaiqiang Wang
Guihai Chen
D. Yin
124
0
0
25 Sep 2024
Zero-shot forecasting of chaotic systems
International Conference on Learning Representations (ICLR), 2024
Yuanzhao Zhang
William Gilpin
AI4TS
531
15
0
24 Sep 2024