
Reconciling modern machine learning practice and the bias-variance trade-off

28 December 2018

Papers citing "Reconciling modern machine learning practice and the bias-variance trade-off"

50 / 938 papers shown

Generalizability of Neural Networks Minimizing Empirical Risk Based on Expressive Ability Lijia Yu Yibo Miao Yifan Zhu Xiao-Shan Gao Lijun Zhang 224 0 0 06 Mar 2025
On the Relationship Between Double Descent of CNNs and Shape/Texture Bias Under Learning Process International Conference on Pattern Recognition (ICPR), 2025 Shun Iwase Shuya Takahashi Nakamasa Inoue Rio Yokota Ryo Nakamura Hirokatsu Kataoka 226 0 0 04 Mar 2025
Scaling Law Phenomena Across Regression Paradigms: Multiple and Kernel Approaches Yifang Chen Xuyang Guo Xiaoyu Li Yingyu Liang Zhenmei Shi Zhao Song 238 3 0 03 Mar 2025
All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning Gokul Swamy Sanjiban Choudhury Wen Sun Zhiwei Steven Wu J. Andrew Bagnell OffRL 331 41 0 03 Mar 2025
A Near Complete Nonasymptotic Generalization Theory For Multilayer Neural Networks: Beyond the Bias-Variance Tradeoff Hao Yu Xiangyang Ji AI4CE 181 0 0 03 Mar 2025
Deep Learning is Not So Mysterious or Different Andrew Gordon Wilson 273 21 0 03 Mar 2025
Defining bias in AI-systems: Biased models are fair models Chiara Lindloff Ingo Siegert FaML 143 0 0 25 Feb 2025
From Small to Large Language Models: Revisiting the Federalist Papers So Won Jeong Veronika Rockova 323 2 0 25 Feb 2025
Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks Rylan Schaeffer Punit Singh Koura Binh Tang R. Subramanian Aaditya K. Singh ... Vedanuj Goswami Sergey Edunov Dieuwke Hupkes Sanmi Koyejo Sharan Narang ALM 281 2 0 24 Feb 2025
Understanding Generalization in Transformers: Error Bounds and Training Dynamics Under Benign and Harmful Overfitting Yingying Zhang Zhikai Wu Jian Li Wenshu Fan MLT AI4CE 163 1 0 18 Feb 2025
Discovering the influence of personal features in psychological processes using Artificial Intelligence techniques: the case of COVID19 lockdown in Spain Blanca Mellor-Marsa Alfredo Guitian Andrew Coney Berta Padilla Alberto Nogales 111 0 0 18 Feb 2025
Early Stopping Against Label Noise Without Validation Data International Conference on Learning Representations (ICLR), 2025 Suqin Yuan Lei Feng Tongliang Liu NoLa 520 30 0 11 Feb 2025
The late-stage training dynamics of (stochastic) subgradient descent on homogeneous neural networks Annual Conference Computational Learning Theory (COLT), 2025 Sholom Schechtman Nicolas Schreuder 949 0 0 08 Feb 2025
Spurious Correlations in High Dimensional Regression: The Roles of Regularization, Simplicity Bias and Over-Parameterization Simone Bombari Marco Mondelli 610 5 0 03 Feb 2025
Efficient Semi-Supervised Adversarial Training via Latent Clustering-Based Data Reduction Somrita Ghosh Yuelin Xu Xiao Zhang OOD AAML 239 0 0 15 Jan 2025
DEHYDRATOR: Enhancing Provenance Graph Storage via Hierarchical Encoding and Sequence Generation IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2024 J. Ying Tiantian Zhu Mingqi Lv Tieming Chen 97 0 0 03 Jan 2025
Functional Risk Minimization Ferran Alet Clement Gehring Tomás Lozano-Pérez Kenji Kawaguchi Joshua B. Tenenbaum Leslie Pack Kaelbling OffRL 213 0 0 31 Dec 2024
The Pitfalls of Memorization: When Memorization Hurts Generalization International Conference on Learning Representations (ICLR), 2024 Reza Bayat Mohammad Pezeshki Elvis Dohmatob David Lopez-Paz Pascal Vincent OOD 291 15 0 10 Dec 2024
Analysis of High-dimensional Gaussian Labeled-unlabeled Mixture Model via Message-passing Algorithm Journal of Statistical Mechanics: Theory and Experiment (JSTAT), 2024 Xiaosi Gu Tomoyuki Obuchi 378 0 0 29 Nov 2024
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs Akhiad Bercovich Tomer Ronen Talor Abramovich Nir Ailon Nave Assaf ... Ido Shahaf Oren Tropp Omer Ullman Argov Ran Zilberstein Ran El-Yaniv 663 8 0 28 Nov 2024
Convolutional Neural Networks Do Work with Pre-Defined Filters IEEE International Joint Conference on Neural Network (IJCNN), 2023 C. Linse Erhardt Barth T. Martinetz 248 5 0 27 Nov 2024
Fast training of large kernel models with delayed projections Amirhesam Abedsoltan Siyuan Ma Parthe Pandit Mikhail Belkin 295 1 0 25 Nov 2024
Accelerated zero-order SGD under high-order smoothness and overparameterized regime Nelineinaya Dinamika (ND), 2024 Georgii Bychkov D. Dvinskikh Anastasia Antsiferova Alexander Gasnikov Aleksandr Lobanov 219 1 0 21 Nov 2024
Is network fragmentation a useful complexity measure? Coenraad Mouton Randle Rabe Daniël G. Haasbroek Marthinus W. Theunissen Hermanus L. Potgieter Marelie Hattingh Davel 747 0 0 07 Nov 2024
Double Descent Meets Out-of-Distribution Detection: Theoretical Insights and Empirical Analysis on the role of model complexity Mouin Ben Ammar David Brellmann Arturo Mendoza Antoine Manzanera Gianni Franchi OODD 264 0 0 04 Nov 2024
Theoretical characterisation of the Gauss-Newton conditioning in Neural Networks Neural Information Processing Systems (NeurIPS), 2024 Jim Zhao Sidak Pal Singh Aurelien Lucchi AI4CE 403 1 0 04 Nov 2024
Generalizability of Memorization Neural Networks Lijia Yu Xiao-Shan Gao Lijun Zhang Yibo Miao 207 1 0 01 Nov 2024
How many classifiers do we need? Neural Information Processing Systems (NeurIPS), 2024 Hyunsuk Kim Liam Hodgkinson Ryan Theisen Michael W. Mahoney 243 0 0 01 Nov 2024
Efficient Model Compression for Bayesian Neural Networks Diptarka Saha Zihe Liu Feng Liang BDL 165 0 0 01 Nov 2024
Bilinear Sequence Regression: A Model for Learning from Long Sequences of High-dimensional Tokens Physical Review X (PRX), 2024 Vittorio Erba Emanuele Troiani Luca Biggio Antoine Maillard Lenka Zdeborová 425 2 0 24 Oct 2024
High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws International Conference on Learning Representations (ICLR), 2024 M. E. Ildiz Halil Alperen Gozeten Ege Onur Taga Marco Mondelli Samet Oymak 401 13 0 24 Oct 2024
Enhancing Generalization in Convolutional Neural Networks through Regularization with Edge and Line Features International Conference on Artificial Neural Networks (ICANN), 2024 C. Linse Beatrice Brückner Thomas Martinetz 117 0 0 22 Oct 2024
Rethinking generalization of classifiers in separable classes scenarios and over-parameterized regimes IEEE International Joint Conference on Neural Network (IJCNN), 2024 Julius Martinetz C. Linse Thomas Martinetz 269 0 0 22 Oct 2024
Theoretical Limitations of Ensembles in the Age of Overparameterization Niclas Dern John P. Cunningham Geoff Pleiss BDL UQCV 271 2 0 21 Oct 2024
A Lipschitz spaces view of infinitely wide shallow neural networks Francesca Bartolucci Marcello Carioni José A. Iglesias Yury Korolev Emanuele Naldi Stefano Vigogna 282 2 0 18 Oct 2024
Analyzing Deep Transformer Models for Time Series Forecasting via Manifold Learning Ilya Kaufman Omri Azencot AI4TS 169 4 0 17 Oct 2024
The Fair Language Model Paradox Andrea Pinto Tomer Galanti Randall Balestriero 224 2 0 15 Oct 2024
Building a Multivariate Time Series Benchmarking Datasets Inspired by Natural Language Processing (NLP) Mohammad Asif Ibna Mustafa Ferdinand Heinrich AI4TS 230 0 0 14 Oct 2024
On Goodhart's law, with an application to value alignment El-Mahdi El-Mhamdi Lê-Nguyên Hoang 99 4 0 12 Oct 2024
Features are fate: a theory of transfer learning in high-dimensional regression Javan Tahir Surya Ganguli Grant M. Rotskoff 300 5 0 10 Oct 2024
Defending Membership Inference Attacks via Privacy-aware Sparsity Tuning Qiang Hu Hengxiang Zhang Jianguo Huang 270 2 0 09 Oct 2024
Understanding Model Ensemble in Transferable Adversarial Attack Wei Yao Zeliang Zhang Huayi Tang Yong Liu 314 4 0 09 Oct 2024
Extended convexity and smoothness and their applications in deep learning Binchuan Qi Wei Gong Li Li 322 0 0 08 Oct 2024
Simplicity bias and optimization threshold in two-layer ReLU networks Etienne Boursier Nicolas Flammarion 266 5 0 03 Oct 2024
Investigating the Impact of Model Complexity in Large Language Models Jing Luo Huiyuan Wang Weiran Huang 157 0 0 01 Oct 2024
Random Features Outperform Linear Models: Effect of Strong Input-Label Correlation in Spiked Covariance Data Samet Demir Zafer Dogan 191 4 0 30 Sep 2024
Classical Statistical (In-Sample) Intuitions Don't Generalize Well: A Note on Bias-Variance Tradeoffs, Overfitting and Moving from Fixed to Random Designs Alicia Curth 160 6 0 27 Sep 2024
The poison of dimensionality Lê-Nguyên Hoang 225 3 0 25 Sep 2024
Generative Pre-trained Ranking Model with Over-parameterization at Web-Scale (Extended Abstract) Yuchen Li Haoyi Xiong Linghe Kong Jiang Bian Shuaiqiang Wang Guihai Chen D. Yin 124 0 0 25 Sep 2024
Zero-shot forecasting of chaotic systems International Conference on Learning Representations (ICLR), 2024 Yuanzhao Zhang William Gilpin AI4TS 531 15 0 24 Sep 2024