Learning Curve Theory

8 February 2021

Marcus Hutter

ArXiv (abs)PDF HTML

Papers citing "Learning Curve Theory"

50 / 56 papers shown

Zero-Shot Performance Prediction for Probabilistic Scaling Laws

175

19 Oct 2025

Efficient Prediction of Pass@k Scaling in Large Language Models

194

06 Oct 2025

xLSTM Scaling Laws: Competitive Performance with Linear Time-Complexity

178

02 Oct 2025

Evaluating the Robustness of Chinchilla Compute-Optimal Scaling

220

28 Sep 2025

Neural Scaling Laws for Deep Regression

Tilen Cadez

Kyoung-Min Kim

258

12 Sep 2025

Scaling Laws Are Unreliable for Downstream Tasks: A Reality Check

Nicholas Lourie

Michael Y. Hu

Dong Wang

215

01 Jul 2025

Complexity Scaling Laws for Neural Models using Combinatorial Optimization

Lowell Weissman

Michael Krumdick

A. Lynn Abbott

352

15 Jun 2025

Improved Scaling Laws in Linear Regression via Data Reuse

Licong Lin

Jingfeng Wu

Peter Bartlett

232

10 Jun 2025

When Models Don't Collapse: On the Consistency of Iterative MLE

Daniel Barzilai

Ohad Shamir

SyDa

253

25 May 2025

Superposition Yields Robust Neural Scaling

749

15 May 2025

Scaling Laws and Representation Learning in Simple Hierarchical Languages: Transformers vs. Convolutional Architectures

442

11 May 2025

Learning curves theory for hierarchically compositional data with power-law distributed features

Francesco Cagnetta

Hyunmo Kang

Matthieu Wyart

390

11 May 2025

Quiet Feature Learning in Algorithmic Tasks

430

06 May 2025

A Multi-Power Law for Loss Curve Prediction Across Learning Rate SchedulesInternational Conference on Learning Representations (ICLR), 2025

325

17 Mar 2025

Position: Solve Layerwise Linear Models First to Understand Neural Dynamical Phenomena (Neural Collapse, Emergence, Lazy/Rich Regime, and Grokking)

632

28 Feb 2025

How to Upscale Neural Networks with Scaling Law? A Survey and Practical Guidelines

Ayan Sengupta

Tanmoy Chakraborty

604

17 Feb 2025

Loss-to-Loss Prediction: Scaling Laws for All Datasets

335

19 Nov 2024

Scaling Laws for Pre-training Agents and World Models

431

07 Nov 2024

A Simple Model of Inference Scaling Laws

Noam Levi

LRM

265

21 Oct 2024

Adaptive Data Optimization: Dynamic Sample Selection with Scaling LawsInternational Conference on Learning Representations (ICLR), 2024

Yiding Jiang

307

15 Oct 2024

Analyzing Neural Scaling Laws in Two-Layer Networks with Power-Law Data SpectraInternational Conference on Learning Representations (ICLR), 2024

Roman Worschech

B. Rosenow

443

11 Oct 2024

Unified Neural Network Scaling Laws and Scale-time Equivalence

Akhilan Boopathy

Ila Fiete

551

09 Sep 2024

Breaking Neural Network Scaling Laws with ModularityInternational Conference on Learning Representations (ICLR), 2024

Ila Fiete

488

09 Sep 2024

Towards Exact Computation of Inductive Bias

357

22 Jun 2024

Reconciling Kaplan and Chinchilla Scaling Laws

Tim Pearce

Jinyeop Song

445

12 Jun 2024

Scaling Laws in Linear Regression: Compute, Parameters, and Data

556

12 Jun 2024

Scaling Laws for the Value of Individual Data Points in Machine Learning

314

30 May 2024

Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance

Jiasheng Ye

Peiju Liu

Tianxiang Sun

Yunhua Zhou

Jun Zhan

Xipeng Qiu

522

131

25 Mar 2024

How much data do you need? Part 2: Predicting DL class specific training dataset sizes

Thomas Mühlenstädt

Jelena Frtunikj

178

10 Mar 2024

A Tale of Tails: Model Collapse as a Change of Scaling LawsInternational Conference on Machine Learning (ICML), 2024

345

117

10 Feb 2024

Scaling Laws for Downstream Task Performance of Large Language ModelsInternational Conference on Learning Representations (ICLR), 2024

383

06 Feb 2024

Scaling Laws for Associative MemoriesInternational Conference on Learning Representations (ICLR), 2023

Vivien A. Cabannes

Elvis Dohmatob

A. Bietti

470

04 Oct 2023

Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck

347

07 Sep 2023

Is One Epoch All You Need For Multi-Fidelity Hyperparameter Optimization?The European Symposium on Artificial Neural Networks (ESANN), 2023

336

28 Jul 2023

The semantic landscape paradigm for neural networks

Shreyas Gokhale

425

18 Jul 2023

Delegated ClassificationNeural Information Processing Systems (NeurIPS), 2023

Eden Saig

Inbal Talgam-Cohen

Nir Rosenfeld

282

20 Jun 2023

Getting ViT in Shape: Scaling Laws for Compute-Optimal Model DesignNeural Information Processing Systems (NeurIPS), 2023

Ibrahim Alabdulmohsin

686

102

22 May 2023

Model-agnostic Measure of Generalization DifficultyInternational Conference on Machine Learning (ICML), 2023

Jaedong Hwang

Ila Fiete

444

01 May 2023

The Quantization Model of Neural ScalingNeural Information Processing Systems (NeurIPS), 2023

450

133

23 Mar 2023

Scaling Laws for Multilingual Neural Machine TranslationInternational Conference on Machine Learning (ICML), 2023

283

19 Feb 2023

Multiperiodic Processes: Ergodic Sources with a Sublinear Entropy

L. Debowski

MILM

433

17 Feb 2023

Languages You Know Influence Those You Learn: Impact of Language Characteristics on Multi-Lingual Text-to-Text Transfer

275

04 Dec 2022

Rethinking the transfer learning for FCN based polyp segmentation in colonoscopyIEEE Access (IEEE Access), 2022

Yan-mao Wen

Lei Zhang

Xiangli Meng

Xujiong Ye

232

04 Nov 2022

A Solvable Model of Neural Scaling Laws

A. Maloney

Daniel A. Roberts

J. Sully

378

30 Oct 2022

Revisiting Neural Scaling Laws in Language and VisionNeural Information Processing Systems (NeurIPS), 2022

Ibrahim Alabdulmohsin

Behnam Neyshabur

Xiaohua Zhai

645

157

13 Sep 2022

How Much More Data Do I Need? Estimating Requirements for Downstream TasksComputer Vision and Pattern Recognition (CVPR), 2022

Sanja Fidler

233

04 Jul 2022

Unified Scaling Laws for Routed Language ModelsInternational Conference on Machine Learning (ICML), 2022

...

442

261

02 Feb 2022

Error Scaling Laws for Kernel Classification under Source and Capacity Conditions

Hugo Cui

Bruno Loureiro

Florent Krzakala

Lenka Zdeborová

368

29 Jan 2022

Scaling Law for Recommendation Models: Towards General-purpose User RepresentationsAAAI Conference on Artificial Intelligence (AAAI), 2021

507

15 Nov 2021

Turing-Universal Learners with Optimal Scaling Laws

Preetum Nakkiran

225

09 Nov 2021