A Neural Scaling Law from the Dimension of the Data Manifold

22 April 2020

Utkarsh Sharma

Jared Kaplan

ArXiv (abs)PDF HTML

Papers citing "A Neural Scaling Law from the Dimension of the Data Manifold"

42 / 42 papers shown

Optimal Look-back Horizon for Time Series Forecasting in Federated Learning

493

16 Nov 2025

Improved Scaling Laws in Linear Regression via Data Reuse

Licong Lin

Jingfeng Wu

Peter Bartlett

201

10 Jun 2025

From Text to Time? Rethinking the Effectiveness of the Large Language Model for Time Series Forecasting

221

09 Apr 2025

A Multi-Power Law for Loss Curve Prediction Across Learning Rate SchedulesInternational Conference on Learning Representations (ICLR), 2025

289

17 Mar 2025

Scaling Law Phenomena Across Regression Paradigms: Multiple and Kernel Approaches

341

03 Mar 2025

How to Upscale Neural Networks with Scaling Law? A Survey and Practical Guidelines

Ayan Sengupta

Tanmoy Chakraborty

491

17 Feb 2025

Explaining Context Length Scaling and Bounds for Language Models

703

03 Feb 2025

Physics of Skill Learning

375

21 Jan 2025

KAT to KANs: A Review of Kolmogorov-Arnold Networks and the Neural Leap Forward

Divesh Basina

Joseph Raj Vishal

Aarya Choudhary

Bharatesh Chakravarthi

294

15 Nov 2024

An Empirical Study of Scaling Laws for Transfer

Matthew Barnett

167

30 Aug 2024

Scaling Laws in Linear Regression: Compute, Parameters, and Data

480

12 Jun 2024

Hardness of Learning Neural Networks under the Manifold Hypothesis

B. Kiani

Jason Wang

Melanie Weber

263

03 Jun 2024

Survival of the Fittest Representation: A Case Study with Modular Addition

380

27 May 2024

Scaling Law for Time Series Forecasting

329

24 May 2024

KAN: Kolmogorov-Arnold Networks

995

1,319

30 Apr 2024

Language models scale reliably with over-training and on downstream tasksInternational Conference on Learning Representations (ICLR), 2024

...

Niklas Muennighoff

353

13 Mar 2024

A Resource Model For Neural Scaling Law

367

07 Feb 2024

Scaling Laws for Downstream Task Performance of Large Language ModelsInternational Conference on Learning Representations (ICLR), 2024

325

06 Feb 2024

Scaling Laws in Jet ClassificationSciPost Physics Core (SPC), 2023

Joshua D. Batson

Yonatan Kahn

191

04 Dec 2023

Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable TasksInternational Conference on Machine Learning (ICML), 2023

339

21 Nov 2023

A Neural Scaling Law from Lottery Ticket Ensembling

Ziming Liu

Max Tegmark

253

03 Oct 2023

Improved Bayes Risk Can Yield Reduced Social Welfare Under CompetitionNeural Information Processing Systems (NeurIPS), 2023

303

26 Jun 2023

Pythia: A Suite for Analyzing Large Language Models Across Training and ScalingInternational Conference on Machine Learning (ICML), 2023

...

397

1,667

03 Apr 2023

Exploring the Representation Manifolds of Stable Diffusion Through the Lens of Intrinsic Dimension

161

16 Feb 2023

Precision Machine Learning

Eric J. Michaud

Ziming Liu

Max Tegmark

173

24 Oct 2022

Scaling Laws for Reward Model OveroptimizationInternational Conference on Machine Learning (ICML), 2022

391

782

19 Oct 2022

Limitations of the NTK for Understanding Generalization in Deep Learning

Nikhil Vyas

Yamini Bansal

Preetum Nakkiran

310

20 Jun 2022

Deconstructing Distributions: A Pointwise Framework of LearningInternational Conference on Learning Representations (ICLR), 2022

217

20 Feb 2022

Data Scaling Laws in NMT: The Effect of Noise and ArchitectureInternational Conference on Machine Learning (ICML), 2022

Colin Cherry

248

04 Feb 2022

Nonlinear Initialization Methods for Low-Rank Neural Networks

Kiran Vodrahalli

243

02 Feb 2022

Tensor network to learn the wavefunction of dataPhysical Review Research (Phys. Rev. Res.), 2021

A. Dymarsky

K. Pavlenko

195

15 Nov 2021

Practical Galaxy Morphology Tools from Deep Supervised Representation Learning

Michelle Lochner

...

Sandor Kruk

180

25 Oct 2021

A Scaling Law for Synthetic-to-Real Transfer: How Much Is Your Pre-training Effective?

351

25 Aug 2021

Topological Obstructions to AutoencodingJournal of High Energy Physics (JHEP), 2021

192

16 Feb 2021

Explaining Neural Scaling LawsProceedings of the National Academy of Sciences of the United States of America (PNAS), 2021

351

381

12 Feb 2021

Learning Curve Theory

Marcus Hutter

380

08 Feb 2021

Scaling Laws for Transfer

502

285

02 Feb 2021

Towards Continual Reinforcement Learning: A Review and PerspectivesJournal of Artificial Intelligence Research (JAIR), 2020

560

381

25 Dec 2020

Scaling Laws for Autoregressive Generative Modeling

...

476

560

28 Oct 2020

Distributional Generalization: A New Kind of Generalization

Preetum Nakkiran

Yamini Bansal

OOD

253

17 Sep 2020

Partial local entropy and anisotropy in deep weight spacesPhysical Review E (PRE), 2020

Daniele Musso

259

17 Jul 2020

The Depth-to-Width Interplay in Self-Attention

380

22 Jun 2020