ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.10802
  4. Cited By
A Neural Scaling Law from the Dimension of the Data Manifold

A Neural Scaling Law from the Dimension of the Data Manifold

22 April 2020
Utkarsh Sharma
Jared Kaplan
ArXiv (abs)PDFHTML

Papers citing "A Neural Scaling Law from the Dimension of the Data Manifold"

42 / 42 papers shown
Optimal Look-back Horizon for Time Series Forecasting in Federated Learning
Optimal Look-back Horizon for Time Series Forecasting in Federated Learning
Dahao Tang
Nan Yang
Yanli Li
Zhiyu Zhu
Zhibo Jin
Dong Yuan
AI4TSFedML
493
0
0
16 Nov 2025
Improved Scaling Laws in Linear Regression via Data Reuse
Licong Lin
Jingfeng Wu
Peter Bartlett
201
0
0
10 Jun 2025
From Text to Time? Rethinking the Effectiveness of the Large Language Model for Time Series Forecasting
From Text to Time? Rethinking the Effectiveness of the Large Language Model for Time Series Forecasting
Xinyu Zhang
Shanshan Feng
Xutao Li
AI4TS
221
2
0
09 Apr 2025
A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules
A Multi-Power Law for Loss Curve Prediction Across Learning Rate SchedulesInternational Conference on Learning Representations (ICLR), 2025
Kairong Luo
Haodong Wen
Shengding Hu
Zhenbo Sun
Zhiyuan Liu
Maosong Sun
Kaifeng Lyu
Wenguang Chen
CLL
289
13
0
17 Mar 2025
Scaling Law Phenomena Across Regression Paradigms: Multiple and Kernel Approaches
Yifang Chen
Xuyang Guo
Xiaoyu Li
Yingyu Liang
Zhenmei Shi
Zhao Song
341
4
0
03 Mar 2025
How to Upscale Neural Networks with Scaling Law? A Survey and Practical Guidelines
How to Upscale Neural Networks with Scaling Law? A Survey and Practical Guidelines
Ayan Sengupta
Ayan Sengupta
Tanmoy Chakraborty
491
4
0
17 Feb 2025
Explaining Context Length Scaling and Bounds for Language Models
Explaining Context Length Scaling and Bounds for Language Models
Jingzhe Shi
Qinwei Ma
Hongyi Liu
Hang Zhao
Jeng-Neng Hwang
Lei Li
LRM
703
10
0
03 Feb 2025
Physics of Skill Learning
Physics of Skill Learning
Ziming Liu
Yizhou Liu
Eric J. Michaud
Jeff Gore
Max Tegmark
375
2
0
21 Jan 2025
KAT to KANs: A Review of Kolmogorov-Arnold Networks and the Neural Leap
  Forward
KAT to KANs: A Review of Kolmogorov-Arnold Networks and the Neural Leap Forward
Divesh Basina
Joseph Raj Vishal
Aarya Choudhary
Bharatesh Chakravarthi
294
1
0
15 Nov 2024
An Empirical Study of Scaling Laws for Transfer
An Empirical Study of Scaling Laws for Transfer
Matthew Barnett
167
7
0
30 Aug 2024
Scaling Laws in Linear Regression: Compute, Parameters, and Data
Scaling Laws in Linear Regression: Compute, Parameters, and Data
Licong Lin
Jingfeng Wu
Sham Kakade
Peter L. Bartlett
Jason D. Lee
LRM
480
34
0
12 Jun 2024
Hardness of Learning Neural Networks under the Manifold Hypothesis
Hardness of Learning Neural Networks under the Manifold Hypothesis
B. Kiani
Jason Wang
Melanie Weber
263
15
0
03 Jun 2024
Survival of the Fittest Representation: A Case Study with Modular
  Addition
Survival of the Fittest Representation: A Case Study with Modular Addition
Xiaoman Delores Ding
Zifan Carl Guo
Eric J. Michaud
Ziming Liu
Max Tegmark
380
5
0
27 May 2024
Scaling Law for Time Series Forecasting
Scaling Law for Time Series Forecasting
Jingzhe Shi
Qinwei Ma
Huan Ma
Lei Li
AI4TS
329
21
0
24 May 2024
KAN: Kolmogorov-Arnold Networks
KAN: Kolmogorov-Arnold Networks
Ziming Liu
Yixuan Wang
Sachin Vaidya
Fabian Ruehle
James Halverson
Marin Soljacic
Thomas Y. Hou
Max Tegmark
995
1,319
0
30 Apr 2024
Language models scale reliably with over-training and on downstream
  tasks
Language models scale reliably with over-training and on downstream tasksInternational Conference on Learning Representations (ICLR), 2024
S. Gadre
Georgios Smyrnis
Vaishaal Shankar
Suchin Gururangan
Mitchell Wortsman
...
Y. Carmon
Achal Dave
Reinhard Heckel
Niklas Muennighoff
Ludwig Schmidt
ALMELMLRM
353
77
0
13 Mar 2024
A Resource Model For Neural Scaling Law
A Resource Model For Neural Scaling Law
Jinyeop Song
Ziming Liu
Max Tegmark
Jeff Gore
367
7
0
07 Feb 2024
Scaling Laws for Downstream Task Performance of Large Language Models
Scaling Laws for Downstream Task Performance of Large Language ModelsInternational Conference on Learning Representations (ICLR), 2024
Berivan Isik
Natalia Ponomareva
Hussein Hazimeh
Dimitris Paparas
Sergei Vassilvitskii
Sanmi Koyejo
325
49
0
06 Feb 2024
Scaling Laws in Jet Classification
Scaling Laws in Jet ClassificationSciPost Physics Core (SPC), 2023
Joshua D. Batson
Yonatan Kahn
191
4
0
04 Dec 2023
Compositional Capabilities of Autoregressive Transformers: A Study on
  Synthetic, Interpretable Tasks
Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable TasksInternational Conference on Machine Learning (ICML), 2023
Rahul Ramesh
Ekdeep Singh Lubana
Mikail Khona
Robert P. Dick
Hidenori Tanaka
CoGe
339
14
0
21 Nov 2023
A Neural Scaling Law from Lottery Ticket Ensembling
A Neural Scaling Law from Lottery Ticket Ensembling
Ziming Liu
Max Tegmark
253
4
0
03 Oct 2023
Improved Bayes Risk Can Yield Reduced Social Welfare Under Competition
Improved Bayes Risk Can Yield Reduced Social Welfare Under CompetitionNeural Information Processing Systems (NeurIPS), 2023
Meena Jagadeesan
Michael I. Jordan
Jacob Steinhardt
Nika Haghtalab
303
16
0
26 Jun 2023
Pythia: A Suite for Analyzing Large Language Models Across Training and
  Scaling
Pythia: A Suite for Analyzing Large Language Models Across Training and ScalingInternational Conference on Machine Learning (ICML), 2023
Stella Biderman
Hailey Schoelkopf
Quentin G. Anthony
Herbie Bradley
Kyle O'Brien
...
USVSN Sai Prashanth
Edward Raff
Aviya Skowron
Lintang Sutawika
Oskar van der Wal
397
1,667
0
03 Apr 2023
Exploring the Representation Manifolds of Stable Diffusion Through the
  Lens of Intrinsic Dimension
Exploring the Representation Manifolds of Stable Diffusion Through the Lens of Intrinsic Dimension
Henry Kvinge
Davis Brown
Charles Godfrey
DiffM
161
6
0
16 Feb 2023
Precision Machine Learning
Precision Machine Learning
Eric J. Michaud
Ziming Liu
Max Tegmark
173
40
0
24 Oct 2022
Scaling Laws for Reward Model Overoptimization
Scaling Laws for Reward Model OveroptimizationInternational Conference on Machine Learning (ICML), 2022
Leo Gao
John Schulman
Jacob Hilton
ALM
391
782
0
19 Oct 2022
Limitations of the NTK for Understanding Generalization in Deep Learning
Limitations of the NTK for Understanding Generalization in Deep Learning
Nikhil Vyas
Yamini Bansal
Preetum Nakkiran
310
38
0
20 Jun 2022
Deconstructing Distributions: A Pointwise Framework of Learning
Deconstructing Distributions: A Pointwise Framework of LearningInternational Conference on Learning Representations (ICLR), 2022
Gal Kaplun
Nikhil Ghosh
Saurabh Garg
Boaz Barak
Preetum Nakkiran
OOD
217
24
0
20 Feb 2022
Data Scaling Laws in NMT: The Effect of Noise and Architecture
Data Scaling Laws in NMT: The Effect of Noise and ArchitectureInternational Conference on Machine Learning (ICML), 2022
Yamini Bansal
Behrooz Ghorbani
Ankush Garg
Biao Zhang
M. Krikun
Colin Cherry
Behnam Neyshabur
Orhan Firat
248
61
0
04 Feb 2022
Nonlinear Initialization Methods for Low-Rank Neural Networks
Nonlinear Initialization Methods for Low-Rank Neural Networks
Kiran Vodrahalli
Rakesh Shivanna
M. Sathiamoorthy
Sagar Jain
Ed H. Chi
243
4
0
02 Feb 2022
Tensor network to learn the wavefunction of data
Tensor network to learn the wavefunction of dataPhysical Review Research (Phys. Rev. Res.), 2021
A. Dymarsky
K. Pavlenko
195
7
0
15 Nov 2021
Practical Galaxy Morphology Tools from Deep Supervised Representation
  Learning
Practical Galaxy Morphology Tools from Deep Supervised Representation Learning
Mike Walmsley
Anna M. M. Scaife
Chris J. Lintott
Michelle Lochner
Yu Zhu
...
Xibo Ma
Sandor Kruk
Zhen Lei
G. Guo
B. Simmons
180
36
0
25 Oct 2021
A Scaling Law for Synthetic-to-Real Transfer: How Much Is Your
  Pre-training Effective?
A Scaling Law for Synthetic-to-Real Transfer: How Much Is Your Pre-training Effective?
Hiroaki Mikami
Kenji Fukumizu
Shogo Murai
Shuji Suzuki
Yuta Kikuchi
Taiji Suzuki
S. Maeda
Kunihiko Miyoshi
351
14
0
25 Aug 2021
Topological Obstructions to Autoencoding
Topological Obstructions to AutoencodingJournal of High Energy Physics (JHEP), 2021
Joshua D. Batson
C. G. Haaf
Yonatan Kahn
Daniel A. Roberts
AI4CE
192
41
0
16 Feb 2021
Explaining Neural Scaling Laws
Explaining Neural Scaling LawsProceedings of the National Academy of Sciences of the United States of America (PNAS), 2021
Yasaman Bahri
Ethan Dyer
Jared Kaplan
Jaehoon Lee
Utkarsh Sharma
351
381
0
12 Feb 2021
Learning Curve Theory
Learning Curve Theory
Marcus Hutter
380
79
0
08 Feb 2021
Scaling Laws for Transfer
Scaling Laws for Transfer
Danny Hernandez
Jared Kaplan
T. Henighan
Sam McCandlish
502
285
0
02 Feb 2021
Towards Continual Reinforcement Learning: A Review and Perspectives
Towards Continual Reinforcement Learning: A Review and PerspectivesJournal of Artificial Intelligence Research (JAIR), 2020
Khimya Khetarpal
Matthew D Riemer
Irina Rish
Doina Precup
CLLOffRL
560
381
0
25 Dec 2020
Scaling Laws for Autoregressive Generative Modeling
Scaling Laws for Autoregressive Generative Modeling
T. Henighan
Jared Kaplan
Mor Katz
Mark Chen
Christopher Hesse
...
Nick Ryder
Daniel M. Ziegler
John Schulman
Dario Amodei
Sam McCandlish
476
560
0
28 Oct 2020
Distributional Generalization: A New Kind of Generalization
Distributional Generalization: A New Kind of Generalization
Preetum Nakkiran
Yamini Bansal
OOD
253
47
0
17 Sep 2020
Partial local entropy and anisotropy in deep weight spaces
Partial local entropy and anisotropy in deep weight spacesPhysical Review E (PRE), 2020
Daniele Musso
259
3
0
17 Jul 2020
The Depth-to-Width Interplay in Self-Attention
The Depth-to-Width Interplay in Self-Attention
Yoav Levine
Noam Wies
Or Sharir
Hofit Bata
Amnon Shashua
380
52
0
22 Jun 2020
1
Page 1 of 1