Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2004.10802
Cited By
A Neural Scaling Law from the Dimension of the Data Manifold
22 April 2020
Utkarsh Sharma
Jared Kaplan
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"A Neural Scaling Law from the Dimension of the Data Manifold"
42 / 42 papers shown
Optimal Look-back Horizon for Time Series Forecasting in Federated Learning
Dahao Tang
Nan Yang
Yanli Li
Zhiyu Zhu
Zhibo Jin
Dong Yuan
AI4TS
FedML
493
0
0
16 Nov 2025
Improved Scaling Laws in Linear Regression via Data Reuse
Licong Lin
Jingfeng Wu
Peter Bartlett
201
0
0
10 Jun 2025
From Text to Time? Rethinking the Effectiveness of the Large Language Model for Time Series Forecasting
Xinyu Zhang
Shanshan Feng
Xutao Li
AI4TS
221
2
0
09 Apr 2025
A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules
International Conference on Learning Representations (ICLR), 2025
Kairong Luo
Haodong Wen
Shengding Hu
Zhenbo Sun
Zhiyuan Liu
Maosong Sun
Kaifeng Lyu
Wenguang Chen
CLL
289
13
0
17 Mar 2025
Scaling Law Phenomena Across Regression Paradigms: Multiple and Kernel Approaches
Yifang Chen
Xuyang Guo
Xiaoyu Li
Yingyu Liang
Zhenmei Shi
Zhao Song
341
4
0
03 Mar 2025
How to Upscale Neural Networks with Scaling Law? A Survey and Practical Guidelines
Ayan Sengupta
Ayan Sengupta
Tanmoy Chakraborty
491
4
0
17 Feb 2025
Explaining Context Length Scaling and Bounds for Language Models
Jingzhe Shi
Qinwei Ma
Hongyi Liu
Hang Zhao
Jeng-Neng Hwang
Lei Li
LRM
703
10
0
03 Feb 2025
Physics of Skill Learning
Ziming Liu
Yizhou Liu
Eric J. Michaud
Jeff Gore
Max Tegmark
375
2
0
21 Jan 2025
KAT to KANs: A Review of Kolmogorov-Arnold Networks and the Neural Leap Forward
Divesh Basina
Joseph Raj Vishal
Aarya Choudhary
Bharatesh Chakravarthi
294
1
0
15 Nov 2024
An Empirical Study of Scaling Laws for Transfer
Matthew Barnett
167
7
0
30 Aug 2024
Scaling Laws in Linear Regression: Compute, Parameters, and Data
Licong Lin
Jingfeng Wu
Sham Kakade
Peter L. Bartlett
Jason D. Lee
LRM
480
34
0
12 Jun 2024
Hardness of Learning Neural Networks under the Manifold Hypothesis
B. Kiani
Jason Wang
Melanie Weber
263
15
0
03 Jun 2024
Survival of the Fittest Representation: A Case Study with Modular Addition
Xiaoman Delores Ding
Zifan Carl Guo
Eric J. Michaud
Ziming Liu
Max Tegmark
380
5
0
27 May 2024
Scaling Law for Time Series Forecasting
Jingzhe Shi
Qinwei Ma
Huan Ma
Lei Li
AI4TS
329
21
0
24 May 2024
KAN: Kolmogorov-Arnold Networks
Ziming Liu
Yixuan Wang
Sachin Vaidya
Fabian Ruehle
James Halverson
Marin Soljacic
Thomas Y. Hou
Max Tegmark
995
1,319
0
30 Apr 2024
Language models scale reliably with over-training and on downstream tasks
International Conference on Learning Representations (ICLR), 2024
S. Gadre
Georgios Smyrnis
Vaishaal Shankar
Suchin Gururangan
Mitchell Wortsman
...
Y. Carmon
Achal Dave
Reinhard Heckel
Niklas Muennighoff
Ludwig Schmidt
ALM
ELM
LRM
353
77
0
13 Mar 2024
A Resource Model For Neural Scaling Law
Jinyeop Song
Ziming Liu
Max Tegmark
Jeff Gore
367
7
0
07 Feb 2024
Scaling Laws for Downstream Task Performance of Large Language Models
International Conference on Learning Representations (ICLR), 2024
Berivan Isik
Natalia Ponomareva
Hussein Hazimeh
Dimitris Paparas
Sergei Vassilvitskii
Sanmi Koyejo
325
49
0
06 Feb 2024
Scaling Laws in Jet Classification
SciPost Physics Core (SPC), 2023
Joshua D. Batson
Yonatan Kahn
191
4
0
04 Dec 2023
Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable Tasks
International Conference on Machine Learning (ICML), 2023
Rahul Ramesh
Ekdeep Singh Lubana
Mikail Khona
Robert P. Dick
Hidenori Tanaka
CoGe
339
14
0
21 Nov 2023
A Neural Scaling Law from Lottery Ticket Ensembling
Ziming Liu
Max Tegmark
253
4
0
03 Oct 2023
Improved Bayes Risk Can Yield Reduced Social Welfare Under Competition
Neural Information Processing Systems (NeurIPS), 2023
Meena Jagadeesan
Michael I. Jordan
Jacob Steinhardt
Nika Haghtalab
303
16
0
26 Jun 2023
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
International Conference on Machine Learning (ICML), 2023
Stella Biderman
Hailey Schoelkopf
Quentin G. Anthony
Herbie Bradley
Kyle O'Brien
...
USVSN Sai Prashanth
Edward Raff
Aviya Skowron
Lintang Sutawika
Oskar van der Wal
397
1,667
0
03 Apr 2023
Exploring the Representation Manifolds of Stable Diffusion Through the Lens of Intrinsic Dimension
Henry Kvinge
Davis Brown
Charles Godfrey
DiffM
161
6
0
16 Feb 2023
Precision Machine Learning
Eric J. Michaud
Ziming Liu
Max Tegmark
173
40
0
24 Oct 2022
Scaling Laws for Reward Model Overoptimization
International Conference on Machine Learning (ICML), 2022
Leo Gao
John Schulman
Jacob Hilton
ALM
391
782
0
19 Oct 2022
Limitations of the NTK for Understanding Generalization in Deep Learning
Nikhil Vyas
Yamini Bansal
Preetum Nakkiran
310
38
0
20 Jun 2022
Deconstructing Distributions: A Pointwise Framework of Learning
International Conference on Learning Representations (ICLR), 2022
Gal Kaplun
Nikhil Ghosh
Saurabh Garg
Boaz Barak
Preetum Nakkiran
OOD
217
24
0
20 Feb 2022
Data Scaling Laws in NMT: The Effect of Noise and Architecture
International Conference on Machine Learning (ICML), 2022
Yamini Bansal
Behrooz Ghorbani
Ankush Garg
Biao Zhang
M. Krikun
Colin Cherry
Behnam Neyshabur
Orhan Firat
248
61
0
04 Feb 2022
Nonlinear Initialization Methods for Low-Rank Neural Networks
Kiran Vodrahalli
Rakesh Shivanna
M. Sathiamoorthy
Sagar Jain
Ed H. Chi
243
4
0
02 Feb 2022
Tensor network to learn the wavefunction of data
Physical Review Research (Phys. Rev. Res.), 2021
A. Dymarsky
K. Pavlenko
195
7
0
15 Nov 2021
Practical Galaxy Morphology Tools from Deep Supervised Representation Learning
Mike Walmsley
Anna M. M. Scaife
Chris J. Lintott
Michelle Lochner
Yu Zhu
...
Xibo Ma
Sandor Kruk
Zhen Lei
G. Guo
B. Simmons
180
36
0
25 Oct 2021
A Scaling Law for Synthetic-to-Real Transfer: How Much Is Your Pre-training Effective?
Hiroaki Mikami
Kenji Fukumizu
Shogo Murai
Shuji Suzuki
Yuta Kikuchi
Taiji Suzuki
S. Maeda
Kunihiko Miyoshi
351
14
0
25 Aug 2021
Topological Obstructions to Autoencoding
Journal of High Energy Physics (JHEP), 2021
Joshua D. Batson
C. G. Haaf
Yonatan Kahn
Daniel A. Roberts
AI4CE
192
41
0
16 Feb 2021
Explaining Neural Scaling Laws
Proceedings of the National Academy of Sciences of the United States of America (PNAS), 2021
Yasaman Bahri
Ethan Dyer
Jared Kaplan
Jaehoon Lee
Utkarsh Sharma
351
381
0
12 Feb 2021
Learning Curve Theory
Marcus Hutter
380
79
0
08 Feb 2021
Scaling Laws for Transfer
Danny Hernandez
Jared Kaplan
T. Henighan
Sam McCandlish
502
285
0
02 Feb 2021
Towards Continual Reinforcement Learning: A Review and Perspectives
Journal of Artificial Intelligence Research (JAIR), 2020
Khimya Khetarpal
Matthew D Riemer
Irina Rish
Doina Precup
CLL
OffRL
560
381
0
25 Dec 2020
Scaling Laws for Autoregressive Generative Modeling
T. Henighan
Jared Kaplan
Mor Katz
Mark Chen
Christopher Hesse
...
Nick Ryder
Daniel M. Ziegler
John Schulman
Dario Amodei
Sam McCandlish
476
560
0
28 Oct 2020
Distributional Generalization: A New Kind of Generalization
Preetum Nakkiran
Yamini Bansal
OOD
253
47
0
17 Sep 2020
Partial local entropy and anisotropy in deep weight spaces
Physical Review E (PRE), 2020
Daniele Musso
259
3
0
17 Jul 2020
The Depth-to-Width Interplay in Self-Attention
Yoav Levine
Noam Wies
Or Sharir
Hofit Bata
Amnon Shashua
380
52
0
22 Jun 2020
1
Page 1 of 1