ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.04376
  4. Cited By
Scaling laws for learning with real and surrogate data
v1v2 (latest)

Scaling laws for learning with real and surrogate data

6 February 2024
Ayush Jain
Andrea Montanari
Eren Sasoglu
ArXiv (abs)PDFHTML

Papers citing "Scaling laws for learning with real and surrogate data"

17 / 17 papers shown
Data Value in the Age of Scaling: Understanding LLM Scaling Dynamics Under Real-Synthetic Data Mixtures
Data Value in the Age of Scaling: Understanding LLM Scaling Dynamics Under Real-Synthetic Data Mixtures
Haohui Wang
Jingyuan Qi
Jianpeng Chen
Jun Wu
Lifu Huang
...
Balaji Veeramani
Edward Bowen
Alison Hu
Tyler Cody
Dawei Zhou
154
0
0
17 Nov 2025
Optimal Regularization for Performative Learning
Optimal Regularization for Performative Learning
Edwige Cyffers
Alireza Mirrokni
Marco Mondelli
105
0
0
14 Oct 2025
Beyond Real Data: Synthetic Data through the Lens of Regularization
Beyond Real Data: Synthetic Data through the Lens of Regularization
Amitis Shidani
Tyler Farghly
Yang Sun
Habib Ganjgahi
George Deligiannidis
219
0
0
09 Oct 2025
High-dimensional Analysis of Synthetic Data Selection
High-dimensional Analysis of Synthetic Data Selection
Parham Rezaei
Filip Kovačević
Francesco Locatello
Marco Mondelli
168
0
0
09 Oct 2025
Filtering with Confidence: When Data Augmentation Meets Conformal Prediction
Filtering with Confidence: When Data Augmentation Meets Conformal Prediction
Zixuan Wu
So Won Jeong
Yating Liu
Yeo Jin Jung
Claire Donnat
149
0
0
25 Sep 2025
When Models Don't Collapse: On the Consistency of Iterative MLE
When Models Don't Collapse: On the Consistency of Iterative MLE
Daniel Barzilai
Ohad Shamir
SyDa
186
3
0
25 May 2025
A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules
A Multi-Power Law for Loss Curve Prediction Across Learning Rate SchedulesInternational Conference on Learning Representations (ICLR), 2025
Kairong Luo
Haodong Wen
Shengding Hu
Zhenbo Sun
Zhiyuan Liu
Maosong Sun
Kaifeng Lyu
Wenguang Chen
CLL
289
13
0
17 Mar 2025
MixMin: Finding Data Mixtures via Convex Minimization
MixMin: Finding Data Mixtures via Convex Minimization
Anvith Thudi
Evianne Rovers
Yangjun Ruan
Tristan Thrush
Chris J. Maddison
314
1
0
14 Feb 2025
Rate of Model Collapse in Recursive Training
Rate of Model Collapse in Recursive TrainingInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024
A. Suresh
A. Thangaraj
Aditya Nanda Kishore Khandavally
SyDa
207
11
0
23 Dec 2024
Loss-to-Loss Prediction: Scaling Laws for All Datasets
Loss-to-Loss Prediction: Scaling Laws for All Datasets
David Brandfonbrener
Nikhil Anand
Nikhil Vyas
Eran Malach
Sham Kakade
289
12
0
19 Nov 2024
Universality of the $π^2/6$ Pathway in Avoiding Model Collapse
Universality of the π2/6π^2/6π2/6 Pathway in Avoiding Model Collapse
Apratim Dey
D. Donoho
300
14
0
30 Oct 2024
High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws
High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling LawsInternational Conference on Learning Representations (ICLR), 2024
M. E. Ildiz
Halil Alperen Gozeten
Ege Onur Taga
Marco Mondelli
Samet Oymak
497
13
0
24 Oct 2024
Collapse or Thrive? Perils and Promises of Synthetic Data in a Self-Generating World
Collapse or Thrive? Perils and Promises of Synthetic Data in a Self-Generating World
Joshua Kazdan
Rylan Schaeffer
Apratim Dey
Matthias Gerstgrasser
Rafael Rafailov
D. Donoho
Sanmi Koyejo
612
36
0
22 Oct 2024
Analyzing Neural Scaling Laws in Two-Layer Networks with Power-Law Data
  Spectra
Analyzing Neural Scaling Laws in Two-Layer Networks with Power-Law Data SpectraInternational Conference on Learning Representations (ICLR), 2024
Roman Worschech
B. Rosenow
355
1
0
11 Oct 2024
Strong Model Collapse
Strong Model CollapseInternational Conference on Learning Representations (ICLR), 2024
Elvis Dohmatob
Yunzhen Feng
Arjun Subramonian
Julia Kempe
277
35
0
07 Oct 2024
Scaling Laws in Linear Regression: Compute, Parameters, and Data
Scaling Laws in Linear Regression: Compute, Parameters, and Data
Licong Lin
Jingfeng Wu
Sham Kakade
Peter L. Bartlett
Jason D. Lee
LRM
471
33
0
12 Jun 2024
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance
Jiasheng Ye
Peiju Liu
Tianxiang Sun
Yunhua Zhou
Jun Zhan
Xipeng Qiu
379
107
0
25 Mar 2024
1