ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2504.04151
  4. Cited By
STEP: Staged Parameter-Efficient Pre-training for Large Language Models

STEP: Staged Parameter-Efficient Pre-training for Large Language Models

5 April 2025
Kazuki Yano
Takumi Ito
Jun Suzuki
    LRM
ArXivPDFHTML

Papers citing "STEP: Staged Parameter-Efficient Pre-training for Large Language Models"

1 / 1 papers shown
Title
Scaling Laws for Neural Language Models
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
220
3,054
0
23 Jan 2020
1