ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.21551
  4. Cited By
Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test
v1v2v3 (latest)

Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

26 June 2025
Ziyue Li
Chenrui Fan
Tianyi Zhou
ArXiv (abs)PDFHTMLHuggingFace (27 upvotes)

Papers citing "Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test"

1 / 1 papers shown
Rewiring Experts on the Fly:Continuous Rerouting for Better Online Adaptation in Mixture-of-Expert models
Rewiring Experts on the Fly:Continuous Rerouting for Better Online Adaptation in Mixture-of-Expert models
Guinan Su
Yanwu Yang
Li Shen
Lu Yin
Shiwei Liu
Jonas Geiping
MoEKELM
180
2
0
16 Oct 2025
1