History Rhymes: Accelerating LLM Reinforcement Learning with RhymeRL

26 August 2025

ArXiv (abs) | PDF | HTML | GitHub (15633★)

Papers citing "History Rhymes: Accelerating LLM Reinforcement Learning with RhymeRL"

9 / 9 papers shown

Fast LLM Post-training via Decoupled and Fastest-of-N Speculation

...

436

0

0

24 Dec 2025

Accelerating Large-Scale Reasoning Model Inference with Sparse Self-Speculative Decoding

...

Mohamed S. Abdelfattah

186

1

0

01 Dec 2025

CoPRIS: Efficient and Stable Reinforcement Learning via Concurrency-Controlled Partial Rollout with Importance Sampling

81

0

0

05 Nov 2025

ReSpec: Towards Optimizing Speculative Decoding in Reinforcement Learning Systems

104

2

0

30 Oct 2025

LANPO: Bootstrapping Language and Numerical Feedback for Reinforcement Learning in LLMs

Stefanie Jegelka

176

0

0

18 Oct 2025

Part II: ROLL Flash -- Accelerating RLVR and Agentic Training with Asynchrony

...

98

1

0

13 Oct 2025

Slow-Fast Policy Optimization: Reposition-Before-Update for LLM Reasoning

244

1

0

05 Oct 2025

Prosperity before Collapse: How Far Can Off-Policy RL Reach with Stale Data on LLMs?

154

5

0

01 Oct 2025

RollPacker: Mitigating Long-Tail Rollouts for Fast, Synchronous RL Post-Training

...

208

9

0

25 Sep 2025