Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.14363
Cited By
Improving RL Exploration for LLM Reasoning through Retrospective Replay
19 April 2025
Shihan Dou
Muling Wu
Jingwen Xu
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
OffRL
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Improving RL Exploration for LLM Reasoning through Retrospective Replay"
Title
No papers