Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2509.22601
Cited By
v1
v2 (latest)
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
26 September 2025
Yulei Qin
Xiaoyu Tan
Zhengbao He
Gang Li
Haojia Lin
Zongyi Li
Zihan Xu
Yuchen Shi
Siqi Cai
Renting Rui
Shaofei Cai
Yuzheng Cai
Xuan Zhang
Sheng Ye
Ke Li
Xing Sun
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (9 upvotes)
Github (7★)
Papers citing
"Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning"
0 / 0 papers shown
Title
No papers found