ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2509.22601
  4. Cited By
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
v1v2 (latest)

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

26 September 2025
Yulei Qin
Xiaoyu Tan
Zhengbao He
Gang Li
Haojia Lin
Zongyi Li
Zihan Xu
Yuchen Shi
Siqi Cai
Renting Rui
Shaofei Cai
Yuzheng Cai
Xuan Zhang
Sheng Ye
Ke Li
Xing Sun
ArXiv (abs)PDFHTMLHuggingFace (9 upvotes)Github (7★)

Papers citing "Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning"

0 / 0 papers shown
Title

No papers found