2601.22607
Cited By
v1
v2 (latest)
From Self-Evolving Synthetic Data to Verifiable-Reward RL: Post-Training Multi-turn Interactive Tool-Using Agents
30 January 2026
Jiaxuan Gao
Jiaao Chen
Chuyi He
Wei-Chen Wang
Shusheng Xu
Hanrui Wang
Di Jin
Yi Wu
OffRL
SyDa
ArXiv (abs)
PDF
HTML
Github (3489★)
Papers citing
"From Self-Evolving Synthetic Data to Verifiable-Reward RL: Post-Training Multi-turn Interactive Tool-Using Agents"
No papers found