2510.21339
Multi-turn Training with Basic Human Feedback Helps Little on LLM Reasoning
24 October 2025
Qiang Liu
Wuganjing Song
Zhenzhou Lin
Feifan Chen
Qiaolong Cai
Chen Li
Yongduo Sui
OffRL
LRM
Papers citing
"Multi-turn Training with Basic Human Feedback Helps Little on LLM Reasoning"
No papers found