ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2510.21339
  4. Cited By
Multi-turn Training with Basic Human Feedback Helps Little on LLM Reasoning
v1v2 (latest)

Multi-turn Training with Basic Human Feedback Helps Little on LLM Reasoning

24 October 2025
Qiang Liu
Wuganjing Song
Zhenzhou Lin
Feifan Chen
Qiaolong Cai
Chen Li
Yongduo Sui
    OffRLLRM
ArXiv (abs)PDFHTML

Papers citing "Multi-turn Training with Basic Human Feedback Helps Little on LLM Reasoning"

0 / 0 papers shown

No papers found