ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2510.24320
  4. Cited By
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

28 October 2025
Zhiheng Xi
Jixuan Huang
Xin Guo
Boyang Hong
Dingwen Yang
Xiaoran Fan
S. Li
Zehui Chen
Junjie Ye
Siyu Yuan
Zhengyin Du
Xuesong Yao
Yufei Xu
Jiecao Chen
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
    OffRLLRM
ArXiv (abs)PDFHTMLHuggingFace (18 upvotes)Github

Papers citing "Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning"

0 / 0 papers shown
Title

No papers found