ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.02835
  4. Cited By
R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

5 May 2025
Yi-Fan Zhang
Xingyu Lu
X. Hu
Chaoyou Fu
Bin Wen
Tianke Zhang
Changyi Liu
Kaiyu Jiang
Kaibing Chen
Kaiyu Tang
Haojie Ding
J. Chen
Fan Yang
Z. Zhang
Tingting Gao
Liang Wang
    OffRL
    LRM
ArXivPDFHTML

Papers citing "R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning"

Title
No papers