ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2509.23958
  4. Cited By
Reinforcement Learning with Inverse Rewards for World Model Post-training

Reinforcement Learning with Inverse Rewards for World Model Post-training

28 September 2025
Yang Ye
Tianyu He
Shuo Yang
Jiang Bian
    VGen
ArXiv (abs)PDFHTML

Papers citing "Reinforcement Learning with Inverse Rewards for World Model Post-training"

1 / 1 papers shown
Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback
Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback
Zongjian Li
Zheyuan Liu
Qihui Zhang
Bin Lin
Feize Wu
...
Wangbo Yu
Yuwei Niu
Shaodong Wang
Xinhua Cheng
Li Yuan
400
13
0
19 Oct 2025
1