ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2508.04324
  4. Cited By
TempFlow-GRPO: When Timing Matters for GRPO in Flow Models
v1v2v3v4 (latest)

TempFlow-GRPO: When Timing Matters for GRPO in Flow Models

6 August 2025
Xiaoxuan He
Siming Fu
Yuke Zhao
W. Li
Zhiqiang Wang
Dacheng Yin
Fengyun Rao
Bo Zhang
    AI4CE
ArXiv (abs)PDFHTMLHuggingFace (9 upvotes)

Papers citing "TempFlow-GRPO: When Timing Matters for GRPO in Flow Models"

13 / 13 papers shown
Multi-GRPO: Multi-Group Advantage Estimation for Text-to-Image Generation with Tree-Based Trajectories and Multiple Rewards
Multi-GRPO: Multi-Group Advantage Estimation for Text-to-Image Generation with Tree-Based Trajectories and Multiple Rewards
Qiang Lyu
Z. Chen
C. Wang
Haolin Shi
Shibo Gao
...
Jianlou Si
Fei Ding
Jing Li
Chun Pong Lau
Weiqiang Wang
EGVM
121
1
0
30 Nov 2025
Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning
Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning
Guanjie Chen
Shirui Huang
Kai Liu
J. Zhu
Xiaoye Qu
Peng Chen
Yu Cheng
Yifu Sun
188
1
0
25 Nov 2025
Growing with the Generator: Self-paced GRPO for Video Generation
Growing with the Generator: Self-paced GRPO for Video Generation
Rui Li
Yuanzhi Liang
Ziqi Ni
H. Huang
Chi Zhang
Xuelong Li
EGVMVGen
120
0
0
24 Nov 2025
Seeing What Matters: Visual Preference Policy Optimization for Visual Generation
Seeing What Matters: Visual Preference Policy Optimization for Visual Generation
Ziqi Ni
Yuanzhi Liang
Rui Li
Yi Zhou
H. Huang
Chi Zhang
Xuelong Li
115
0
0
24 Nov 2025
$π_\texttt{RL}$: Online RL Fine-tuning for Flow-based Vision-Language-Action Models
πRLπ_\texttt{RL}πRL​: Online RL Fine-tuning for Flow-based Vision-Language-Action Models
Kang Chen
Zhihao Liu
T. Zhang
Zhen Guo
Si Xu
...
Guoliang Fan
T. Huang
Yu Wang
Yu Wang
Chao Yu
OffRLVLM
557
0
0
29 Oct 2025
GRPO-Guard: Mitigating Implicit Over-Optimization in Flow Matching via Regulated Clipping
GRPO-Guard: Mitigating Implicit Over-Optimization in Flow Matching via Regulated Clipping
Jing Wang
Jiajun Liang
Jie Liu
Henglin Liu
Gongye Liu
...
Zhenyu Xie
Xintao Wang
Meng Wang
Pengfei Wan
Xiaodan Liang
160
1
0
25 Oct 2025
Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image Generation
Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image Generation
Yifu Luo
Penghui Du
Bo Li
Sinan Du
Tiantian Zhang
Yongzhe Chang
Kai Wu
Kun Gai
Xueqian Wang
151
4
0
24 Oct 2025
Smart-GRPO: Smartly Sampling Noise for Efficient RL of Flow-Matching Models
Smart-GRPO: Smartly Sampling Noise for Efficient RL of Flow-Matching Models
Benjamin Yu
Jackie Liu
Justin Cui
130
1
0
03 Oct 2025
Advantage Weighted Matching: Aligning RL with Pretraining in Diffusion Models
Advantage Weighted Matching: Aligning RL with Pretraining in Diffusion Models
Shuchen Xue
Chongjian Ge
Shilong Zhang
Yichen Li
Zhi-Ming Ma
139
3
0
29 Sep 2025
Enhancing Blind Face Restoration through Online Reinforcement Learning
Enhancing Blind Face Restoration through Online Reinforcement Learning
Bin Wu
Yahui Liu
Chi Zhang
Yao-Min Zhao
Wei Wang
CVBMOffRLCLLOnRL
429
0
0
27 Sep 2025
Dynamic-TreeRPO: Breaking the Independent Trajectory Bottleneck with Structured Sampling
Dynamic-TreeRPO: Breaking the Independent Trajectory Bottleneck with Structured Sampling
Xiaolong Fu
Lichen Ma
Zipeng Guo
Gaojing Zhou
Chongxiao Wang
...
Tan Lit Sin
Yu Shi
Zhen Chen
Junshi Huang
Jason Li
184
4
0
27 Sep 2025
BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models
BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models
Yuming Li
Y. Wang
Yuying Zhu
Zhongyu Zhao
Ming Lu
Qi She
Shanghang Zhang
277
14
0
07 Sep 2025
Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning
Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning
Y. Wang
Zhimin Li
Yuhang Zang
Yujie Zhou
Jiazi Bu
Chunyu Wang
Qinglin Lu
Cheng Jin
Jiaqi Wang
EGVM
148
30
0
28 Aug 2025
1