Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2501.13918
Cited By
v1
v2 (latest)
Improving Video Generation with Human Feedback
23 January 2025
Jie Liu
Gongye Liu
Jiajun Liang
Ziyang Yuan
Xiaokun Liu
Mingwu Zheng
Xiele Wu
Qiulin Wang
Wenyu Qin
Menghan Xia
Xintao Wang
Xiaohong Liu
Fei Yang
Pengfei Wan
Di Zhang
Kun Gai
Yujiu Yang
VGen
EGVM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (51 upvotes)
Papers citing
"Improving Video Generation with Human Feedback"
50 / 131 papers shown
Title
Video Generation Models Are Good Latent Reward Models
Xiaoyue Mi
W. Yu
Jiesong Lian
Shibo Jie
Ruizhe Zhong
...
Z. Zhou
Zhiyong Xu
Yuan Zhou
Qinglin Lu
Fan Tang
EGVM
VGen
182
0
0
24 Dec 2025
Reinforcement Learning for Large Model: A Survey
Weijia Wu
Chen Gao
Joya Chen
Kevin Lin
Qingwei Meng
Yiming Zhang
Yuke Qiu
Hong Zhou
Mike Zheng Shou
273
2
0
24 Dec 2025
Rethinking Prompt Design for Inference-time Scaling in Text-to-Visual Generation
Subin Kim
Sangwoo Mo
Mamshad Nayeem Rizve
Yiran Xu
Difan Liu
Jinwoo Shin
Tobias Hinz
LRM
112
0
0
03 Dec 2025
YingVideo-MV: Music-Driven Multi-Stage Video Generation
Jiahui Chen
Weida Wang
Runhua Shi
Huan Yang
Chaofan Ding
Zihao Chen
DiffM
VGen
149
0
0
02 Dec 2025
Taming Camera-Controlled Video Generation with Verifiable Geometry Reward
Zhaoqing Wang
Xiaobo Xia
Zhuolin Bie
Jinlin Liu
Dongdong Yu
Jia-Wang Bian
Changhu Wang
EGVM
VGen
129
0
0
02 Dec 2025
RULER-Bench: Probing Rule-based Reasoning Abilities of Next-level Video Generation Models for Vision Foundation Intelligence
Xuming He
Zehao Fan
Hengjia Li
Fan Zhuo
Hankun Xu
Senlin Cheng
Di Weng
Haifeng Liu
Can Ye
Boxi Wu
VGen
ELM
184
0
0
02 Dec 2025
GrndCtrl: Grounding World Models via Self-Supervised Reward Alignment
Haoyang He
Jay Patrikar
Dong-Ki Kim
Max Smith
Daniel McGann
Ali-Akbar Agha-Mohammadi
Shayegan Omidshafiei
Sebastian Scherer
VGen
105
0
0
01 Dec 2025
McSc: Motion-Corrective Preference Alignment for Video Generation with Self-Critic Hierarchical Reasoning
Q. Yang
Yingjie Chen
Yuan Yao
Yifang Men
Huaizhuo Liu
Miaomiao Cui
EGVM
VGen
206
0
0
28 Nov 2025
TEAR: Temporal-aware Automated Red-teaming for Text-to-Video Models
Jiaming He
Guanyu Hou
Hongwei Li
Zhicong Huang
Kangjie Chen
Yi Yu
Wenbo Jiang
Guowen Xu
Tianwei Zhang
EGVM
VGen
175
0
0
26 Nov 2025
Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following
Tianyi Xiong
Yi Ge
Ming Li
Zuolong Zhang
Pranav Kulkarni
...
Yanshuo Chen
X. Wang
Renrui Zhang
Wenhu Chen
Heng Huang
169
0
0
26 Nov 2025
The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation
Weijia Mao
Hao Chen
Zhenheng Yang
Mike Zheng Shou
EGVM
244
0
0
25 Nov 2025
Diverse Video Generation with Determinantal Point Process-Guided Policy Optimization
Tahira Kazimi
Connor Dunlop
Pinar Yanardag
VGen
85
0
0
25 Nov 2025
MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models
Chieh-Yun Chen
Zhonghao Wang
Qi-An Chen
Zhifan Ye
Min Shi
...
Wei-An Lin
Yiru Shen
Ajinkya Kale
Irfan Essa
Humphrey Shi
112
0
0
25 Nov 2025
Growing with the Generator: Self-paced GRPO for Video Generation
Rui Li
Yuanzhi Liang
Ziqi Ni
H. Huang
Chi Zhang
Xuelong Li
EGVM
VGen
104
0
0
24 Nov 2025
Beyond Reward Margin: Rethinking and Resolving Likelihood Displacement in Diffusion Models via Video Generation
Ruojun Xu
Yu Kai
Xuhua Ren
Jiaxiang Cheng
Bing Ma
Tianxiang Zheng
Qinhlin Lu
EGVM
136
0
0
24 Nov 2025
Seeing What Matters: Visual Preference Policy Optimization for Visual Generation
Ziqi Ni
Yuanzhi Liang
Rui Li
Yi Zhou
H. Huang
Chi Zhang
Xuelong Li
40
0
0
24 Nov 2025
Learning What to Trust: Bayesian Prior-Guided Optimization for Visual Generation
Ruiying Liu
Yuanzhi Liang
Haibin Huang
Tianshu Yu
Chi Zhang
73
0
0
24 Nov 2025
Simulating the Visual World with Artificial Intelligence: A Roadmap
Jingtong Yue
Z. Huang
Z. Chen
Xintao Wang
Pengfei Wan
Ziwei Liu
VGen
LM&Ro
374
0
0
11 Nov 2025
AlignSurvey: A Comprehensive Benchmark for Human Preferences Alignment in Social Surveys
Chenxi Lin
Weikang Yuan
Zhuoren Jiang
Biao Huang
Ruitao Zhang
Jianan Ge
Yueqian Xu
Jianxing Yu
ALM
541
0
0
11 Nov 2025
Generating an Image From 1,000 Words: Enhancing Text-to-Image With Structured Captions
Eyal Gutflaish
Eliran Kachlon
Hezi Zisman
Tal Hacham
Nimrod Sarid
...
Saar Huberman
Gal Davidi
Guy Bukchin
Kfir Goldberg
Ron Mokady
DiffM
VLM
213
1
0
10 Nov 2025
PreResQ-R1: Towards Fine-Grained Rank-and-Score Reinforcement Learning for Visual Quality Assessment via Preference-Response Disentangled Policy Optimization
Zehui Feng
Tian Qiu
Tong Wu
Junxuan Li
Huayuan Xu
Ting Han
120
0
0
07 Nov 2025
PhysCorr: Dual-Reward DPO for Physics-Constrained Text-to-Video Generation with Automated Preference Selection
Peiyao Wang
Weining Wang
Qi Li
EGVM
VGen
375
1
0
06 Nov 2025
Reg-DPO: SFT-Regularized Direct Preference Optimization with GT-Pair for Improving Video Generation
Jie Du
Xinyu Gong
Qingshan Tan
W. Li
Yangming Cheng
Weitao Wang
Chenlu Zhan
Suhui Wu
H. Zhang
J. Zhang
VGen
272
0
0
03 Nov 2025
World Simulation with Video Foundation Models for Physical AI
Nvidia
A. M. Ali
Junjie Bai
Maciej Bala
Yogesh Balaji
...
Jing Zhang
Qinsheng Zhang
Kaiwen Zheng
Andrew Zhu
Yuke Zhu
VGen
PINN
407
15
0
28 Oct 2025
Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences
Zhuoran Jin
Hongbang Yuan
Kejian Zhu
Jiachun Li
Pengfei Cao
Yubo Chen
Kang Liu
Jun Zhao
113
0
0
27 Oct 2025
LongCat-Video Technical Report
M-A-P Team
Xunliang Cai
Qilong Huang
Zhuoliang Kang
Hongyu Li
...
Liya Ma
Siyu Ren
Xiaoming Wei
Rixu Xie
Tong Zhang
VGen
VLM
139
6
0
25 Oct 2025
GRPO-Guard: Mitigating Implicit Over-Optimization in Flow Matching via Regulated Clipping
Jing Wang
Jiajun Liang
Jie Liu
Henglin Liu
Gongye Liu
...
Zhenyu Xie
Xintao Wang
Meng Wang
Pengfei Wan
Xiaodan Liang
132
1
0
25 Oct 2025
Epipolar Geometry Improves Video Generation Models
Orest Kupyn
Fabian Manhardt
F. Tombari
Christian Rupprecht
VGen
214
0
0
24 Oct 2025
ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints
Meiqi Wu
Jiashu Zhu
Xiaokun Feng
C. L. Philip Chen
Chen Zhu
Bingze Song
Fangyuan Mao
Jiahong Wu
Xiangxiang Chu
Kaiqi Huang
VGen
EGVM
VLM
342
0
0
16 Oct 2025
RealDPO: Real or Not Real, that is the Preference
Guo Cheng
Danni Yang
Ziqi Huang
Jianlou Si
Chenyang Si
Ziwei Liu
VGen
298
0
0
16 Oct 2025
Identity-Preserving Image-to-Video Generation via Reward-Guided Optimization
Liao Shen
Wentao Jiang
Yiran Zhu
Tiezheng Ge
Z. Cao
Bo Zheng
Bo Zheng
VGen
292
4
0
16 Oct 2025
Identity-GRPO: Optimizing Multi-Human Identity-preserving Video Generation via Reinforcement Learning
Xiangyu Meng
Zixian Zhang
Zhenghao Zhang
Junchao Liao
Long Qin
Weizhi Wang
VGen
171
2
0
16 Oct 2025
PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning
S. Ji
Xi Chen
Xin Tao
Pengfei Wan
Hengshuang Zhao
VGen
PINN
206
3
0
15 Oct 2025
Playmate2: Training-Free Multi-Character Audio-Driven Animation via Diffusion Transformer with Reward Feedback
Xingpei Ma
Shenneng Huang
Jiaran Cai
Yuansheng Guan
Shen Zheng
HanFeng Zhao
Qiang Zhang
Shunsi Zhang
VGen
149
3
0
14 Oct 2025
VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning
Q. Wang
Jie Liu
Jiajun Liang
Yilei Jiang
Yuanxing Zhang
...
Y. Zheng
Xintao Wang
Pengfei Wan
Xiangyu Yue
Jiaheng Liu
OffRL
VGen
LRM
333
1
0
12 Oct 2025
Self-Forcing++: Towards Minute-Scale High-Quality Video Generation
Justin Cui
Jie Wu
Ming Li
Tao Yang
Xiaojie Li
Rui Wang
Andrew Bai
Yuanhao Ban
Cho-Jui Hsieh
DiffM
VGen
201
24
0
02 Oct 2025
Aligning Video Models with Human Social Judgments via Behavior-Guided Fine-Tuning
Kathy Garcia
Leyla Isik
96
0
0
01 Oct 2025
Stable Cinemetrics : Structured Taxonomy and Evaluation for Professional Video Generation
Agneet Chatterjee
Rahim Entezari
Maksym Zhuravinskyi
Maksim Lapin
Reshinth Adithyan
Amit Raj
Chitta Baral
Yezhou Yang
Varun Jampani
DiffM
EGVM
VGen
133
0
0
30 Sep 2025
EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing
Keming Wu
Sicong Jiang
Max Ku
Ping Nie
Minghao Liu
Wenhu Chen
116
9
0
30 Sep 2025
Reinforcement Learning with Inverse Rewards for World Model Post-training
Yang Ye
Tianyu He
Shuo Yang
Jiang Bian
VGen
135
1
0
28 Sep 2025
Follow-Your-Preference: Towards Preference-Aligned Image Inpainting
Yutao Shen
Junkun Yuan
Toru Aonishi
Hideki Nakayama
Yue Ma
EGVM
160
3
0
27 Sep 2025
Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs
Xingyu Fu
Siyi Liu
Yinuo Xu
Pan Lu
Guangqiuse Hu
...
Chung Un Lee
Yejin Choi
James Zou
Dan Roth
Chris Callison-Burch
117
0
0
26 Sep 2025
Soft-Di[M]O: Improving One-Step Discrete Image Generation with Soft Embeddings
Yuanzhi Zhu
Xi Wang
Stéphane Lathuilière
Vicky Kalogeiton
128
2
0
26 Sep 2025
VideoScore2: Think before You Score in Generative Video Evaluation
Xuan He
Dongfu Jiang
Ping Nie
Minghao Liu
Z. L. Jiang
...
Qunshu Lin
Yuanxing Zhang
Ge Zhang
Wenhao Huang
Wenhu Chen
EGVM
VGen
LRM
1.2K
4
0
26 Sep 2025
MVQA-68K: A Multi-dimensional and Causally-annotated Dataset with Quality Interpretability for Video Assessment
Yanyun Pu
Kehan Li
Zeyi Huang
Zhijie Zhong
Kaixiang Yang
VGen
64
0
0
15 Sep 2025
GeneVA: A Dataset of Human Annotations for Generative Text to Video Artifacts
Jenna Kang
Maria Silva
Patsorn Sangkloy
Kenneth Chen
Niall Williams
Qi Sun
EGVM
VGen
136
0
0
10 Sep 2025
RewardDance: Reward Scaling in Visual Generation
Jie Wu
Yu Gao
Zilyu Ye
Ming Li
Liang Li
...
Zeyue Xue
Xiaoxia Hou
Wei Liu
Yan Zeng
Weilin Huang
EGVM
209
16
0
10 Sep 2025
BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models
Yuming Li
Y. Wang
Yuying Zhu
Zhongyu Zhao
Ming Lu
Qi She
Shanghang Zhang
272
10
0
07 Sep 2025
FantasyHSI: Video-Generation-Centric 4D Human Synthesis In Any Scene through A Graph-based Multi-Agent Framework
Lingzhou Mu
Qiang Wang
Fan Jiang
Mengchao Wang
Yaqi Fan
Mu Xu
Kai Zhang
VGen
144
0
0
01 Sep 2025
OneReward: Unified Mask-Guided Image Generation via Multi-Task Human Preference Learning
Yuan Gong
Xionghui Wang
Jie Wu
Shiyin Wang
Yitong Wang
Xinglong Wu
DiffM
OffRL
91
8
0
28 Aug 2025
1
2
3
Next