ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2411.13503
  4. Cited By
VBench++: Comprehensive and Versatile Benchmark Suite for Video
  Generative Models

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

20 November 2024
Ziqi Huang
Fan Zhang
Xiaojie Xu
Yinan He
Jiashuo Yu
Z. Dong
Qianli Ma
Nattapol Chanpaisit
Chenyang Si
Yuming Jiang
Yaohui Wang
Xinyuan Chen
Yuxiao Chen
Limin Wang
Dahua Lin
Yu Qiao
Ziqiang Liu
    VGen
ArXiv (abs)PDFHTMLHuggingFace (35 upvotes)

Papers citing "VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models"

33 / 83 papers shown
Towards Holistic Visual Quality Assessment of AI-Generated Videos: A LLM-Based Multi-Dimensional Evaluation Model
Towards Holistic Visual Quality Assessment of AI-Generated Videos: A LLM-Based Multi-Dimensional Evaluation Model
Zelu Qi
Ping Shi
C. Zhang
Shuqi Wang
F. Zhao
Da Pan
Zefeng Ying
EGVMVGen
377
1
0
05 Jun 2025
DualX-VSR: Dual Axial Spatial$\times$Temporal Transformer for Real-World Video Super-Resolution without Motion Compensation
DualX-VSR: Dual Axial Spatial×\times×Temporal Transformer for Real-World Video Super-Resolution without Motion Compensation
Shuo Cao
Yihao Liu
Xiaohui Li.Yuanting Gao.Yu Zhou
Yuanting Gao
Yu Zhou
Chao Dong
301
0
0
05 Jun 2025
Training-Free Efficient Video Generation via Dynamic Token Carving
Training-Free Efficient Video Generation via Dynamic Token Carving
Yuechen Zhang
Jinbo Xing
Bin Xia
Shaoteng Liu
Bohao Peng
Xin Tao
Pengfei Wan
Eric Lo
Jiaya Jia
DiffMVGen
438
17
0
22 May 2025
EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models
EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models
Hu Yue
Siyuan Huang
Yue Liao
Shengcong Chen
Pengfei Zhou
Liliang Chen
Maoqing Yao
Maoqing Yao
VGen
318
12
0
14 May 2025
T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models
T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models
Xuyang Guo
Jiayan Huo
Zhenmei Shi
Zhao Song
Jiahao Zhang
Jiale Zhao
VGen
1.1K
8
0
08 May 2025
VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video Understanding
VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video Understanding
Zongxia Li
Xiyang Wu
Guangyao Shi
Yubin Qin
Hongyang Du
Tianyi Zhou
Wanrong Zhu
Dinesh Manocha
Jordan Lee Boyd-Graber
MLLM
651
0
0
02 May 2025
Controllable Weather Synthesis and Removal with Video Diffusion Models
Controllable Weather Synthesis and Removal with Video Diffusion Models
Chih-Hao Lin
Liang Luo
Ruofan Liang
Yuxuan Zhang
Sanja Fidler
Shenlong Wang
Zan Gojcic
DiffMVGen
354
4
0
01 May 2025
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation
Chenjie Cao
Jingkai Zhou
Shikai Li
Jingyun Liang
Chaohui Yu
Fan Wang
Xiangyang Xue
Yanwei Fu
VGenDiffM
434
27
0
21 Apr 2025
RealisDance-DiT: Simple yet Strong Baseline towards Controllable Character Animation in the Wild
RealisDance-DiT: Simple yet Strong Baseline towards Controllable Character Animation in the Wild
Jingkai Zhou
Yifan Wu
Shikai Li
Min Wei
Chao Fan
Weihua Chen
Wei Jiang
Fan Wang
VGen
268
12
0
21 Apr 2025
Frame Context Packing and Drift Prevention in Next-Frame-Prediction Video Diffusion Models
Frame Context Packing and Drift Prevention in Next-Frame-Prediction Video Diffusion Models
Lvmin Zhang
S. Cai
Muyang Li
Gordon Wetzstein
Maneesh Agrawala
DiffMVGen
551
43
0
17 Apr 2025
Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments
Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments
Chenyu Zhang
Daniil Cherniavskii
Antonios Tragoudaras
Antonios Vozikis
Thijmen Nijdam
Thijmen Nijdam
Mark Bodracska
Mark Bodracska
Andrii Zadaianchuk
E. Gavves
EGVMVGen
294
13
0
03 Apr 2025
SkyReels-A2: Compose Anything in Video Diffusion Transformers
SkyReels-A2: Compose Anything in Video Diffusion Transformers
Zhengcong Fei
Didong Li
Di Qiu
Jiadong Wang
Yikun Dou
...
Jinfeng Xu
Mingyuan Fan
Guibin Chen
Yang Li
Yahui Zhou
DiffMVGen
333
35
0
03 Apr 2025
WorldScore: A Unified Evaluation Benchmark for World Generation
WorldScore: A Unified Evaluation Benchmark for World Generation
Haoyi Duan
Hong-Xing Yu
Sirui Chen
L. Fei-Fei
Jiajun Wu
VGen
401
46
0
01 Apr 2025
VideoGen-Eval: Agent-based System for Video Generation Evaluation
VideoGen-Eval: Agent-based System for Video Generation Evaluation
Yuhang Yang
Ke Fan
Siyang Song
Hongxiang Li
Ailing Zeng
FeiLin Han
Wei-dong Zhai
Wen Liu
Yang Cao
Zheng-jun Zha
EGVMVGen
419
9
0
30 Mar 2025
VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness
VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness
Dian Zheng
Ziqi Huang
Hongbo Liu
Kai Zou
Yinan He
...
Jingwen He
Wei-Shi Zheng
Botian Shi
Yu Qiao
Ziwei Liu
EGVMVGen
339
95
0
27 Mar 2025
Multi-Object Sketch Animation by Scene Decomposition and Motion Planning
Multi-Object Sketch Animation by Scene Decomposition and Motion Planning
Jingyu Liu
Zijie Xin
Yuhan Fu
Ruixiang Zhao
Bangxiang Lan
Xirong Li
283
2
0
25 Mar 2025
CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models
CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models
Weichen Fan
Amber Yijia Zheng
Raymond A. Yeh
Yu Qiao
387
19
0
24 Mar 2025
SceneEval: Evaluating Semantic Coherence in Text-Conditioned 3D Indoor Scene Synthesis
SceneEval: Evaluating Semantic Coherence in Text-Conditioned 3D Indoor Scene Synthesis
Hou In Ivan Tam
Hou In Derek Pun
Austin T. Wang
Angel X. Chang
Manolis Savva
444
5
0
18 Mar 2025
MagicDistillation: Weak-to-Strong Video Distillation for Large-Scale Few-Step Synthesis
MagicDistillation: Weak-to-Strong Video Distillation for Large-Scale Few-Step Synthesis
Shitong Shao
Hongwei Yi
Hanzhong Guo
Tian Ye
Daquan Zhou
Michael Lingelbach
Zhiqiang Xu
Bo Han
VGen
381
0
0
17 Mar 2025
Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model
Haoyang Huang
Guoqing Ma
Nan Duan
Xing Chen
Changyi Wan
...
Xiangyu Zhang
Yi Xiu
Yibo Zhu
H. Shum
Daxin Jiang
VGen
237
16
0
14 Mar 2025
CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models
Hao He
Ceyuan Yang
Shanchuan Lin
Yinghao Xu
Meng Wei
Liangke Gui
Qi Zhao
Gordon Wetzstein
Lu Jiang
Hongsheng Li
DiffMVGen
392
42
0
13 Mar 2025
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation
Hyeonho Jeong
Suhyeon Lee
Jong Chul Ye
VGen
1.2K
11
0
12 Mar 2025
VACE: All-in-One Video Creation and Editing
Zeyinzi Jiang
Zhen Han
Chaojie Mao
Junxuan Zhang
Yulin Pan
Yu Liu
DiffMVGen
413
169
0
10 Mar 2025
DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation
Runze Zhang
Guoguang Du
Xiaochuan Li
Qi Jia
Liang Jin
...
Zhenhua Guo
Yaqian Zhao
Xiaoli Gong
Rengang Li
Baoyu Fan
VGen
311
6
0
08 Mar 2025
An Egocentric Vision-Language Model based Portable Real-time Smart Assistant
Yuanmin Huang
Jilan Xu
Baoqi Pei
Yuping He
Guo Chen
...
Xinyuan Chen
Yaohui Wang
Yali Wang
Yu Qiao
Limin Wang
331
6
0
06 Mar 2025
FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute
FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less ComputeComputer Vision and Pattern Recognition (CVPR), 2025
Sotiris Anagnostidis
Gregor Bachmann
Yeongmin Kim
Jonas Kohler
Markos Georgopoulos
A. Sanakoyeu
Yuming Du
Albert Pumarola
Ali K. Thabet
Edgar Schönfeld
401
5
0
27 Feb 2025
VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation
VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Xinlong Chen
Yuanxing Zhang
Chongling Rao
Yushuo Guan
Qingbin Liu
Fuzheng Zhang
Chengru Song
Qiang Liu
Di Zhang
Tieniu Tan
358
14
0
18 Feb 2025
Phantom: Subject-consistent video generation via cross-modal alignment
Phantom: Subject-consistent video generation via cross-modal alignment
Lijie Liu
Tianxiang Ma
Bingchuan Li
Zhuowei Chen
Jiawei Liu
Qian He
Xinglong Wu
Qian He
Xinglong Wu
DiffMVGen
506
55
0
16 Feb 2025
Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
Yuanmin Huang
Jilan Xu
Baoqi Pei
Yuping He
Guo Chen
...
Kunpeng Li
C. Yuan
Yidan Wang
Yu Qiao
L. Wang
463
14
0
31 Dec 2024
Generative Inbetweening through Frame-wise Conditions-Driven Video
  Generation
Generative Inbetweening through Frame-wise Conditions-Driven Video GenerationComputer Vision and Pattern Recognition (CVPR), 2024
Tianyi Zhu
Dongwei Ren
Qilong Wang
Xiaohe Wu
W. Zuo
VGen
292
8
0
16 Dec 2024
MotionStone: Decoupled Motion Intensity Modulation with Diffusion
  Transformer for Image-to-Video Generation
MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video GenerationComputer Vision and Pattern Recognition (CVPR), 2024
Shuwei Shi
Biao Gong
Xi Chen
Dandan Zheng
Shuai Tan
...
Jingwen He
Kecheng Zheng
Jingdong Chen
Ming-Hsuan Yang
Yinqiang Zheng
VGenDiffM
258
13
0
08 Dec 2024
FrameBridge: Improving Image-to-Video Generation with Bridge Models
FrameBridge: Improving Image-to-Video Generation with Bridge Models
Yuji Wang
Zehua Chen
Xiaoyu Chen
Jun-Jie Zhu
Jianfei Chen
Jianfei Chen
DiffMVGen
1.0K
14
0
20 Oct 2024
Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning
Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning
Aakanksha
Arash Ahmadian
Seraphina Goldfarb-Tarrant
Beyza Ermis
Marzieh Fadaee
Sara Hooker
MoMe
255
18
0
14 Oct 2024
Previous
12
Page 2 of 2