ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.01717
  4. Cited By
Towards Accurate Generative Models of Video: A New Metric & Challenges
v1v2 (latest)

Towards Accurate Generative Models of Video: A New Metric & Challenges

3 December 2018
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
    EGVMVGen
ArXiv (abs)PDFHTML

Papers citing "Towards Accurate Generative Models of Video: A New Metric & Challenges"

50 / 715 papers shown
Motion Consistency Model: Accelerating Video Diffusion with Disentangled
  Motion-Appearance Distillation
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
Yuanhao Zhai
Kevin Lin
Zhengyuan Yang
Linjie Li
Jianfeng Wang
Chung-Ching Lin
David Doermann
Junsong Yuan
Lijuan Wang
VGenDiffM
247
26
0
11 Jun 2024
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video
  Prediction
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction
Zhen Xing
Jingdong Sun
Zejia Weng
Zuxuan Wu
Yu-Gang Jiang
VGen
295
23
0
10 Jun 2024
GAIA: Rethinking Action Quality Assessment for AI-Generated Videos
GAIA: Rethinking Action Quality Assessment for AI-Generated VideosNeural Information Processing Systems (NeurIPS), 2024
Zijian Chen
Wei Sun
Yuan Tian
Jun Jia
Zicheng Zhang
Jiarui Wang
Ru Huang
Xiongkuo Min
Guangtao Zhai
Wenjun Zhang
EGVM
318
37
0
10 Jun 2024
Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled
  Object Motion
Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion
Ge Ya Luo
Zhi Hao Luo
Anthony Gosselin
Alexia Jolicoeur-Martineau
Christopher Pal
VGenDiffM
167
1
0
09 Jun 2024
CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion
CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion
Xingrui Wang
Xin Li
Zhibo Chen
DiffM
174
4
0
07 Jun 2024
ACE Metric: Advection and Convection Evaluation for Accurate Weather
  Forecasting
ACE Metric: Advection and Convection Evaluation for Accurate Weather Forecasting
Doyi Kim
Minseok Seo
Yeji Choi
171
1
0
07 Jun 2024
GenAI Arena: An Open Evaluation Platform for Generative Models
GenAI Arena: An Open Evaluation Platform for Generative ModelsNeural Information Processing Systems (NeurIPS), 2024
Dongfu Jiang
Max Ku
Tianle Li
Yuansheng Ni
Shizhuo Sun
Rongqi Fan
Wenhu Chen
EGVM
451
45
0
06 Jun 2024
VideoTetris: Towards Compositional Text-to-Video Generation
VideoTetris: Towards Compositional Text-to-Video GenerationNeural Information Processing Systems (NeurIPS), 2024
Ye Tian
Ling Yang
Haotian Yang
Yuan Gao
Yufan Deng
...
Zhaochen Yu
Xin Tao
Pengfei Wan
Di Zhang
Bin Cui
DiffMVGen
272
43
0
06 Jun 2024
VideoPhy: Evaluating Physical Commonsense for Video Generation
VideoPhy: Evaluating Physical Commonsense for Video Generation
Hritik Bansal
Zongyu Lin
Tianyi Xie
Zeshun Zong
Michal Yarom
Yonatan Bitton
Jian Ren
Zhaoxin Fan
Kai-Wei Chang
Aditya Grover
EGVMVGen
230
98
0
05 Jun 2024
Towards Multiple Character Image Animation Through Enhancing Implicit Decoupling
Towards Multiple Character Image Animation Through Enhancing Implicit Decoupling
Jingyun Xue
Haobo Wang
Q. Tian
Yi Ma
Andong Wang
...
Kaihao Zhang
H. Shum
Wen Liu
Mengyang Liu
Tong Lu
DiffM
386
4
0
05 Jun 2024
V-Express: Conditional Dropout for Progressive Training of Portrait
  Video Generation
V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation
Cong Wang
Kuan Tian
Jun Zhang
Yonghang Guan
Feng Luo
Fei Shen
Zhiwei Jiang
Qing Gu
Xiao Han
Wei Yang
250
81
0
04 Jun 2024
CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation
CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation
Dejia Xu
Weili Nie
Chao Liu
Sifei Liu
Jan Kautz
Zhangyang Wang
Arash Vahdat
DiffMVGen
268
108
0
04 Jun 2024
Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait
  Animation
Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation
Yi Ma
Hongyu Liu
Haobo Wang
Heng Pan
Yingqing He
...
Ailing Zeng
Chengfei Cai
H. Shum
Wen Liu
Qifeng Chen
371
127
0
04 Jun 2024
Unleashing Generalization of End-to-End Autonomous Driving with
  Controllable Long Video Generation
Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation
Enhui Ma
Lijun Zhou
Tao Tang
Zhan Zhang
Dong Han
...
Fu Liu
Xianpeng Lang
Haiyang Sun
Di Lin
Kaicheng Yu
VGen
294
43
0
03 Jun 2024
UniAnimate: Taming Unified Video Diffusion Models for Consistent Human
  Image Animation
UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
Xiang Wang
Shiwei Zhang
Changxin Gao
Jiayu Wang
Xiaoqiang Zhou
Yingya Zhang
Luxin Yan
Nong Sang
VGen
322
76
0
03 Jun 2024
SNED: Superposition Network Architecture Search for Efficient Video
  Diffusion Model
SNED: Superposition Network Architecture Search for Efficient Video Diffusion Model
Zhengang Li
Yan Kang
Yuchen Liu
Difan Liu
Tobias Hinz
Feng Liu
Yanzhi Wang
DiffM
244
1
0
31 May 2024
MotionFollower: Editing Video Motion via Lightweight Score-Guided
  Diffusion
MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion
Shuyuan Tu
Jingdong Sun
Zihao Zhang
Sicheng Xie
Zhi-Qi Cheng
Chong Luo
Xintong Han
Zuxuan Wu
Yu-Gang Jiang
DiffMVGen
210
22
0
30 May 2024
CV-VAE: A Compatible Video VAE for Latent Generative Video Models
CV-VAE: A Compatible Video VAE for Latent Generative Video Models
Sijie Zhao
Yong Zhang
Xiaodong Cun
Shaoshu Yang
Muyao Niu
Xiaoyu Li
Wenbo Hu
Ying Shan
DiffM
237
46
0
30 May 2024
VividPose: Advancing Stable Video Diffusion for Realistic Human Image
  Animation
VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation
Qilin Wang
Zhengkai Jiang
Chengming Xu
Jiangning Zhang
Yabiao Wang
Xinyi Zhang
Yunkang Cao
Weijian Cao
Chengjie Wang
Yanwei Fu
VGen
205
27
0
28 May 2024
Vista: A Generalizable Driving World Model with High Fidelity and
  Versatile Controllability
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
456
209
0
27 May 2024
Controllable Longer Image Animation with Diffusion Models
Controllable Longer Image Animation with Diffusion Models
Qiang Wang
Minghua Liu
Junjun Hu
Fan Jiang
Mu Xu
VGen
250
2
0
27 May 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models
iVideoGPT: Interactive VideoGPTs are Scalable World Models
Jialong Wu
Shaofeng Yin
Ningya Feng
Xu He
Dong Li
Haifeng Zhang
Mingsheng Long
VGen
293
85
0
24 May 2024
Diffusion for World Modeling: Visual Details Matter in Atari
Diffusion for World Modeling: Visual Details Matter in Atari
Eloi Alonso
Adam Jelley
Vincent Micheli
Anssi Kanervisto
Amos Storkey
Tim Pearce
Franccois Fleuret
326
144
0
20 May 2024
FIFO-Diffusion: Generating Infinite Videos from Text without Training
FIFO-Diffusion: Generating Infinite Videos from Text without TrainingNeural Information Processing Systems (NeurIPS), 2024
Jihwan Kim
Junoh Kang
Jinyoung Choi
Bohyung Han
DiffMVGen
438
76
0
19 May 2024
From Sora What We Can See: A Survey of Text-to-Video Generation
From Sora What We Can See: A Survey of Text-to-Video Generation
Rui Sun
Yumin Zhang
Tejal Shah
Jiahao Sun
Shuoying Zhang
Wenqi Li
Haoran Duan
Bo Wei
R. Ranjan
EGVM
276
38
0
17 May 2024
Dance Any Beat: Blending Beats with Visuals in Dance Video Generation
Dance Any Beat: Blending Beats with Visuals in Dance Video GenerationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Xuanchen Wang
Heng Wang
Dongnan Liu
Weidong Cai
201
14
0
15 May 2024
The Lost Melody: Empirical Observations on Text-to-Video Generation From
  A Storytelling Perspective
The Lost Melody: Empirical Observations on Text-to-Video Generation From A Storytelling Perspective
Andrew Shin
Yusuke Mori
Kunitake Kaneko
VGenEGVM
195
3
0
13 May 2024
Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video
  Motion Editing
Edit-Your-Motion: Space-Time Diffusion Decoupling Learning for Video Motion Editing
Yi Zuo
Lingling Li
Licheng Jiao
Fang Liu
Xu Liu
Wenping Ma
Shuyuan Yang
Yuwei Guo
VGenDiffM
201
4
0
07 May 2024
Video Diffusion Models: A Survey
Video Diffusion Models: A Survey
Andrew Melnik
Michal Ljubljanac
Cong Lu
Qi Yan
Weiming Ren
Helge J. Ritter
VGen
340
36
0
06 May 2024
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Zheng Zhu
Xiaofeng Wang
Wangbo Zhao
Chen Min
Nianchen Deng
...
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Jiwen Lu
Guan Huang
VGenLM&Ro
362
82
0
06 May 2024
Matten: Video Generation with Mamba-Attention
Matten: Video Generation with Mamba-Attention
Yu Gao
Jiancheng Huang
Xiaopeng Sun
Zequn Jie
Yujie Zhong
Lin Ma
401
30
0
05 May 2024
Unveiling Differences in Generative Models: A Scalable Differential Clustering Approach
Unveiling Differences in Generative Models: A Scalable Differential Clustering ApproachComputer Vision and Pattern Recognition (CVPR), 2024
Jingwei Zhang
Mohammad Jalali
Cheuk Ting Li
Farzan Farnia
292
5
0
04 May 2024
FlexiFilm: Long Video Generation with Flexible Conditions
FlexiFilm: Long Video Generation with Flexible Conditions
Yichen Ouyang
Jianhao Yuan
Hao Zhao
Gaoang Wang
Bo Zhao
DiffM
223
12
0
29 Apr 2024
TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion
  Models
TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models
Haomiao Ni
Bernhard Egger
Suhas Lohit
A. Cherian
Ye Wang
T. Koike-Akino
S. X. Huang
Tim K. Marks
DiffM
176
21
0
25 Apr 2024
TAVGBench: Benchmarking Text to Audible-Video Generation
TAVGBench: Benchmarking Text to Audible-Video Generation
Yuxin Mao
Xuyang Shen
Jing Zhang
Zhen Qin
Jinxing Zhou
Mochu Xiang
Yiran Zhong
Yuchao Dai
183
27
0
22 Apr 2024
Zero-shot High-fidelity and Pose-controllable Character Animation
Zero-shot High-fidelity and Pose-controllable Character Animation
Bingwen Zhu
Fanyi Wang
Tianyi Lu
Peng Liu
Jingwen Su
Yu Lei
Yanhao Zhang
Zuxuan Wu
Guo-Jun Qi
Yu-Gang Jiang
DiffMVGen
215
9
0
21 Apr 2024
Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text
  Consistency and Domain Distribution Gap
Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribution Gap
Bowen Qu
Xiaoyu Liang
Shangkun Sun
Wei-Nan Gao
EGVM
377
12
0
21 Apr 2024
PhysDreamer: Physics-Based Interaction with 3D Objects via Video
  Generation
PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation
Tianyuan Zhang
Hong-Xing Yu
Rundi Wu
Brandon Yushan Feng
Changxi Zheng
Noah Snavely
Jiajun Wu
William T. Freeman
AI4CEVGen
293
135
0
19 Apr 2024
On the Content Bias in Fréchet Video Distance
On the Content Bias in Fréchet Video Distance
Jason S. Hoffman
Aniruddha Mahapatra
Gaurav Parmar
Jun-Yan Zhu
Jia-Bin Huang
EGVM
256
32
0
18 Apr 2024
PNeRV: Enhancing Spatial Consistency via Pyramidal Neural Representation
  for Videos
PNeRV: Enhancing Spatial Consistency via Pyramidal Neural Representation for Videos
Qi Zhao
M. Salman Asif
Zhan Ma
231
8
0
13 Apr 2024
PEAVS: Perceptual Evaluation of Audio-Visual Synchrony Grounded in
  Viewers' Opinion Scores
PEAVS: Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers' Opinion ScoresEuropean Conference on Computer Vision (ECCV), 2024
Lucas Goncalves
Prashant Mathur
Chandrashekhar Lavania
Metehan Cekic
Marcello Federico
Kyu J. Han
170
9
0
10 Apr 2024
Action-conditioned video data improves predictability
Action-conditioned video data improves predictability
Meenakshi Sarkar
Debasish Ghose
VGen
327
0
0
08 Apr 2024
Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model
Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion ModelComputer Vision and Pattern Recognition (CVPR), 2024
Xu He
Qiaochu Huang
Zhensong Zhang
Zhiwei Lin
Zhiyong Wu
Sicheng Yang
Minglei Li
Zhiyi Chen
Songcen Xu
Xiaofei Wu
200
28
0
02 Apr 2024
Video Interpolation with Diffusion Models
Video Interpolation with Diffusion Models
Siddhant Jain
Daniel Watson
Eric Tabellion
Aleksander Holyñski
Ben Poole
Janne Kontkanen
SupRVGenDiffM
284
62
0
01 Apr 2024
SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject
  Control
SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control
Binyuan Huang
Yuqing Wen
Yucheng Zhao
Yaosi Hu
Yingfei Liu
...
Tiancai Wang
Chi Zhang
Chang Wen Chen
Zhenzhong Chen
Xiangyu Zhang
280
21
0
28 Mar 2024
Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework
Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework
Ziyao Huang
Fan Tang
Yong Zhang
Xiaodong Cun
Juan Cao
Jintao Li
Tong-Yee Lee
DiffMVGen
235
28
0
25 Mar 2024
A Survey on Long Video Generation: Challenges, Methods, and Prospects
A Survey on Long Video Generation: Challenges, Methods, and Prospects
Chengxuan Li
Di Huang
Zeyu Lu
Yang Xiao
Qingqi Pei
Lei Bai
EGVM
183
33
0
25 Mar 2024
Champ: Controllable and Consistent Human Image Animation with 3D
  Parametric Guidance
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Shenhao Zhu
Junming Leo Chen
Zuozhuo Dai
Qingkun Su
Yinghui Xu
Xun Cao
Yao Yao
Hao Zhu
Siyu Zhu
3DHVGen
378
226
0
21 Mar 2024
Efficient Video Diffusion Models via Content-Frame Motion-Latent
  Decomposition
Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
Sihyun Yu
Weili Nie
De-An Huang
Boyi Li
Jinwoo Shin
A. Anandkumar
VGenDiffM
277
25
0
21 Mar 2024
Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific
  Adaptation
Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation
Fu-Yun Wang
Xiaoshi Wu
Zhaoyang Huang
Xiaoyu Shi
Dazhong Shen
Guanglu Song
Yu Liu
Jiaming Song
DiffM
218
30
0
20 Mar 2024
Previous
123...8910...131415
Next