Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1812.01717
Cited By
v1
v2 (latest)
Towards Accurate Generative Models of Video: A New Metric & Challenges
3 December 2018
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
EGVM
VGen
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Towards Accurate Generative Models of Video: A New Metric & Challenges"
50 / 715 papers shown
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
Jianbiao Mei
T. Hu
Xuemeng Yang
Licheng Wen
Yu Yang
Tiantian Wei
Yukai Ma
Min Dou
Botian Shi
Yong Liu
VGen
DiffM
535
16
0
06 Sep 2024
SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model
Weipeng Tan
Chuming Lin
Chengming Xu
Xiaozhong Ji
Junwei Zhu
Chengjie Wang
Yanwei Fu
DiffM
153
3
0
05 Sep 2024
OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model
Liuhan Chen
Zongjian Li
Bin Lin
Bin Zhu
Qian Wang
Shenghai Yuan
X. Zhou
Xinhua Cheng
Li Yuan
DiffM
361
22
0
02 Sep 2024
Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation
Qihua Chen
Yi Ma
Haobo Wang
Junkun Yuan
Wenzhe Zhao
Q. Tian
Hongmei Wang
Shaobo Min
Qifeng Chen
Wen Liu
DiffM
226
39
0
02 Sep 2024
DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving
Yongjie Fu
Anmol Jain
Xuan Di
Xu Chen
Chengbo Zang
VGen
218
10
0
29 Aug 2024
GenRec: Unifying Video Generation and Recognition with Diffusion Models
Neural Information Processing Systems (NeurIPS), 2024
Zejia Weng
Xitong Yang
Zhen Xing
Zuxuan Wu
Yu-Gang Jiang
VGen
DiffM
336
14
0
27 Aug 2024
Empowering Sign Language Communication: Integrating Sentiment and Semantics for Facial Expression Synthesis
Computers & graphics (CG), 2024
Rafael Azevedo
Thiago M. Coutinho
Joao Klock Ferreira
Thiago L. Gomes
Erickson R. Nascimento
SLR
214
8
0
27 Aug 2024
TC-PDM: Temporally Consistent Patch Diffusion Models for Infrared-to-Visible Video Translation
Anh-Dzung Doan
Vu Minh Hieu Phan
Surabhi Gupta
Markus Wagner
Tat-Jun Chin
Ian Reid
VGen
DiffM
182
1
0
26 Aug 2024
SurGen: Text-Guided Diffusion Model for Surgical Video Generation
Joseph Cho
Samuel Schmidgall
C. Zakka
Mrudang Mathur
Dhamanpreet Kaur
R. Shad
W. Hiesinger
VGen
MedIm
308
18
0
26 Aug 2024
K-Sort Arena: Efficient and Reliable Benchmarking for Generative Models via K-wise Human Preferences
Computer Vision and Pattern Recognition (CVPR), 2024
Zhikai Li
Xuewen Liu
Dongrong Fu
Jianquan Li
Qingyi Gu
Kurt Keutzer
Zhen Dong
EGVM
VGen
DiffM
351
8
0
26 Aug 2024
E-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment
Shangkun Sun
Xiaoyu Liang
S. Fan
Wenxu Gao
Wei-Nan Gao
DiffM
262
6
0
21 Aug 2024
TrackGo: A Flexible and Efficient Method for Controllable Video Generation
AAAI Conference on Artificial Intelligence (AAAI), 2024
Haitao Zhou
Chuang Wang
Rui Nie
Jinxiao Lin
Dongdong Yu
Qian Yu
Changhu Wang
VGen
DiffM
551
28
0
21 Aug 2024
Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data
Tao Yang
Yangming Shi
Yunwen Huang
Feng Chen
Yin Zheng
Lei Zhang
DiffM
VGen
206
0
0
19 Aug 2024
Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation
Liu He
Yizhi Song
Hejun Huang
Pinxin Liu
Yunlong Tang
Daniel G. Aliaga
Xin Zhou
DiffM
VGen
461
10
0
19 Aug 2024
Quality Assessment in the Era of Large Models: A Survey
Zicheng Zhang
Yingjie Zhou
Chunyi Li
Baixuan Zhao
Xiaohong Liu
Guangtao Zhai
344
33
0
17 Aug 2024
Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model
Zhichao Zhang
Xinyue Li
Wei Sun
Jun Jia
Xiongkuo Min
...
Puyi Wang
Zhongpeng Ji
Fengyu Sun
Shangling Jui
Guangtao Zhai
EGVM
250
0
0
31 Jul 2024
Tora: Trajectory-oriented Diffusion Transformer for Video Generation
Zhenghao Zhang
Junchao Liao
Menghao Li
Zuozhuo Dai
Bingxue Qiu
Hao Hu
Shaowei Cai
Weizhi Wang
VGen
558
111
0
31 Jul 2024
Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions
IEEE Access (IEEE Access), 2024
Ashkan Taghipour
Morteza Ghahremani
Bennamoun
Aref Miri Rekavandi
Zinuo Li
Hamid Laga
F. Boussaïd
VGen
256
5
0
27 Jul 2024
HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation
Zhenzhi Wang
Shouqing Yang
Yanhong Zeng
Youqing Fang
Yuwei Guo
...
Jing Tan
Kai Chen
Tianfan Xue
Bo Dai
Dahua Lin
VGen
3DH
385
50
0
24 Jul 2024
SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency
Yiming Xie
Chun-Han Yao
Vikram S. Voleti
Huaizu Jiang
Varun Jampani
VGen
384
99
0
24 Jul 2024
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos
Jiahe Liu
Youran Qu
Qi Yan
Fangyin Wei
Lele Wang
Renjie Liao
VGen
EGVM
331
28
0
23 Jul 2024
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
Sherwin Bahmani
Ivan Skorokhodov
Aliaksandr Siarohin
Willi Menapace
Guocheng Qian
...
Chaoyang Wang
Jiaxu Zou
Andrea Tagliasacchi
David B. Lindell
Sergey Tulyakov
VGen
DiffM
549
107
0
17 Jul 2024
QVD: Post-training Quantization for Video Diffusion Models
Shilong Tian
Hong Chen
Chengtao Lv
Yu Liu
Jinyang Guo
Xianglong Liu
Shengxi Li
Hao Yang
Tao Xie
VGen
MQ
280
12
0
16 Jul 2024
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Yuanhao Zhai
Kevin Qinghong Lin
Linjie Li
Chung-Ching Lin
Jianfeng Wang
Zhengyuan Yang
David Doermann
Junsong Yuan
Zicheng Liu
Lijuan Wang
DiffM
VGen
241
11
0
15 Jul 2024
Towards Robust Event-based Networks for Nighttime via Unpaired Day-to-Night Event Translation
Yuhwan Jeong
Hoonhee Cho
Kuk-Jin Yoon
DiffM
202
8
0
15 Jul 2024
Kinetic Typography Diffusion Model
Seonmi Park
Inhwan Bae
Seunghyun Shin
Hae-Gon Jeon
DiffM
298
5
0
15 Jul 2024
TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models
J. Kim
Min-Jung Kim
Junsoo Lee
Jaegul Choo
DiffM
204
11
0
12 Jul 2024
A Comprehensive Survey on Human Video Generation: Challenges, Methods, and Insights
Wentao Lei
Jinting Wang
Fengji Ma
Guanjie Huang
Li Liu
VGen
EGVM
289
16
0
11 Jul 2024
PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines
Zidong Wang
Zeyu Lu
Di Huang
Tong He
Xihui Liu
Wanli Ouyang
Mengwei He
239
9
0
11 Jul 2024
Controlling Space and Time with Diffusion Models
Daniel Watson
Saurabh Saxena
Lala Li
Andrea Tagliasacchi
David J. Fleet
VGen
458
54
0
10 Jul 2024
Video In-context Learning: Autoregressive Transformers are Zero-Shot Video Imitators
Wentao Zhang
Junliang Guo
Tianyu He
Li Zhao
Linli Xu
Jiang Bian
340
7
0
10 Jul 2024
MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions
Xuan Ju
Yiming Gao
Zhaoyang Zhang
Ziyang Yuan
Xintao Wang
Ailing Zeng
Yu Xiong
Qiang Xu
Ying Shan
VGen
291
104
0
08 Jul 2024
VIMI: Grounding Video Generation through Multi-modal Instruction
Yuwei Fang
Willi Menapace
Aliaksandr Siarohin
Tsai-Shien Chen
Kuan-Chien Wang
Ivan Skorokhodov
Graham Neubig
Sergey Tulyakov
VGen
332
10
0
08 Jul 2024
Towards a Scalable Reference-Free Evaluation of Generative Models
Azim Ospanov
Jingwei Zhang
Mohammad Jalali
Xuenan Cao
Andrej Bogdanov
Farzan Farnia
EGVM
251
18
0
03 Jul 2024
MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Yuang Zhang
Jiaxi Gu
L. Wang
Han Wang
Junqi Cheng
Yuefeng Zhu
Fangyuan Zou
VGen
429
153
0
28 Jun 2024
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents
Zihao Wang
Shaofei Cai
Zhancun Mu
Haowei Lin
Ceyao Zhang
Xuejie Liu
Qing Li
Hoang Trung-Dung
Xiaojian Ma
Yitao Liang
LM&Ro
310
25
0
27 Jun 2024
MultiDiff: Consistent Novel View Synthesis from a Single Image
Norman Muller
Katja Schwarz
Barbara Roessle
Lorenzo Porzi
Samuel Rota Buló
Matthias Nießner
Peter Kontschieder
DiffM
300
52
0
26 Jun 2024
DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance
Younghyun Kim
Geunmin Hwang
Junyu Zhang
Eunbyung Park
688
26
0
26 Jun 2024
FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models
Haonan Qiu
Zhaoxi Chen
Zhouxia Wang
Yingqing He
Menghan Xia
Ziwei Liu
VGen
DiffM
208
46
0
24 Jun 2024
Listen and Move: Improving GANs Coherency in Agnostic Sound-to-Video Generation
Rafael Redondo
193
0
0
23 Jun 2024
Image Conductor: Precision Control for Interactive Video Synthesis
Yaowei Li
Xintao Wang
Zhaoyang Zhang
Zhouxia Wang
Ziyang Yuan
Liangbin Xie
Yuexian Zou
Ying Shan
VGen
257
45
0
21 Jun 2024
VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
Xuan He
Dongfu Jiang
Ge Zhang
Max Ku
Achint Soni
...
Yaswanth Narsupalli
Rongqi Fan
Zhiheng Lyu
Yuchen Lin
Wenhu Chen
EGVM
VGen
ALM
313
118
0
21 Jun 2024
Video Generation with Learned Action Prior
Meenakshi Sarkar
Devansh Bhardwaj
Debasish Ghose
VGen
GAN
306
0
0
20 Jun 2024
IRASim: A Fine-Grained World Model for Robot Manipulation
Fangqi Zhu
Hongtao Wu
Song Guo
Yuxiao Liu
Chilam Cheang
Tao Kong
329
25
0
20 Jun 2024
Neural Residual Diffusion Models for Deep Scalable Vision Generation
Neural Information Processing Systems (NeurIPS), 2024
Zhiyuan Ma
Liangliang Zhao
Biqing Qi
Bowen Zhou
DiffM
412
8
0
19 Jun 2024
L4GM: Large 4D Gaussian Reconstruction Model
Neural Information Processing Systems (NeurIPS), 2024
Jiawei Ren
Kevin Xie
Ashkan Mirzaei
Hanxue Liang
Xiaohui Zeng
...
Ziwei Liu
Antonio Torralba
Sanja Fidler
Seung Wook Kim
Huan Ling
3DGS
261
95
0
14 Jun 2024
Training-free Camera Control for Video Generation
International Conference on Learning Representations (ICLR), 2024
Chen Hou
Guoqiang Wei
VGen
DiffM
627
80
0
14 Jun 2024
SimGen: Simulator-conditioned Driving Scene Generation
Yunsong Zhou
Michael Simon
Zhenghao Peng
Sicheng Mo
Hongzi Zhu
Minyi Guo
Bolei Zhou
VGen
301
23
0
13 Jun 2024
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability,Reproducibility, and Practicality
Tianle Zhang
Langtian Ma
Yuchen Yan
Yuchen Zhang
Kai Wang
...
Wenqi Shao
Yang You
Yu Qiao
Ping Luo
Kaipeng Zhang
VGen
356
4
0
13 Jun 2024
Hierarchical Patch Diffusion Models for High-Resolution Video Generation
Ivan Skorokhodov
Willi Menapace
Aliaksandr Siarohin
Sergey Tulyakov
VGen
245
20
0
12 Jun 2024
Previous
1
2
3
...
7
8
9
...
13
14
15
Next