Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1812.01717
Cited By
v1
v2 (latest)
Towards Accurate Generative Models of Video: A New Metric & Challenges
3 December 2018
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
EGVM
VGen
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Towards Accurate Generative Models of Video: A New Metric & Challenges"
50 / 715 papers shown
Personalized Generation In Large Model Era: A Survey
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yiyan Xu
Jinghao Zhang
Alireza Salemi
Xinting Hu
Wenjie Wang
Fuli Feng
Hamed Zamani
Xiangnan He
Tat-Seng Chua
3DV
551
28
0
04 Mar 2025
Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models
International Conference on Learning Representations (ICLR), 2025
Xingzhuo Guo
Yu Zhang
Baixu Chen
Haoran Xu
Chao Guo
Mingsheng Long
DiffM
AI4TS
357
6
0
02 Mar 2025
Learning to Animate Images from A Few Videos to Portray Delicate Human Actions
Haoxin Li
Yingchen Yu
Qilong Wu
Hanwang Zhang
Boyang Li
Song Bai
3DH
VGen
1.1K
1
0
01 Mar 2025
Unified Video Action Model
Shuang Li
Yihuai Gao
Dorsa Sadigh
Shuran Song
VGen
685
65
0
28 Feb 2025
WorldModelBench: Judging Video Generation Models As World Models
Dacheng Li
Yunhao Fang
Yukang Chen
Shuo Yang
Shiyi Cao
...
Hongxu Yin
Alfons Kemper
Ion Stoica
Enze Xie
Yaojie Lu
VGen
237
28
0
28 Feb 2025
Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos
Zhiyu Tan
Junyan Wang
Hao Yang
Luozheng Qin
Hesen Chen
Qiang-feng Zhou
Hao Li
VGen
400
3
0
28 Feb 2025
Mobius: Text to Seamless Looping Video Generation via Latent Shift
Xiuli Bi
Jianfei Yuan
Bo Liu
Yanmei Zhang
Xiaodong Cun
Chi-Man Pun
Bin Xiao
DiffM
VGen
171
0
0
27 Feb 2025
C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation
Yuhao Li
Mirana Claire Angel
Salman Khan
Yu Zhu
Jinqiu Sun
Yanning Zhang
Fahad Shahbaz Khan
VGen
248
4
0
27 Feb 2025
Glad: A Streaming Scene Generator for Autonomous Driving
International Conference on Learning Representations (ICLR), 2025
Bin Xie
Yingfei Liu
Tiancai Wang
Jiale Cao
Xinming Zhang
3DGS
VGen
291
11
0
26 Feb 2025
X-Dancer: Expressive Music to Human Dance Video Generation
Zeyuan Chen
Hongyi Xu
Guoxian Song
You Xie
Chenxu Zhang
Xiusi Chen
Chao Wang
Di Chang
Linjie Luo
VGen
322
8
0
24 Feb 2025
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
Sihyun Yu
Meera Hahn
Dan Kondratyuk
Jinwoo Shin
Agrim Gupta
José Lezama
Irfan Essa
David A. Ross
Jonathan Huang
DiffM
VGen
693
6
0
18 Feb 2025
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
Computer Vision and Pattern Recognition (CVPR), 2025
Jingcheng Ni
Yuxin Guo
Yichen Liu
Rui Chen
Lewei Lu
Z. Wu
DiffM
VGen
303
18
0
17 Feb 2025
SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion
Junxian Ma
Shiwen Wang
Jian Yang
Junyi Hu
Jian Liang
Guosheng Lin
Jingbo Chen
Kai Li
Yu Meng
DiffM
VGen
333
9
0
17 Feb 2025
CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation
Qinghe Wang
Yawen Luo
Xiaoyu Shi
Xu Jia
Huchuan Lu
Tianfan Xue
Xintao Wang
Pengfei Wan
Di Zhang
Kun Gai
DiffM
VGen
405
33
0
12 Feb 2025
Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance
Li Hu
Guangyuan Wang
Zhen Shen
Xin Gao
Dechao Meng
Lian Zhuo
Peng Zhang
Bang Zhang
Liefeng Bo
DiffM
VGen
417
36
0
10 Feb 2025
Pre-Trained Video Generative Models as World Simulators
Haoran He
Yang Zhang
Guanbin Li
Zhihao Xu
Ling Pan
VGen
376
23
0
10 Feb 2025
History-Guided Video Diffusion
Kiwhan Song
Boyuan Chen
Max Simchowitz
Yilun Du
Russ Tedrake
Vincent Sitzmann
VGen
554
65
0
10 Feb 2025
VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer
Xinyu Liu
Ailing Zeng
Wei Xue
Harry Yang
Wenhan Luo
Qifeng Liu
Wenhan Luo
VGen
1.2K
7
0
09 Feb 2025
MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation
Haibo Tong
Zhaoyang Wang
Zhe Chen
Haonian Ji
Shi Qiu
...
Peng Xia
Mingyu Ding
Rafael Rafailov
Chelsea Finn
Huaxiu Yao
EGVM
VGen
663
9
0
03 Feb 2025
Improving Tropical Cyclone Forecasting With Video Diffusion Models
Zhibo Ren
Pritthijit Nath
Pancham Shukla
422
1
0
27 Jan 2025
Taming Teacher Forcing for Masked Autoregressive Video Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Deyu Zhou
Quan Sun
Yuang Peng
Kun Yan
Runpei Dong
...
Zheng Ge
Nan Duan
Xiangyu Zhang
L. Ni
H. Shum
VGen
387
19
0
21 Jan 2025
BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations
Computer Vision and Pattern Recognition (CVPR), 2025
Weixi Feng
Chao Liu
Sifei Liu
William Yang Wang
Arash Vahdat
Weili Nie
VGen
DiffM
207
10
0
13 Jan 2025
MEt3R: Measuring Multi-View Consistency in Generated Images
Computer Vision and Pattern Recognition (CVPR), 2025
Mohammad Asim
Christopher Wewer
Thomas Wimmer
Bernt Schiele
J. E. Lenssen
EGVM
3DGS
VGen
257
38
0
10 Jan 2025
VideoAuteur: Towards Long Narrative Video Generation
Junfei Xiao
Feng Cheng
Lu Qi
Liangke Gui
Jiepeng Cen
Zhibei Ma
Yaoyao Liu
Lu Jiang
VGen
393
7
0
10 Jan 2025
Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Guy Yariv
Yuval Kirstain
Amit Zohar
Shelly Sheynin
Yaniv Taigman
Yossi Adi
Sagie Benaim
Adam Polyak
VGen
DiffM
190
11
0
06 Jan 2025
Free-Form Motion Control: Controlling the 6D Poses of Camera and Objects in Video Generation
Xincheng Shuai
Henghui Ding
Zhenyuan Qin
Hao Luo
Jiabo He
Dacheng Tao
VGen
DiffM
296
0
0
02 Jan 2025
AKiRa: Augmentation Kit on Rays for optical video generation
Computer Vision and Pattern Recognition (CVPR), 2024
Xi Wang
Robin Courant
Marc Christie
Vicky Kalogeiton
VGen
419
12
0
31 Dec 2024
DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yiyuan Liang
Zhiying Yan
Liqun Chen
Jiahuan Zhou
Luxin Yan
Sheng Zhong
Xu Zou
DiffM
VGen
392
9
0
31 Dec 2024
Grid Diffusion Models for Text-to-Video Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Taegyeong Lee
Soyeong Kwon
Taehwan Kim
313
19
0
31 Dec 2024
DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers
Yuntao Chen
Yuqi Wang
Rundong Wang
1.0K
44
0
24 Dec 2024
VidTwin: Video VAE with Decoupled Structure and Dynamics
Computer Vision and Pattern Recognition (CVPR), 2024
Yuchi Wang
Junliang Guo
Xinyi Xie
Tianyu He
Xu Sun
Li Zhao
DRL
VGen
369
7
0
23 Dec 2024
Label-Efficient Data Augmentation with Video Diffusion Models for Guidewire Segmentation in Cardiac Fluoroscopy
AAAI Conference on Artificial Intelligence (AAAI), 2024
Shaoyan Pan
Yikang Liu
Lin Zhao
Eric Z. Chen
Xiao Chen
Terrence Chen
Shanhui Sun
VGen
MedIm
468
1
0
20 Dec 2024
Parallelized Autoregressive Visual Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Yanjie Wang
Shuhuai Ren
Zhijie Lin
Yujin Han
Haoyuan Guo
Zhenheng Yang
Difan Zou
Jiashi Feng
Xihui Liu
VGen
647
36
0
19 Dec 2024
AniDoc: Animation Creation Made Easier
Computer Vision and Pattern Recognition (CVPR), 2024
Yihao Meng
Hao Ouyang
Hanlin Wang
Qiuyu Wang
Wen Wang
Ka Leong Cheng
Zhiheng Liu
Yujun Shen
Huamin Qu
DiffM
VGen
485
15
0
18 Dec 2024
VideoDPO: Omni-Preference Alignment for Video Diffusion Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Runtao Liu
Haoyu Wu
Zheng Ziqiang
Chen Wei
Yingqing He
Renjie Pi
Qifeng Chen
VGen
324
66
0
18 Dec 2024
SurgSora: Object-Aware Diffusion Model for Controllable Surgical Video Generation
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2024
Tong Chen
Shuya Yang
Junyi Wang
Long Bai
Hongliang Ren
Luping Zhou
VGen
MedIm
435
4
0
18 Dec 2024
Can video generation replace cinematographers? Research on the cinematic language of generated video
Xuelong Li
Kai WU
Siyi Yang
YiZhan Qu
Guohua. Zhang
...
Mingliang Xiong
Hao Deng
Qingwen Liu
Gang Li
Bin He
VGen
DiffM
388
2
0
16 Dec 2024
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
Computer Vision and Pattern Recognition (CVPR), 2024
Mariam Hassan
Sebastian Stapf
Ahmad Rahimi
Pedro M B Rezende
Yasaman Haghighi
...
Mathieu Salzmann
Davide Scaramuzza
Marc Pollefeys
Paolo Favaro
Alexandre Alahi
VLM
VGen
322
40
0
15 Dec 2024
OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Bohan Li
Jianfeng Dong
Jiadong Wang
Yukai Shi
Yasheng Sun
...
Zhuang Ma
Baao Xie
Chao Ma
Yunbo Wang
Wenjun Zeng
DiffM
874
4
0
15 Dec 2024
Dynamic Try-On: Taming Video Virtual Try-on with Dynamic Attention Mechanism
Jun Zheng
Jing Wang
Fuwei Zhao
Xujie Zhang
Xiaodan Liang
VGen
DiffM
357
2
0
13 Dec 2024
OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation
Weiqi Li
Shijie Zhao
Chong Mou
Xuhan Sheng
Ying Tai
Qian Wang
Junlin Li
Li Zhang
Jian Zhang
DiffM
VGen
171
6
0
12 Dec 2024
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
Haonan Qiu
Shiwei Zhang
Yujie Wei
Ruihang Chu
Hangjie Yuan
Xinyu Wang
Yujiao Shi
Ziwei Liu
357
18
0
12 Dec 2024
Efficient Continuous Video Flow Model for Video Prediction
Gaurav Shrivastava
Abhinav Shrivastava
VGen
239
0
0
07 Dec 2024
DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models
Yizhuo Li
Yuying Ge
Yixiao Ge
Ping Luo
Mingyu Ding
DiffM
VGen
297
1
0
05 Dec 2024
The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control
Ruili Feng
Han Zhang
Zhantao Yang
Jie Xiao
Zhilei Shu
Zhiheng Liu
Andy Zheng
Yukun Huang
Yu Liu
Han Zhang
VGen
288
46
0
04 Dec 2024
Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention
Hannan Lu
Xiaohe Wu
Shudong Wang
Xiameng Qin
Xinyu Zhang
Junyu Han
W. Zuo
Ji Tao
349
4
0
04 Dec 2024
CPA: Camera-pose-awareness Diffusion Transformer for Video Generation
Yuelei Wang
Jian Zhang
Pengtao Jiang
Hao Zhang
Jinwei Chen
Bo Li
VGen
DiffM
334
9
0
02 Dec 2024
HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving
Z. Wu
Jingcheng Ni
Xiaodong Wang
Yuxin Guo
Rui Chen
Lewei Lu
Jifeng Dai
Yuwen Xiong
360
17
0
02 Dec 2024
Playable Game Generation
Mingyu Yang
Junyou Li
Zhongbin Fang
Sheng Chen
Yangbin Yu
Qiang Fu
Wei Yang
Deheng Ye
VGen
292
19
0
01 Dec 2024
Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer
Computer Vision and Pattern Recognition (CVPR), 2024
Jiahao Cui
Hui Li
Yun Zhan
Hanlin Shang
K. Cheng
Yuqi Ma
Shan Mu
Hang Zhou
Jingdong Wang
Siyu Zhu
ViT
VGen
545
78
0
01 Dec 2024
Previous
1
2
3
...
5
6
7
...
13
14
15
Next