ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.01717
  4. Cited By
Towards Accurate Generative Models of Video: A New Metric & Challenges
v1v2 (latest)

Towards Accurate Generative Models of Video: A New Metric & Challenges

3 December 2018
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
    EGVMVGen
ArXiv (abs)PDFHTML

Papers citing "Towards Accurate Generative Models of Video: A New Metric & Challenges"

50 / 715 papers shown
Personalized Generation In Large Model Era: A Survey
Personalized Generation In Large Model Era: A SurveyAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Yiyan Xu
Jinghao Zhang
Alireza Salemi
Xinting Hu
Wenjie Wang
Fuli Feng
Hamed Zamani
Xiangnan He
Tat-Seng Chua
3DV
551
28
0
04 Mar 2025
Dynamical Diffusion: Learning Temporal Dynamics with Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2025
Xingzhuo Guo
Yu Zhang
Baixu Chen
Haoran Xu
Chao Guo
Mingsheng Long
DiffMAI4TS
357
6
0
02 Mar 2025
Learning to Animate Images from A Few Videos to Portray Delicate Human Actions
Haoxin Li
Yingchen Yu
Qilong Wu
Hanwang Zhang
Boyang Li
Song Bai
3DHVGen
1.1K
1
0
01 Mar 2025
Unified Video Action Model
Unified Video Action Model
Shuang Li
Yihuai Gao
Dorsa Sadigh
Shuran Song
VGen
685
65
0
28 Feb 2025
WorldModelBench: Judging Video Generation Models As World Models
WorldModelBench: Judging Video Generation Models As World Models
Dacheng Li
Yunhao Fang
Yukang Chen
Shuo Yang
Shiyi Cao
...
Hongxu Yin
Alfons Kemper
Ion Stoica
Enze Xie
Yaojie Lu
VGen
237
28
0
28 Feb 2025
Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos
Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos
Zhiyu Tan
Junyan Wang
Hao Yang
Luozheng Qin
Hesen Chen
Qiang-feng Zhou
Hao Li
VGen
400
3
0
28 Feb 2025
Mobius: Text to Seamless Looping Video Generation via Latent Shift
Mobius: Text to Seamless Looping Video Generation via Latent Shift
Xiuli Bi
Jianfei Yuan
Bo Liu
Yanmei Zhang
Xiaodong Cun
Chi-Man Pun
Bin Xiao
DiffMVGen
171
0
0
27 Feb 2025
C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation
C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation
Yuhao Li
Mirana Claire Angel
Salman Khan
Yu Zhu
Jinqiu Sun
Yanning Zhang
Fahad Shahbaz Khan
VGen
248
4
0
27 Feb 2025
Glad: A Streaming Scene Generator for Autonomous DrivingInternational Conference on Learning Representations (ICLR), 2025
Bin Xie
Yingfei Liu
Tiancai Wang
Jiale Cao
Xinming Zhang
3DGSVGen
291
11
0
26 Feb 2025
X-Dancer: Expressive Music to Human Dance Video Generation
X-Dancer: Expressive Music to Human Dance Video Generation
Zeyuan Chen
Hongyi Xu
Guoxian Song
You Xie
Chenxu Zhang
Xiusi Chen
Chao Wang
Di Chang
Linjie Luo
VGen
322
8
0
24 Feb 2025
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
Sihyun Yu
Meera Hahn
Dan Kondratyuk
Jinwoo Shin
Agrim Gupta
José Lezama
Irfan Essa
David A. Ross
Jonathan Huang
DiffMVGen
693
6
0
18 Feb 2025
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
MaskGWM: A Generalizable Driving World Model with Video Mask ReconstructionComputer Vision and Pattern Recognition (CVPR), 2025
Jingcheng Ni
Yuxin Guo
Yichen Liu
Rui Chen
Lewei Lu
Z. Wu
DiffMVGen
303
18
0
17 Feb 2025
SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion
SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion
Junxian Ma
Shiwen Wang
Jian Yang
Junyi Hu
Jian Liang
Guosheng Lin
Jingbo Chen
Kai Li
Yu Meng
DiffMVGen
333
9
0
17 Feb 2025
CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation
CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation
Qinghe Wang
Yawen Luo
Xiaoyu Shi
Xu Jia
Huchuan Lu
Tianfan Xue
Xintao Wang
Pengfei Wan
Di Zhang
Kun Gai
DiffMVGen
405
33
0
12 Feb 2025
Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance
Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance
Li Hu
Guangyuan Wang
Zhen Shen
Xin Gao
Dechao Meng
Lian Zhuo
Peng Zhang
Bang Zhang
Liefeng Bo
DiffMVGen
417
36
0
10 Feb 2025
Pre-Trained Video Generative Models as World Simulators
Pre-Trained Video Generative Models as World Simulators
Haoran He
Yang Zhang
Guanbin Li
Zhihao Xu
Ling Pan
VGen
376
23
0
10 Feb 2025
History-Guided Video Diffusion
History-Guided Video Diffusion
Kiwhan Song
Boyuan Chen
Max Simchowitz
Yilun Du
Russ Tedrake
Vincent Sitzmann
VGen
554
65
0
10 Feb 2025
VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer
VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer
Xinyu Liu
Ailing Zeng
Wei Xue
Harry Yang
Wenhan Luo
Qifeng Liu
Wenhan Luo
VGen
1.2K
7
0
09 Feb 2025
MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation
MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation
Haibo Tong
Zhaoyang Wang
Zhe Chen
Haonian Ji
Shi Qiu
...
Peng Xia
Mingyu Ding
Rafael Rafailov
Chelsea Finn
Huaxiu Yao
EGVMVGen
663
9
0
03 Feb 2025
Improving Tropical Cyclone Forecasting With Video Diffusion Models
Improving Tropical Cyclone Forecasting With Video Diffusion Models
Zhibo Ren
Pritthijit Nath
Pancham Shukla
422
1
0
27 Jan 2025
Taming Teacher Forcing for Masked Autoregressive Video Generation
Taming Teacher Forcing for Masked Autoregressive Video GenerationComputer Vision and Pattern Recognition (CVPR), 2025
Deyu Zhou
Quan Sun
Yuang Peng
Kun Yan
Runpei Dong
...
Zheng Ge
Nan Duan
Xiangyu Zhang
L. Ni
H. Shum
VGen
387
19
0
21 Jan 2025
BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations
BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video RepresentationsComputer Vision and Pattern Recognition (CVPR), 2025
Weixi Feng
Chao Liu
Sifei Liu
William Yang Wang
Arash Vahdat
Weili Nie
VGenDiffM
207
10
0
13 Jan 2025
MEt3R: Measuring Multi-View Consistency in Generated Images
MEt3R: Measuring Multi-View Consistency in Generated ImagesComputer Vision and Pattern Recognition (CVPR), 2025
Mohammad Asim
Christopher Wewer
Thomas Wimmer
Bernt Schiele
J. E. Lenssen
EGVM3DGSVGen
257
38
0
10 Jan 2025
VideoAuteur: Towards Long Narrative Video Generation
VideoAuteur: Towards Long Narrative Video Generation
Junfei Xiao
Feng Cheng
Lu Qi
Liangke Gui
Jiepeng Cen
Zhibei Ma
Yaoyao Liu
Lu Jiang
VGen
393
7
0
10 Jan 2025
Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video GenerationComputer Vision and Pattern Recognition (CVPR), 2025
Guy Yariv
Yuval Kirstain
Amit Zohar
Shelly Sheynin
Yaniv Taigman
Yossi Adi
Sagie Benaim
Adam Polyak
VGenDiffM
190
11
0
06 Jan 2025
Free-Form Motion Control: Controlling the 6D Poses of Camera and Objects in Video Generation
Free-Form Motion Control: Controlling the 6D Poses of Camera and Objects in Video Generation
Xincheng Shuai
Henghui Ding
Zhenyuan Qin
Hao Luo
Jiabo He
Dacheng Tao
VGenDiffM
296
0
0
02 Jan 2025
AKiRa: Augmentation Kit on Rays for optical video generation
AKiRa: Augmentation Kit on Rays for optical video generationComputer Vision and Pattern Recognition (CVPR), 2024
Xi Wang
Robin Courant
Marc Christie
Vicky Kalogeiton
VGen
419
12
0
31 Dec 2024
DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes
DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving ScenesAAAI Conference on Artificial Intelligence (AAAI), 2024
Yiyuan Liang
Zhiying Yan
Liqun Chen
Jiahuan Zhou
Luxin Yan
Sheng Zhong
Xu Zou
DiffMVGen
392
9
0
31 Dec 2024
Grid Diffusion Models for Text-to-Video Generation
Grid Diffusion Models for Text-to-Video GenerationComputer Vision and Pattern Recognition (CVPR), 2024
Taegyeong Lee
Soyeong Kwon
Taehwan Kim
313
19
0
31 Dec 2024
DrivingGPT: Unifying Driving World Modeling and Planning with
  Multi-modal Autoregressive Transformers
DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers
Yuntao Chen
Yuqi Wang
Rundong Wang
1.0K
44
0
24 Dec 2024
VidTwin: Video VAE with Decoupled Structure and Dynamics
VidTwin: Video VAE with Decoupled Structure and DynamicsComputer Vision and Pattern Recognition (CVPR), 2024
Yuchi Wang
Junliang Guo
Xinyi Xie
Tianyu He
Xu Sun
Li Zhao
DRLVGen
369
7
0
23 Dec 2024
Label-Efficient Data Augmentation with Video Diffusion Models for Guidewire Segmentation in Cardiac Fluoroscopy
Label-Efficient Data Augmentation with Video Diffusion Models for Guidewire Segmentation in Cardiac FluoroscopyAAAI Conference on Artificial Intelligence (AAAI), 2024
Shaoyan Pan
Yikang Liu
Lin Zhao
Eric Z. Chen
Xiao Chen
Terrence Chen
Shanhui Sun
VGenMedIm
468
1
0
20 Dec 2024
Parallelized Autoregressive Visual Generation
Parallelized Autoregressive Visual GenerationComputer Vision and Pattern Recognition (CVPR), 2024
Yanjie Wang
Shuhuai Ren
Zhijie Lin
Yujin Han
Haoyuan Guo
Zhenheng Yang
Difan Zou
Jiashi Feng
Xihui Liu
VGen
647
36
0
19 Dec 2024
AniDoc: Animation Creation Made Easier
AniDoc: Animation Creation Made EasierComputer Vision and Pattern Recognition (CVPR), 2024
Yihao Meng
Hao Ouyang
Hanlin Wang
Qiuyu Wang
Wen Wang
Ka Leong Cheng
Zhiheng Liu
Yujun Shen
Huamin Qu
DiffMVGen
485
15
0
18 Dec 2024
VideoDPO: Omni-Preference Alignment for Video Diffusion Generation
VideoDPO: Omni-Preference Alignment for Video Diffusion GenerationComputer Vision and Pattern Recognition (CVPR), 2024
Runtao Liu
Haoyu Wu
Zheng Ziqiang
Chen Wei
Yingqing He
Renjie Pi
Qifeng Chen
VGen
324
66
0
18 Dec 2024
SurgSora: Object-Aware Diffusion Model for Controllable Surgical Video Generation
SurgSora: Object-Aware Diffusion Model for Controllable Surgical Video GenerationInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2024
Tong Chen
Shuya Yang
Junyi Wang
Long Bai
Hongliang Ren
Luping Zhou
VGenMedIm
435
4
0
18 Dec 2024
Can video generation replace cinematographers? Research on the cinematic language of generated video
Can video generation replace cinematographers? Research on the cinematic language of generated video
Xuelong Li
Kai WU
Siyi Yang
YiZhan Qu
Guohua. Zhang
...
Mingliang Xiong
Hao Deng
Qingwen Liu
Gang Li
Bin He
VGenDiffM
388
2
0
16 Dec 2024
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained
  Ego-Motion, Object Dynamics, and Scene Composition Control
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition ControlComputer Vision and Pattern Recognition (CVPR), 2024
Mariam Hassan
Sebastian Stapf
Ahmad Rahimi
Pedro M B Rezende
Yasaman Haghighi
...
Mathieu Salzmann
Davide Scaramuzza
Marc Pollefeys
Paolo Favaro
Alexandre Alahi
VLMVGen
322
40
0
15 Dec 2024
OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation
OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene GenerationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Bohan Li
Jianfeng Dong
Jiadong Wang
Yukai Shi
Yasheng Sun
...
Zhuang Ma
Baao Xie
Chao Ma
Yunbo Wang
Wenjun Zeng
DiffM
874
4
0
15 Dec 2024
Dynamic Try-On: Taming Video Virtual Try-on with Dynamic Attention Mechanism
Dynamic Try-On: Taming Video Virtual Try-on with Dynamic Attention Mechanism
Jun Zheng
Jing Wang
Fuwei Zhao
Xujie Zhang
Xiaodan Liang
VGenDiffM
357
2
0
13 Dec 2024
OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video
  Generation
OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation
Weiqi Li
Shijie Zhao
Chong Mou
Xuhan Sheng
Ying Tai
Qian Wang
Junlin Li
Li Zhang
Jian Zhang
DiffMVGen
171
6
0
12 Dec 2024
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
Haonan Qiu
Shiwei Zhang
Yujie Wei
Ruihang Chu
Hangjie Yuan
Xinyu Wang
Yujiao Shi
Ziwei Liu
357
18
0
12 Dec 2024
Efficient Continuous Video Flow Model for Video Prediction
Efficient Continuous Video Flow Model for Video Prediction
Gaurav Shrivastava
Abhinav Shrivastava
VGen
239
0
0
07 Dec 2024
DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video
  Generation with Language Models
DiCoDe: Diffusion-Compressed Deep Tokens for Autoregressive Video Generation with Language Models
Yizhuo Li
Yuying Ge
Yixiao Ge
Ping Luo
Mingyu Ding
DiffMVGen
297
1
0
05 Dec 2024
The Matrix: Infinite-Horizon World Generation with Real-Time Moving
  Control
The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control
Ruili Feng
Han Zhang
Zhantao Yang
Jie Xiao
Zhilei Shu
Zhiheng Liu
Andy Zheng
Yukun Huang
Yu Liu
Han Zhang
VGen
288
46
0
04 Dec 2024
Seeing Beyond Views: Multi-View Driving Scene Video Generation with
  Holistic Attention
Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention
Hannan Lu
Xiaohe Wu
Shudong Wang
Xiameng Qin
Xinyu Zhang
Junyu Han
W. Zuo
Ji Tao
349
4
0
04 Dec 2024
CPA: Camera-pose-awareness Diffusion Transformer for Video Generation
CPA: Camera-pose-awareness Diffusion Transformer for Video Generation
Yuelei Wang
Jian Zhang
Pengtao Jiang
Hao Zhang
Jinwei Chen
Bo Li
VGenDiffM
334
9
0
02 Dec 2024
HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for
  Autonomous Driving
HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving
Z. Wu
Jingcheng Ni
Xiaodong Wang
Yuxin Guo
Rui Chen
Lewei Lu
Jifeng Dai
Yuwen Xiong
360
17
0
02 Dec 2024
Playable Game Generation
Playable Game Generation
Mingyu Yang
Junyou Li
Zhongbin Fang
Sheng Chen
Yangbin Yu
Qiang Fu
Wei Yang
Deheng Ye
VGen
292
19
0
01 Dec 2024
Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer
Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion TransformerComputer Vision and Pattern Recognition (CVPR), 2024
Jiahao Cui
Hui Li
Yun Zhan
Hanlin Shang
K. Cheng
Yuqi Ma
Shan Mu
Hang Zhou
Jingdong Wang
Siyu Zhu
ViTVGen
545
78
0
01 Dec 2024
Previous
123...567...131415
Next