ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.01717
  4. Cited By
Towards Accurate Generative Models of Video: A New Metric & Challenges
v1v2 (latest)

Towards Accurate Generative Models of Video: A New Metric & Challenges

3 December 2018
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
    EGVMVGen
ArXiv (abs)PDFHTML

Papers citing "Towards Accurate Generative Models of Video: A New Metric & Challenges"

50 / 715 papers shown
Probabilistic Forecasting with Stochastic Interpolants and Föllmer
  Processes
Probabilistic Forecasting with Stochastic Interpolants and Föllmer Processes
Yifan Chen
Mark Goldstein
Mengjian Hua
M. S. Albergo
Nicholas M. Boffi
Eric Vanden-Eijnden
AI4TS
352
35
0
20 Mar 2024
S2DM: Sector-Shaped Diffusion Models for Video Generation
S2DM: Sector-Shaped Diffusion Models for Video Generation
Haoran Lang
Yuxuan Ge
Zheng Tian
DiffMVGen
185
0
0
20 Mar 2024
AnimateDiff-Lightning: Cross-Model Diffusion Distillation
AnimateDiff-Lightning: Cross-Model Diffusion Distillation
Shanchuan Lin
Xiao Yang
DiffMVGen
222
41
0
19 Mar 2024
Enhancing Bandwidth Efficiency for Video Motion Transfer Applications
  using Deep Learning Based Keypoint Prediction
Enhancing Bandwidth Efficiency for Video Motion Transfer Applications using Deep Learning Based Keypoint Prediction
Xue Bai
Tasmiah Haque
S. Mohan
Yuliang Cai
Byungheon Jeong
Adam Halasz
Srinjoy Das
214
1
0
17 Mar 2024
Endora: Video Generation Models as Endoscopy Simulators
Endora: Video Generation Models as Endoscopy Simulators
Chenxin Li
Hengyu Liu
Yifan Liu
Brandon Yushan Feng
Wuyang Li
Xinyu Liu
Daming Gao
Jing Shao
Yixuan Yuan
VGenMedIm
234
69
0
17 Mar 2024
Animate Your Motion: Turning Still Images into Dynamic Videos
Animate Your Motion: Turning Still Images into Dynamic VideosEuropean Conference on Computer Vision (ECCV), 2024
Mingxiao Li
Bo Wan
Marie-Francine Moens
Tinne Tuytelaars
VGenDiffM
288
9
0
15 Mar 2024
Generalized Predictive Model for Autonomous Driving
Generalized Predictive Model for Autonomous DrivingComputer Vision and Pattern Recognition (CVPR), 2024
Jiazhi Yang
Shenyuan Gao
Yihang Qiu
Li Chen
Tianyu Li
...
Ping Luo
Jun Zhang
Andreas Geiger
Yu Qiao
Hongyang Li
VGen
233
120
0
14 Mar 2024
Intention-driven Ego-to-Exo Video Generation
Intention-driven Ego-to-Exo Video Generation
Hongcheng Luo
Kai Zhu
Wei Zhai
Yang Cao
DiffMVGen
230
16
0
14 Mar 2024
SSM Meets Video Diffusion Models: Efficient Video Generation with
  Structured State Spaces
SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces
Yuta Oshima
Shohei Taniguchi
Masahiro Suzuki
Yutaka Matsuo
317
7
0
12 Mar 2024
DragAnything: Motion Control for Anything using Entity Representation
DragAnything: Motion Control for Anything using Entity RepresentationEuropean Conference on Computer Vision (ECCV), 2024
Wejia Wu
Zhuang Li
Yuchao Gu
Rui Zhao
Yefei He
David Junhao Zhang
Mike Zheng Shou
Yan Li
Yan Li
Chen Zhang
VGen
471
120
0
12 Mar 2024
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video
  Generation
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Guosheng Zhao
Xiaofeng Wang
Zheng Zhu
Xinze Chen
Guan Huang
Xiaoyi Bao
Xingang Wang
VGen
248
14
0
11 Mar 2024
Audio-Synchronized Visual Animation
Audio-Synchronized Visual AnimationEuropean Conference on Computer Vision (ECCV), 2024
Lin Zhang
Shentong Mo
Yijing Zhang
Pedro Morgado
DiffM
241
33
0
08 Mar 2024
Sora as a World Model? A Complete Survey on Text-to-Video Generation
Sora as a World Model? A Complete Survey on Text-to-Video Generation
Joseph Cho
Fachrina Dewi Puspitasari
Sheng Zheng
Jingyao Zheng
Noor Ul Eman
...
Caiyan Qin
Tae-Ho Kim
Choong Seon Hong
Yang Yang
Heng Tao Shen
EGVMVGen
284
66
0
08 Mar 2024
Time Weaver: A Conditional Time Series Generation Model
Time Weaver: A Conditional Time Series Generation ModelInternational Conference on Machine Learning (ICML), 2024
Sai Shankar Narasimhan
Shubhankar Agarwal
Oguzhan Akcin
Sujay Sanghavi
Sandeep Chinchali
MedImDiffM
435
33
0
05 Mar 2024
Abductive Ego-View Accident Video Understanding for Safe Driving
  Perception
Abductive Ego-View Accident Video Understanding for Safe Driving Perception
Jianwu Fang
Lei-lei Li
Junfei Zhou
Junbin Xiao
Hongkai Yu
Chen Lv
Jianru Xue
Tat-Seng Chua
278
43
0
01 Mar 2024
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Tsai-Shien Chen
Aliaksandr Siarohin
Willi Menapace
Ekaterina Deyneka
Hsiang-wei Chao
...
Yuwei Fang
Hsin-Ying Lee
Jian Ren
Ming-Hsuan Yang
Sergey Tulyakov
VGen
369
342
0
29 Feb 2024
Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion
  Latent Aligners
Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
Yazhou Xing
Yin-Yin He
Zeyue Tian
Xintao Wang
Qifeng Chen
345
109
0
27 Feb 2024
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video
  Synthesis
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Willi Menapace
Aliaksandr Siarohin
Ivan Skorokhodov
Ekaterina Deyneka
Tsai-Shien Chen
...
Yuwei Fang
A. Stoliar
Elisa Ricci
Jian Ren
Sergey Tulyakov
VGen
340
97
0
22 Feb 2024
Make a Cheap Scaling: A Self-Cascade Diffusion Model for
  Higher-Resolution Adaptation
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
Lanqing Guo
Yin-Yin He
Haoxin Chen
Menghan Xia
Xiaodong Cun
...
Yong Zhang
Xintao Wang
Qifeng Chen
Ying Shan
Bihan Wen
230
49
0
16 Feb 2024
One-shot Neural Face Reenactment via Finding Directions in GAN's Latent
  Space
One-shot Neural Face Reenactment via Finding Directions in GAN's Latent SpaceInternational Journal of Computer Vision (IJCV), 2024
Stella Bounareli
Christos Tzelepis
Vasileios Argyriou
Ioannis Patras
Georgios Tzimiropoulos
CVBM3DH
243
11
0
05 Feb 2024
Direct-a-Video: Customized Video Generation with User-Directed Camera
  Movement and Object Motion
Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object MotionInternational Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), 2024
Shiyuan Yang
Liang Hou
Haibin Huang
Chongyang Ma
Pengfei Wan
Di Zhang
Xiaodong Chen
Jing Liao
VGenDiffM
457
143
0
05 Feb 2024
Video-LaVIT: Unified Video-Language Pre-training with Decoupled
  Visual-Motional Tokenization
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional TokenizationInternational Conference on Machine Learning (ICML), 2024
Yang Jin
Zhicheng Sun
Kun Xu
Kun Xu
Liwei Chen
...
Yuliang Liu
Chen Zhang
Yang Song
Kun Gai
Yadong Mu
VGen
258
78
0
05 Feb 2024
Boximator: Generating Rich and Controllable Motions for Video Synthesis
Boximator: Generating Rich and Controllable Motions for Video Synthesis
Jiawei Wang
Yuchen Zhang
Jiaxin Zou
Yan Zeng
Guoqiang Wei
Liping Yuan
Hang Li
DiffMVGen
259
82
0
02 Feb 2024
AnimateLCM: Accelerating the Animation of Personalized Diffusion Models
  and Adapters with Decoupled Consistency Learning
AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning
Fu-Yun Wang
Zhaoyang Huang
Xiaoyu Shi
Weikang Bian
Guanglu Song
Yu Liu
Jiaming Song
169
16
0
01 Feb 2024
Lumiere: A Space-Time Diffusion Model for Video Generation
Lumiere: A Space-Time Diffusion Model for Video GenerationACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH Asia), 2024
Omer Bar-Tal
Hila Chefer
Omer Tov
Charles Herrmann
Roni Paiss
...
T. Michaeli
Oliver Wang
Deqing Sun
Tali Dekel
Inbar Mosseri
VGen
403
383
0
23 Jan 2024
DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing
  High-Quality Implicit Neural Representations
DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural RepresentationsInternational Conference on Learning Representations (ICLR), 2024
Dogyun Park
S. Kim
Sojin Lee
Hyunwoo J. Kim
DiffM
324
14
0
23 Jan 2024
Synthesizing Moving People with 3D Control
Synthesizing Moving People with 3D Control
Boyi Li
Jathushan Rajasegaran
Yossi Gandelsman
Alexei A. Efros
Jitendra Malik
VGenDiffM
178
5
0
19 Jan 2024
Vlogger: Make Your Dream A Vlog
Vlogger: Make Your Dream A VlogComputer Vision and Pattern Recognition (CVPR), 2024
Shaobin Zhuang
Kunchang Li
Xinyuan Chen
Yaohui Wang
Ziwei Liu
Yu Qiao
Yali Wang
VGenDiffM
147
63
0
17 Jan 2024
Towards A Better Metric for Text-to-Video Generation
Towards A Better Metric for Text-to-Video Generation
Jay Zhangjie Wu
Guian Fang
Haoning Wu
Xintao Wang
Yixiao Ge
...
Rui Zhao
Weisi Lin
Wynne Hsu
Ying Shan
Mike Zheng Shou
VGen
256
44
0
15 Jan 2024
RAVEN: Rethinking Adversarial Video Generation with Efficient Tri-plane
  Networks
RAVEN: Rethinking Adversarial Video Generation with Efficient Tri-plane NetworksInternational Conference on Information Photonics (ICIP), 2024
Partha Ghosh
Soubhik Sanyal
Cordelia Schmid
Bernhard Scholkopf
VGen
271
1
0
11 Jan 2024
Latte: Latent Diffusion Transformer for Video Generation
Latte: Latent Diffusion Transformer for Video Generation
Xin Ma
Yaohui Wang
Gengyun Jia
Xinyuan Chen
Ziqiang Liu
Yuan-Fang Li
Cunjian Chen
Yu Qiao
DiffMVGen
896
429
0
05 Jan 2024
VASE: Object-Centric Appearance and Shape Manipulation of Real Videos
VASE: Object-Centric Appearance and Shape Manipulation of Real Videos
E. Peruzzo
Vidit Goel
Dejia Xu
Xingqian Xu
Lezhi Li
Zinan Lin
Humphrey Shi
Andrii Zadaianchuk
LM&RoVGenDiffM
275
18
0
04 Jan 2024
Moonshot: Towards Controllable Video Generation and Editing with
  Multimodal Conditions
Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
David Junhao Zhang
Dongxu Li
Hung Le
Mike Zheng Shou
Caiming Xiong
Doyen Sahoo
VGen
225
36
0
03 Jan 2024
FlashVideo: A Framework for Swift Inference in Text-to-Video Generation
FlashVideo: A Framework for Swift Inference in Text-to-Video Generation
Bin Lei
Le Chen
Caiwen Ding
VGen
154
2
0
30 Dec 2023
VideoPoet: A Large Language Model for Zero-Shot Video Generation
VideoPoet: A Large Language Model for Zero-Shot Video Generation
Dan Kondratyuk
Lijun Yu
Xiuye Gu
José Lezama
Jonathan Huang
...
Irfan Essa
Huisheng Wang
David A. Ross
Bryan Seybold
Lu Jiang
VGen
532
402
0
21 Dec 2023
Sign Language Production with Latent Motion Transformer
Sign Language Production with Latent Motion Transformer
Pan Xie
Taiying Peng
Yao Du
Qipeng Zhang
SLR
207
10
0
20 Dec 2023
Video Dynamics Prior: An Internal Learning Approach for Robust Video
  Enhancements
Video Dynamics Prior: An Internal Learning Approach for Robust Video EnhancementsNeural Information Processing Systems (NeurIPS), 2023
Gaurav Shrivastava
Ser-Nam Lim
Abhinav Shrivastava
149
12
0
13 Dec 2023
PEEKABOO: Interactive Video Generation via Masked-Diffusion
PEEKABOO: Interactive Video Generation via Masked-DiffusionComputer Vision and Pattern Recognition (CVPR), 2023
Yash Jain
Anshul Nasery
Vibhav Vineet
Harkirat Singh Behl
VGen
276
60
0
12 Dec 2023
Photorealistic Video Generation with Diffusion Models
Photorealistic Video Generation with Diffusion Models
Agrim Gupta
Lijun Yu
Kihyuk Sohn
Xiuye Gu
Meera Hahn
Fei-Fei Li
Irfan Essa
Lu Jiang
José Lezama
VGen
607
264
0
11 Dec 2023
CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional
  Modeling
CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional Modeling
Ruihan Yang
H. Gamper
Sebastian Braun
DiffM
155
6
0
08 Dec 2023
GenDeF: Learning Generative Deformation Field for Video Generation
GenDeF: Learning Generative Deformation Field for Video Generation
Wen Wang
Kecheng Zheng
Qiuyu Wang
Hao Chen
Zifan Shi
Ceyuan Yang
Yujun Shen
Chunhua Shen
VGenDiffM
193
3
0
07 Dec 2023
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Zhiwu Qing
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yujie Wei
Yingya Zhang
Changxin Gao
Nong Sang
VGenDiffM
211
55
0
07 Dec 2023
MotionCtrl: A Unified and Flexible Motion Controller for Video
  Generation
MotionCtrl: A Unified and Flexible Motion Controller for Video Generation
Zhouxia Wang
Ziyang Yuan
Xintao Wang
Tianshui Chen
Menghan Xia
Ping Luo
Ying Shan
DiffMVGen
419
398
0
06 Dec 2023
FAAC: Facial Animation Generation with Anchor Frame and Conditional
  Control for Superior Fidelity and Editability
FAAC: Facial Animation Generation with Anchor Frame and Conditional Control for Superior Fidelity and Editability
Linze Li
Sunqi Fan
Hengjun Pu
Z. Bing
Yao Tang
Tianzhu Ye
Tong Yang
Liangyu Chen
Jiajun Liang
VGenDiffM
193
0
0
06 Dec 2023
WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera
  Driving Scene Generation
WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene GenerationEuropean Conference on Computer Vision (ECCV), 2023
Jiachen Lu
Ze Huang
Zeyu Yang
Jiahui Zhang
Li Zhang
VGen
348
72
0
05 Dec 2023
DragVideo: Interactive Drag-style Video Editing
DragVideo: Interactive Drag-style Video EditingEuropean Conference on Computer Vision (ECCV), 2023
Yufan Deng
Ruida Wang
Yuhao Zhang
Yu-Wing Tai
Chi-Keung Tang
DiffMVGen
260
36
0
03 Dec 2023
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion
  Models
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models
Pengxiang Li
Kai Chen
Zhili Liu
Ruiyuan Gao
Lanqing Hong
Guo Zhou
Hua Yao
Dit-Yan Yeung
Huchuan Lu
Xu Jia
VGenDiffM
202
0
0
01 Dec 2023
ART$\boldsymbol{\cdot}$V: Auto-Regressive Text-to-Video Generation with
  Diffusion Models
ART⋅\boldsymbol{\cdot}⋅V: Auto-Regressive Text-to-Video Generation with Diffusion Models
Wenming Weng
Ruoyu Feng
Yanhui Wang
Jingdong Sun
Chunyu Wang
...
Jianmin Bao
Yuhui Yuan
Chong Luo
Yueyi Zhang
Zhiwei Xiong
VGen
213
64
0
30 Nov 2023
Driving into the Future: Multiview Visual Forecasting and Planning with
  World Model for Autonomous Driving
Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous DrivingComputer Vision and Pattern Recognition (CVPR), 2023
Yu-Quan Wang
Jiawei He
Lue Fan
Hongxin Li
Yuntao Chen
Zhaoxiang Zhang
VGen
317
244
0
29 Nov 2023
VBench: Comprehensive Benchmark Suite for Video Generative Models
VBench: Comprehensive Benchmark Suite for Video Generative ModelsComputer Vision and Pattern Recognition (CVPR), 2023
Ziqi Huang
Yinan He
Jiashuo Yu
Fan Zhang
Chenyang Si
...
Xinyuan Chen
Limin Wang
Dahua Lin
Yu Qiao
Ziwei Liu
VGen
518
976
0
29 Nov 2023
Previous
123...91011...131415
Next