ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.01717
  4. Cited By
Towards Accurate Generative Models of Video: A New Metric & Challenges
v1v2 (latest)

Towards Accurate Generative Models of Video: A New Metric & Challenges

3 December 2018
Thomas Unterthiner
Sjoerd van Steenkiste
Karol Kurach
Raphaël Marinier
Marcin Michalski
Sylvain Gelly
    EGVMVGen
ArXiv (abs)PDFHTML

Papers citing "Towards Accurate Generative Models of Video: A New Metric & Challenges"

50 / 715 papers shown
SmoothVideo: Smooth Video Synthesis with Noise Constraints on Diffusion
  Models for One-shot Video Tuning
SmoothVideo: Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning
Liang Peng
Haoran Cheng
Zheng Yang
Ruisi Zhao
Linxuan Xia
Chaotian Song
Qinglin Lu
Boxi Wu
Wei Liu
VGen
177
2
0
29 Nov 2023
Panacea: Panoramic and Controllable Video Generation for Autonomous
  Driving
Panacea: Panoramic and Controllable Video Generation for Autonomous DrivingComputer Vision and Pattern Recognition (CVPR), 2023
Yuqing Wen
Yucheng Zhao
Yingfei Liu
Fan Jia
Yanhui Wang
Chong Luo
Chi Zhang
Tiancai Wang
Xiaoyan Sun
Xiangyu Zhang
272
125
0
28 Nov 2023
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for
  Character Animation
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character AnimationComputer Vision and Pattern Recognition (CVPR), 2023
Liucheng Hu
Xin Gao
Peng Zhang
Ke Sun
Bang Zhang
Liefeng Bo
DiffMVGen
473
655
0
28 Nov 2023
MagicAnimate: Temporally Consistent Human Image Animation using
  Diffusion Model
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion ModelComputer Vision and Pattern Recognition (CVPR), 2023
Zhongcong Xu
Jianfeng Zhang
Jun Hao Liew
Hanshu Yan
Jia-Wei Liu
Chenxu Zhang
Jiashi Feng
Mike Zheng Shou
VGenDiffM
400
315
0
27 Nov 2023
FLAIR: A Conditional Diffusion Framework with Applications to Face Video
  Restoration
FLAIR: A Conditional Diffusion Framework with Applications to Face Video RestorationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Zihao Zou
Jiaming Liu
Shirin Shoushtari
Yubo Wang
Weijie Gan
Ulugbek S. Kamilov
VGenDiffM
253
4
0
26 Nov 2023
Decouple Content and Motion for Conditional Image-to-Video Generation
Decouple Content and Motion for Conditional Image-to-Video GenerationAAAI Conference on Artificial Intelligence (AAAI), 2023
Cuifeng Shen
Yulu Gan
Chen Chen
Xiongwei Zhu
Lele Cheng
Yan Li
Jinzhi Wang
VGenDiffM
220
10
0
24 Nov 2023
ADriver-I: A General World Model for Autonomous Driving
ADriver-I: A General World Model for Autonomous Driving
Fan Jia
Weixin Mao
Yingfei Liu
Yucheng Zhao
Yuqing Wen
Chi Zhang
Xiangyu Zhang
Tiancai Wang
366
94
0
22 Nov 2023
FusionFrames: Efficient Architectural Aspects for Text-to-Video
  Generation Pipeline
FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline
V.Ya. Arkhipkin
Zein Shaheen
Viacheslav Vasilev
E. Dakhova
Andrey Kuznetsov
Denis Dimitrov
DiffMVGen
304
7
0
22 Nov 2023
MoVideo: Motion-Aware Video Generation with Diffusion Models
MoVideo: Motion-Aware Video Generation with Diffusion Models
Christos Sakaridis
Yuchen Fan
Kai Zhang
Radu Timofte
Luc Van Gool
Rakesh Ranjan
DiffMVGen
207
14
0
19 Nov 2023
Make Pixels Dance: High-Dynamic Video Generation
Make Pixels Dance: High-Dynamic Video Generation
Yan Zeng
Guoqiang Wei
Jiani Zheng
Jiaxin Zou
Yang Wei
Yuchen Zhang
Hang Li
DiffMVGen
241
148
0
18 Nov 2023
LLM4Drive: A Survey of Large Language Models for Autonomous Driving
LLM4Drive: A Survey of Large Language Models for Autonomous Driving
Zhenjie Yang
Xiaosong Jia
Guoying Gu
Junchi Yan
ELM
595
171
0
02 Nov 2023
POS: A Prompts Optimization Suite for Augmenting Text-to-Video
  Generation
POS: A Prompts Optimization Suite for Augmenting Text-to-Video Generation
Shijie Ma
Huayi Xu
Mengjian Li
Weidong Geng
Yaxiong Wang
Meng Wang
DiffMVGen
169
2
0
02 Nov 2023
One Style is All you Need to Generate a Video
One Style is All you Need to Generate a VideoIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Sandeep Manandhar
Auguste Genovesio
VGen
172
0
0
27 Oct 2023
FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling
FreeNoise: Tuning-Free Longer Video Diffusion via Noise ReschedulingInternational Conference on Learning Representations (ICLR), 2023
Haonan Qiu
Menghan Xia
Yong Zhang
Yin-Yin He
Xintao Wang
Ying Shan
Ziwei Liu
DiffMVGen
291
150
0
23 Oct 2023
Vision Language Models in Autonomous Driving: A Survey and Outlook
Vision Language Models in Autonomous Driving: A Survey and OutlookIEEE Transactions on Intelligent Vehicles (TIV), 2023
Xingcheng Zhou
Mingyu Liu
Ekim Yurtsever
B. L. Žagar
Walter Zimmer
Hu Cao
Alois C. Knoll
VLM
304
130
0
22 Oct 2023
EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
EvalCrafter: Benchmarking and Evaluating Large Video Generation ModelsComputer Vision and Pattern Recognition (CVPR), 2023
Yaofang Liu
Xiaodong Cun
Xuebo Liu
Xintao Wang
Yong Zhang
Haoxin Chen
Yang Liu
Tieyong Zeng
Raymond H. F. Chan
Ying Shan
VGenEGVM
357
238
0
17 Oct 2023
A Survey on Video Diffusion Models
A Survey on Video Diffusion ModelsACM Computing Surveys (ACM Comput. Surv.), 2023
Zhen Xing
Qijun Feng
Haoran Chen
Jingdong Sun
Hang-Rui Hu
Hang Xu
Zuxuan Wu
Yu-Gang Jiang
EGVMVGen
457
220
0
16 Oct 2023
DrivingDiffusion: Layout-Guided multi-view driving scene video
  generation with latent diffusion model
DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model
Xiaofan Li
Yifu Zhang
Xiaoqing Ye
VGen
282
83
0
11 Oct 2023
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with
  Diffusion Models
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2023
Yin-Yin He
Shaoshu Yang
Haoxin Chen
Xiaodong Cun
Menghan Xia
Yong Zhang
Xintao Wang
Ran He
Qifeng Chen
Ying Shan
225
110
0
11 Oct 2023
State of the Art on Diffusion Models for Visual Computing
State of the Art on Diffusion Models for Visual Computing
Ryan Po
Wang Yifan
Vladislav Golyanik
Kfir Aberman
Jonathan T. Barron
...
Matthias Nießner
Bjorn Ommer
Christian Theobalt
Peter Wonka
Gordon Wetzstein
276
154
0
11 Oct 2023
Echocardiography video synthesis from end diastolic semantic map via
  diffusion model
Echocardiography video synthesis from end diastolic semantic map via diffusion modelIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Nguyen Van Phi
Tran Minh Duc
Hieu H. Pham
Tran Quoc Long
DiffMMedImVGen
179
9
0
11 Oct 2023
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Lijun Yu
José Lezama
N. B. Gundavarapu
Luca Versari
Kihyuk Sohn
...
Boqing Gong
Ming-Hsuan Yang
Irfan Essa
David A. Ross
Lu Jiang
435
523
0
09 Oct 2023
FashionFlow: Leveraging Diffusion Models for Dynamic Fashion Video
  Synthesis from Static Imagery
FashionFlow: Leveraging Diffusion Models for Dynamic Fashion Video Synthesis from Static Imagery
Tasin Islam
A. Miron
Xiaohui Liu
Yongmin Li
DiffM
278
8
0
29 Sep 2023
Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model
  Adaptation
Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model AdaptationAAAI Conference on Artificial Intelligence (AAAI), 2023
Guy Yariv
Itai Gat
Sagie Benaim
Lior Wolf
Idan Schwartz
Yossi Adi
DiffMVGen
275
69
0
28 Sep 2023
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video GenerationInternational Journal of Computer Vision (IJCV), 2023
David Junhao Zhang
Jay Zhangjie Wu
Jia-Wei Liu
Rui Zhao
L. Ran
Yuchao Gu
Difei Gao
Mike Zheng Shou
DiffMVGen
612
294
0
27 Sep 2023
Automatic Animation of Hair Blowing in Still Portrait Photos
Automatic Animation of Hair Blowing in Still Portrait PhotosIEEE International Conference on Computer Vision (ICCV), 2023
Wenpeng Xiao
Wentao Liu
Yitong Wang
Guohao Li
Bing Li
3DH
246
15
0
25 Sep 2023
DriveDreamer: Towards Real-world-driven World Models for Autonomous
  Driving
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
Xiaofeng Wang
Zheng Hua Zhu
Guan Huang
Xinze Chen
Jiagang Zhu
Jiwen Lu
VGen
413
233
0
18 Sep 2023
Generative Image Dynamics
Generative Image DynamicsComputer Vision and Pattern Recognition (CVPR), 2023
Zhengqi Li
Richard Tucker
Noah Snavely
Aleksander Holynski
DiffM
358
93
0
14 Sep 2023
The Power of Sound (TPoS): Audio Reactive Video Generation with Stable
  Diffusion
The Power of Sound (TPoS): Audio Reactive Video Generation with Stable DiffusionIEEE International Conference on Computer Vision (ICCV), 2023
Yujin Jeong
Won-Wha Ryoo
Seunghyun Lee
Dabin Seo
Wonmin Byeon
Sangpil Kim
Jinkyu Kim
DiffM
174
39
0
08 Sep 2023
Hierarchical Masked 3D Diffusion Model for Video Outpainting
Hierarchical Masked 3D Diffusion Model for Video OutpaintingACM Multimedia (ACM MM), 2023
Fanda Fan
Chaoxu Guo
Litong Gong
Biao Wang
Bo Xiao
Yuning Jiang
Chunjie Luo
Jianfeng Zhan
DiffMVGen
259
25
0
05 Sep 2023
VideoGen: A Reference-Guided Latent Diffusion Approach for High
  Definition Text-to-Video Generation
VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation
Xin Li
Wenqing Chu
Ye Wu
Weihang Yuan
Fanglong Liu
Tao Gui
Fu Li
Haocheng Feng
Errui Ding
Jingdong Wang
VGen
332
70
0
01 Sep 2023
StyleInV: A Temporal Style Modulated Inversion Network for Unconditional
  Video Generation
StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video GenerationIEEE International Conference on Computer Vision (ICCV), 2023
Yuhan Wang
Liming Jiang
Chen Change Loy
VGen
234
18
0
31 Aug 2023
Learning Modulated Transformation in GANs
Learning Modulated Transformation in GANsNeural Information Processing Systems (NeurIPS), 2023
Ceyuan Yang
Qihang Zhang
Yinghao Xu
Jiapeng Zhu
Yujun Shen
Bo Dai
194
1
0
29 Aug 2023
Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs
Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMsComputer Vision and Pattern Recognition (CVPR), 2023
Hao Fei
Shengqiong Wu
Wei Ji
Hanwang Zhang
Tat-Seng Chua
VGenDiffM
220
45
0
26 Aug 2023
Direction-aware Video Demoireing with Temporal-guided Bilateral Learning
Direction-aware Video Demoireing with Temporal-guided Bilateral LearningAAAI Conference on Artificial Intelligence (AAAI), 2023
Shuning Xu
Binbin Song
Xiangyu Chen
Jiantao Zhou
VGen
245
14
0
25 Aug 2023
Long-Term Prediction of Natural Video Sequences with Robust Video
  Predictors
Long-Term Prediction of Natural Video Sequences with Robust Video Predictors
Luke Ditria
Tom Drummond
298
1
0
21 Aug 2023
SimDA: Simple Diffusion Adapter for Efficient Video Generation
SimDA: Simple Diffusion Adapter for Efficient Video GenerationComputer Vision and Pattern Recognition (CVPR), 2023
Zhen Xing
Jingdong Sun
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGenDiffM
268
105
0
18 Aug 2023
OnUVS: Online Feature Decoupling Framework for High-Fidelity Ultrasound
  Video Synthesis
OnUVS: Online Feature Decoupling Framework for High-Fidelity Ultrasound Video Synthesis
Hangyu Zhou
Dong Ni
Ao Chang
Xinrui Zhou
Rusi Chen
...
Yuhao Huang
Tong Han
Zhe-Yu Liu
Deng-Ping Fan
Xin Yang
176
2
0
16 Aug 2023
Shortcut-V2V: Compression Framework for Video-to-Video Translation based
  on Temporal Redundancy Reduction
Shortcut-V2V: Compression Framework for Video-to-Video Translation based on Temporal Redundancy ReductionIEEE International Conference on Computer Vision (ICCV), 2023
Chaeyeon Chung
Yeojeong Park
Seunghwan Choi
Munkhsoyol Ganbat
Jaegul Choo
227
3
0
15 Aug 2023
ModelScope Text-to-Video Technical Report
ModelScope Text-to-Video Technical Report
Jiuniu Wang
Hangjie Yuan
Dayou Chen
Yingya Zhang
Xiang Wang
Shiwei Zhang
VGenDiffM
348
604
0
12 Aug 2023
Interactive Neural Painting
Interactive Neural PaintingComputer Vision and Image Understanding (CVIU), 2023
E. Peruzzo
Willi Menapace
Vidit Goel
F. Arrigoni
Hao Tang
...
Nikita Orlov
Yuxiao Hu
Humphrey Shi
Andrii Zadaianchuk
Elisa Ricci
247
5
0
31 Jul 2023
VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by
  Using Diffusion Model with ControlNet
VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by Using Diffusion Model with ControlNet
Zhihao Hu
Dong Xu
DiffMVGen
328
89
0
26 Jul 2023
HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and
  Retarget Faces
HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget FacesIEEE International Conference on Computer Vision (ICCV), 2023
Stella Bounareli
Christos Tzelepis
Vasileios Argyriou
Ioannis Patras
Georgios Tzimiropoulos
CVBM
237
56
0
20 Jul 2023
Bidirectionally Deformable Motion Modulation For Video-based Human Pose
  Transfer
Bidirectionally Deformable Motion Modulation For Video-based Human Pose TransferIEEE International Conference on Computer Vision (ICCV), 2023
Weikang Yu
L. Po
Ray C. C. Cheung
Yuzhi Zhao
Yu-Zhi Xue
Kun-Jhih Li
3DH
283
28
0
15 Jul 2023
Text-Guided Synthesis of Eulerian Cinemagraphs
Text-Guided Synthesis of Eulerian CinemagraphsACM Transactions on Graphics (TOG), 2023
Aniruddha Mahapatra
Aliaksandr Siarohin
Hsin-Ying Lee
Sergey Tulyakov
Sitong Su
DiffMVGen
206
24
0
06 Jul 2023
SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen
  LLMs
SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMsNeural Information Processing Systems (NeurIPS), 2023
Lijun Yu
Yong Cheng
Zhiruo Wang
Vivek Kumar
Wolfgang Macherey
...
Yonatan Bisk
Ming-Hsuan Yang
Kevin Patrick Murphy
Alexander G. Hauptmann
Lu Jiang
MLLM
362
69
0
30 Jun 2023
DisCo: Disentangled Control for Realistic Human Dance Generation
DisCo: Disentangled Control for Realistic Human Dance GenerationComputer Vision and Pattern Recognition (CVPR), 2023
Tan Wang
Linjie Li
Kevin Qinghong Lin
Yuanhao Zhai
Chung-Ching Lin
Zhengyuan Yang
Hanwang Zhang
Zicheng Liu
Lijuan Wang
VGen
453
132
0
30 Jun 2023
Action-conditioned Deep Visual Prediction with RoAM, a new Indoor Human
  Motion Dataset for Autonomous Robots
Action-conditioned Deep Visual Prediction with RoAM, a new Indoor Human Motion Dataset for Autonomous RobotsIEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2023
Meenakshi Sarkar
V. Honkote
D. Das
D. Ghose
187
3
0
28 Jun 2023
GD-VDM: Generated Depth for better Diffusion-based Video Generation
GD-VDM: Generated Depth for better Diffusion-based Video Generation
Ariel Lapid
Idan Achituve
Lior Bracha
Ethan Fetaya
DiffMVGen
213
11
0
19 Jun 2023
MovieFactory: Automatic Movie Creation from Text using Large Generative
  Models for Language and Images
MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and ImagesACM Multimedia (ACM MM), 2023
Sitong Su
Huan Yang
Huiguo He
Wenjing Wang
Zixi Tuo
Wen-Huang Cheng
Lianli Gao
Jingkuan Song
Jianlong Fu
VGenDiffM
172
56
0
12 Jun 2023
Previous
123...101112131415
Next
Page 11 of 15
Pageof 15