Towards Accurate Generative Models of Video: A New Metric & Challenges

3 December 2018

Thomas Unterthiner

Sjoerd van Steenkiste

Papers citing "Towards Accurate Generative Models of Video: A New Metric & Challenges"

50 / 715 papers shown

SmoothVideo: Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning

Wei Liu

177

29 Nov 2023

Panacea: Panoramic and Controllable Video Generation for Autonomous Driving

Computer Vision and Pattern Recognition (CVPR), 2023

Xiangyu Zhang

272

125

28 Nov 2023

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

Computer Vision and Pattern Recognition (CVPR), 2023

473

655

28 Nov 2023

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Computer Vision and Pattern Recognition (CVPR), 2023

400

315

27 Nov 2023

FLAIR: A Conditional Diffusion Framework with Applications to Face Video Restoration

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Ulugbek S. Kamilov

253

26 Nov 2023

Decouple Content and Motion for Conditional Image-to-Video Generation

AAAI Conference on Artificial Intelligence (AAAI), 2023

220

24 Nov 2023

ADriver-I: A General World Model for Autonomous Driving

Xiangyu Zhang

366

22 Nov 2023

FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline

304

22 Nov 2023

MoVideo: Motion-Aware Video Generation with Diffusion Models

Christos Sakaridis

Yuchen Fan

Kai Zhang

Radu Timofte

Luc Van Gool

Rakesh Ranjan

DiffM VGen

207

19 Nov 2023

Make Pixels Dance: High-Dynamic Video Generation

241

148

18 Nov 2023

LLM4Drive: A Survey of Large Language Models for Autonomous Driving

595

171

02 Nov 2023

POS: A Prompts Optimization Suite for Augmenting Text-to-Video Generation

169

02 Nov 2023

One Style is All you Need to Generate a Video

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Sandeep Manandhar

Auguste Genovesio

VGen

172

27 Oct 2023

FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling

International Conference on Learning Representations (ICLR), 2023

Yong Zhang

Ying Shan

Ziwei Liu

DiffM VGen

291

150

23 Oct 2023

Vision Language Models in Autonomous Driving: A Survey and Outlook

IEEE Transactions on Intelligent Vehicles (TIV), 2023

304

130

22 Oct 2023

EvalCrafter: Benchmarking and Evaluating Large Video Generation Models

Computer Vision and Pattern Recognition (CVPR), 2023

Xiaodong Cun

Yong Zhang

Ying Shan

357

238

17 Oct 2023

A Survey on Video Diffusion Models

ACM Computing Surveys (ACM Comput. Surv.), 2023

Zuxuan Wu

457

220

16 Oct 2023

DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model

Xiaofan Li

Yifu Zhang

Xiaoqing Ye

VGen

282

11 Oct 2023

ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models

International Conference on Learning Representations (ICLR), 2023

Xiaodong Cun

Yong Zhang

Ying Shan

225

110

11 Oct 2023

State of the Art on Diffusion Models for Visual Computing

Kfir Aberman

...

Matthias Nießner

276

154

11 Oct 2023

Echocardiography video synthesis from end diastolic semantic map via diffusion model

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

179

11 Oct 2023

Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation

...

Ming-Hsuan Yang

435

523

09 Oct 2023

FashionFlow: Leveraging Diffusion Models for Dynamic Fashion Video Synthesis from Static Imagery

278

29 Sep 2023

Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

AAAI Conference on Artificial Intelligence (AAAI), 2023

Guy Yariv

Itai Gat

Sagie Benaim

Lior Wolf

Idan Schwartz

Yossi Adi

DiffM VGen

275

28 Sep 2023

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

International Journal of Computer Vision (IJCV), 2023

612

294

27 Sep 2023

Automatic Animation of Hair Blowing in Still Portrait Photos

IEEE International Conference on Computer Vision (ICCV), 2023

Bing Li

246

25 Sep 2023

DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving

413

233

18 Sep 2023

Generative Image Dynamics

Computer Vision and Pattern Recognition (CVPR), 2023

Aleksander Holynski

358

14 Sep 2023

The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion

IEEE International Conference on Computer Vision (ICCV), 2023

174

08 Sep 2023

Hierarchical Masked 3D Diffusion Model for Video Outpainting

ACM Multimedia (ACM MM), 2023

259

05 Sep 2023

VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation

Errui Ding

Jingdong Wang

VGen

332

01 Sep 2023

StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video Generation

IEEE International Conference on Computer Vision (ICCV), 2023

234

31 Aug 2023

Learning Modulated Transformation in GANs

Neural Information Processing Systems (NeurIPS), 2023

194

29 Aug 2023

Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs

Computer Vision and Pattern Recognition (CVPR), 2023

Hao Fei

Wei Ji

220

26 Aug 2023

Direction-aware Video Demoireing with Temporal-guided Bilateral Learning

AAAI Conference on Artificial Intelligence (AAAI), 2023

245

25 Aug 2023

Long-Term Prediction of Natural Video Sequences with Robust Video Predictors

Luke Ditria

Tom Drummond

298

21 Aug 2023

SimDA: Simple Diffusion Adapter for Efficient Video Generation

Computer Vision and Pattern Recognition (CVPR), 2023

Zuxuan Wu

268

105

18 Aug 2023

OnUVS: Online Feature Decoupling Framework for High-Fidelity Ultrasound Video Synthesis

...

Xin Yang

176

16 Aug 2023

Shortcut-V2V: Compression Framework for Video-to-Video Translation based on Temporal Redundancy Reduction

IEEE International Conference on Computer Vision (ICCV), 2023

Chaeyeon Chung

227

15 Aug 2023

ModelScope Text-to-Video Technical Report

348

604

12 Aug 2023

Interactive Neural Painting

Computer Vision and Image Understanding (CVIU), 2023

...

247

31 Jul 2023

VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by Using Diffusion Model with ControlNet

Zhihao Hu

Dong Xu

DiffM VGen

328

26 Jul 2023

HyperReenact: One-Shot Reenactment via Jointly Learning to Refine and Retarget Faces

IEEE International Conference on Computer Vision (ICCV), 2023

Georgios Tzimiropoulos

CVBM

237

20 Jul 2023

Bidirectionally Deformable Motion Modulation For Video-based Human Pose Transfer

IEEE International Conference on Computer Vision (ICCV), 2023

283

15 Jul 2023

Text-Guided Synthesis of Eulerian Cinemagraphs

ACM Transactions on Graphics (TOG), 2023

Hsin-Ying Lee

206

06 Jul 2023

SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs

Neural Information Processing Systems (NeurIPS), 2023

...

Alexander G. Hauptmann

Lu Jiang

MLLM

362

30 Jun 2023

DisCo: Disentangled Control for Realistic Human Dance Generation

Computer Vision and Pattern Recognition (CVPR), 2023

Hanwang Zhang

Zicheng Liu

Lijuan Wang

VGen

453

132

30 Jun 2023

Action-conditioned Deep Visual Prediction with RoAM, a new Indoor Human Motion Dataset for Autonomous Robots

IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2023

187

28 Jun 2023

GD-VDM: Generated Depth for better Diffusion-based Video Generation

213

19 Jun 2023

MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images

ACM Multimedia (ACM MM), 2023

Lianli Gao

Jingkuan Song

Jianlong Fu

VGen DiffM

172

12 Jun 2023