v1v2 (latest)

Video Diffusion Models

Neural Information Processing Systems (NeurIPS), 2022

7 April 2022

David J. Fleet

ArXiv (abs)PDF HTML HuggingFace (5 upvotes)

Papers citing "Video Diffusion Models"

50 / 1,552 papers shown

DynamiCrafter: Animating Open-domain Images with Video Diffusion PriorsEuropean Conference on Computer Vision (ECCV), 2023

Yong Zhang

Tien-Tsin Wong

Ying Shan

VGen

323

429

18 Oct 2023

EvalCrafter: Benchmarking and Evaluating Large Video Generation ModelsComputer Vision and Pattern Recognition (CVPR), 2023

Xiaodong Cun

Yong Zhang

Ying Shan

378

254

17 Oct 2023

Towards Generic Semi-Supervised Framework for Volumetric Medical Image SegmentationNeural Information Processing Systems (NeurIPS), 2023

Haonan Wang

Xiaomeng Li

291

17 Oct 2023

Elucidating The Design Space of Classifier-Guided Diffusion GenerationInternational Conference on Learning Representations (ICLR), 2023

Jiajun Ma

Tianyang Hu

Wei Cao

Jiacheng Sun

271

17 Oct 2023

A Survey on Video Diffusion ModelsACM Computing Surveys (ACM Comput. Surv.), 2023

Zuxuan Wu

473

242

16 Oct 2023

Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2023

426

257

16 Oct 2023

LOVECon: Text-driven Training-Free Long Video Editing with ControlNet

Zhenyi Liao

Zhijie Deng

DiffM

249

15 Oct 2023

Towards More Accurate Diffusion Model Acceleration with A Timestep Tuner

219

14 Oct 2023

DDMT: Denoising Diffusion Mask Transformer Models for Multivariate Time Series Anomaly Detection

327

13 Oct 2023

Compositional Abilities Emerge Multiplicatively: Exploring Diffusion Models on a Synthetic TaskNeural Information Processing Systems (NeurIPS), 2023

564

13 Oct 2023

A Sampling-Based Domain Generalization Study with Diffusion Generative Models

375

13 Oct 2023

Learning to Act from Actionless Videos through Dense CorrespondencesInternational Conference on Learning Representations (ICLR), 2023

Jiayuan Mao

381

165

12 Oct 2023

Consistent123: Improve Consistency for One Image to 3D Object Synthesis

Haohan Weng

Tianyu Yang

Jianan Wang

Yu Li

Tong Zhang

Chong Chen

Lei Zhang

DiffM

234

12 Oct 2023

Efficient Integrators for Diffusion Generative ModelsInternational Conference on Learning Representations (ICLR), 2023

198

11 Oct 2023

ConditionVideo: Training-Free Condition-Guided Text-to-Video GenerationAAAI Conference on Artificial Intelligence (AAAI), 2023

Yu Qiao

240

11 Oct 2023

State of the Art on Diffusion Models for Visual Computing

Kfir Aberman

...

Matthias Nießner

298

165

11 Oct 2023

Echocardiography video synthesis from end diastolic semantic map via diffusion modelIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

185

11 Oct 2023

Latent Diffusion Model for DNA Sequence Generation

Zehui Li

Yuhao Ni

Tim August B. Huygelen

185

09 Oct 2023

Learning Interactive Real-World SimulatorsInternational Conference on Learning Representations (ICLR), 2023

Pieter Abbeel

361

354

09 Oct 2023

FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editingInternational Conference on Learning Representations (ICLR), 2023

Juan-Manuel Perez-Rua

349

150

09 Oct 2023

Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation

...

Ming-Hsuan Yang

456

556

09 Oct 2023

IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image PromptsInternational Conference on Learning Representations (ICLR), 2023

...

Conghui He

311

09 Oct 2023

VoiceExtender: Short-utterance Text-independent Speaker Verification with Guided Diffusion ModelAutomatic Speech Recognition & Understanding (ASRU), 2023

190

07 Oct 2023

Aligning Text-to-Image Diffusion Models with Reward Backpropagation

530

225

05 Oct 2023

Stochastic interpolants with data-dependent couplingsInternational Conference on Machine Learning (ICML), 2023

314

05 Oct 2023

MedSyn: Text-guided Anatomy-aware Synthesis of High-Fidelity 3D CT ImagesIEEE Transactions on Medical Imaging (TMI), 2023

Yanwu Xu

Kayhan Batmanghelich

375

05 Oct 2023

Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent DiffusionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

350

126

05 Oct 2023

EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2023

Bohan Zhuang

574

05 Oct 2023

Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic ModelsIEEE International Conference on Computer Vision (ICCV), 2023

209

03 Oct 2023

Score-based Data Assimilation for a Two-Layer Quasi-Geostrophic Model

Sacha Lewin

Gilles Louppe

279

03 Oct 2023

Sequential Data Generation with Groupwise Diffusion Process

345

02 Oct 2023

FashionFlow: Leveraging Diffusion Models for Dynamic Fashion Video Synthesis from Static Imagery

293

29 Sep 2023

GAIA-1: A Generative World Model for Autonomous Driving

Masane Fuchi

Lloyd Russell

Hudson Yeo

Alex Kendall

Gianluca Corrado

407

452

29 Sep 2023

AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive ComputationEuropean Conference on Computer Vision (ECCV), 2023

Yaqing Wang

Dongkuan Xu

274

29 Sep 2023

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video GenerationInternational Journal of Computer Vision (IJCV), 2023

646

304

27 Sep 2023

Warfare:Breaking the Watermark Protection of AI-Generated Content

Shangwei Guo

Jiwei Li

Tianwei Zhang

WIGM

305

27 Sep 2023

LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion ModelsInternational Journal of Computer Vision (IJCV), 2023

...

Yu Qiao

Ziwei Liu

VGen DiffM

293

332

26 Sep 2023

A Simple Text to Video Model via Transformer

Gang Chen

ViT

26 Sep 2023

GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodERNeural Information Processing Systems (NeurIPS), 2023

183

23 Sep 2023

A Diffusion-Model of Joint Interactive NavigationNeural Information Processing Systems (NeurIPS), 2023

...

265

21 Sep 2023

DreamLLM: Synergistic Multimodal Comprehension and CreationInternational Conference on Learning Representations (ICLR), 2023

Runpei Dong

Chunrui Han

Yuang Peng

...

Xiangyu Zhang

344

290

20 Sep 2023

A Generative Framework for Self-Supervised Facial Representation Learning

Ruian He

Zhen Xing

Weimin Tan

Bo Yan

DiffM

358

15 Sep 2023

VRDMG: Vocal Restoration via Diffusion Posterior Sampling with Multiple GuidanceIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Carlos Hernandez-Olivan

Koichi Saito

Naoki Murata

Chieh-Hsin Lai

Marco A. Martínez-Ramírez

Wei-Hsiang Liao

Yuki Mitsufuji

DiffM

193

13 Sep 2023

Adapt and Diffuse: Sample-adaptive Reconstruction via Latent Diffusion ModelsInternational Conference on Machine Learning (ICML), 2023

264

12 Sep 2023

InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image GenerationInternational Conference on Learning Representations (ICLR), 2023

Qiang Liu

620

324

12 Sep 2023

SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2023

549

10 Sep 2023

Variations and Relaxations of Normalizing Flows

276

08 Sep 2023

The Power of Sound (TPoS): Audio Reactive Video Generation with Stable DiffusionIEEE International Conference on Computer Vision (ICCV), 2023

182

08 Sep 2023

SMPLitex: A Generative Model and Dataset for 3D Human Texture Estimation from Single ImageBritish Machine Vision Conference (BMVC), 2023

Dan Casas

M. C. Trinidad

3DH 3DGS

315

04 Sep 2023

Benchmarking Autoregressive Conditional Diffusion Models for Turbulent Flow Simulation

370

04 Sep 2023