Video Diffusion Models

Neural Information Processing Systems (NeurIPS), 2022

7 April 2022

David J. Fleet

Papers citing "Video Diffusion Models"

50 / 1,538 papers shown

PriorGuide: Test-Time Prior Adaptation for Simulation-Based Inference

140

15 Oct 2025

Mitigating the Noise Shift for Denoising Generative Models via Noise Awareness Guidance

104

14 Oct 2025

VIDMP3: Video Editing by Representing Motion with Pose and Position Priors

122

14 Oct 2025

BIGFix: Bidirectional Image Generation with Token Fixing

136

14 Oct 2025

LikePhys: Evaluating Intuitive Physics Understanding in Video Diffusion Models via Likelihood Preference

163

13 Oct 2025

Understanding Sampler Stochasticity in Training Diffusion Models for RLHF

139

12 Oct 2025

Multi-Scale Diffusion Transformer for Jointly Simulating User Mobility and Mobile Traffic Pattern

11 Oct 2025

Stable Video Infinity: Infinite-Length Video Generation with Error Recycling

146

10 Oct 2025

TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

166

09 Oct 2025

An approach for systematic decomposition of complex LLM tasks

147

09 Oct 2025

A Diffusion Model for Regular Time Series Generation from Irregular Data with Completion and Masking

157

08 Oct 2025

Vision-Language-Action Models for Robotics: A Review Towards Real-World Applications

IEEE Access (IEEE Access), 2025

259

08 Oct 2025

Drive&Gen: Co-Evaluating End-to-End Driving and Video Generation Models

...

101

07 Oct 2025

Riddled basin geometry sets fundamental limits to predictability and reproducibility in deep learning

Andrew Ly

Pulin Gong

AI4CE

180

07 Oct 2025

Mitigating Surgical Data Imbalance with Dual-Prediction Video Diffusion Model

Danush Kumar Venkatesh

Adam Schmidt

Muhammad Abdullah Jamal

Omid Mohareri

VGen MedIm

142

07 Oct 2025

Demystifying MaskGIT Sampler and Beyond: Adaptive Order Selection in Masked Diffusion

335

06 Oct 2025

Learning Robust Diffusion Models from Imprecise Supervision

336

03 Oct 2025

What Drives Compositional Generalization in Visual Generative Models?

313

03 Oct 2025

Learning to Generate Rigid Body Interactions with Video Diffusion Models

444

02 Oct 2025

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

225

02 Oct 2025

LVTINO: LAtent Video consisTency INverse sOlver for High Definition Video Restoration

171

01 Oct 2025

VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators

...

140

01 Oct 2025

Code2Video: A Code-centric Paradigm for Educational Video Generation

138

01 Oct 2025

Diffusion Alignment as Variational Expectation-Maximization

107

01 Oct 2025

PRPO: Paragraph-level Policy Optimization for Vision-Language Deepfake Detection

170

30 Sep 2025

Contrastive Diffusion Guidance for Spatial Inverse Problems

30 Sep 2025

3DiFACE: Synthesizing and Editing Holistic 3D Facial Animation

International Conference on 3D Vision (3DV), 2025

Balamurugan Thambiraja

152

30 Sep 2025

AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size

106

30 Sep 2025

VRWKV-Editor: Reducing quadratic complexity in transformer-based video editing

Abdelilah Aitrouga

Youssef Hmamouche

Amal El Fallah Seghrouchni

VGen

214

30 Sep 2025

Diffusion Bridge or Flow Matching? A Unifying Framework and Comparative Analysis

201

29 Sep 2025

UI2V-Bench: An Understanding-based Image-to-video Generation Benchmark

29 Sep 2025

Enhancing Physical Plausibility in Video Generation by Reasoning the Implausibility

140

29 Sep 2025

UniVid: The Open-Source Unified Video Model

276

29 Sep 2025

Diff-3DCap: Shape Captioning with Diffusion Models

IEEE Transactions on Visualization and Computer Graphics (TVCG), 2025

123

28 Sep 2025

Towards Redundancy Reduction in Diffusion Models for Efficient Video Super-Resolution

188

28 Sep 2025

Autoregressive Video Generation beyond Next Frames Prediction

166

28 Sep 2025

CREPE: Controlling Diffusion with Replica Exchange

José Miguel Hernández-Lobato

27 Sep 2025

ARSS: Taming Decoder-only Autoregressive Visual Generation for View Synthesis From Single View

154

27 Sep 2025

d²Cache: Accelerating Diffusion-Based LLMs via Dual Adaptive Caching

27 Sep 2025

JointDiff: Bridging Continuous and Discrete in Multi-Agent Trajectory Generation

124

26 Sep 2025

High-Quality Sound Separation Across Diverse Categories via Visually-Guided Generative Modeling

131

26 Sep 2025

Syncphony: Synchronized Audio-to-Video Generation with Diffusion Transformers

1.4K

26 Sep 2025

LongScape: Advancing Long-Horizon Embodied World Models with Context-Aware MoE

148

26 Sep 2025

X-Streamer: Unified Human World Modeling with Audiovisual Interaction

181

25 Sep 2025

DriftLite: Lightweight Drift Control for Inference-Time Scaling of Diffusion Models

184

25 Sep 2025

NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamics

Yu Yuan

Xijun Wang

Tharindu Wickremasinghe

1.5K

25 Sep 2025

PIRF: Physics-Informed Reward Fine-Tuning for Diffusion Models

139

24 Sep 2025

PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation

619

24 Sep 2025

Text Slider: Efficient and Plug-and-Play Continuous Concept Control for Image/Video Synthesis via LoRA Adapters

120

23 Sep 2025

How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective

...

319

23 Sep 2025