Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2204.03458
Cited By
v1
v2 (latest)
Video Diffusion Models
Neural Information Processing Systems (NeurIPS), 2022
7 April 2022
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffM
VGen
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (5 upvotes)
Papers citing
"Video Diffusion Models"
50 / 1,539 papers shown
Preacher: Paper-to-Video Agentic System
Jingwei Liu
Ling Yang
Hao Luo
Fan Wang
Hongyan Li
M. Y. Wang
DiffM
VGen
455
2
0
13 Aug 2025
OneVAE: Joint Discrete and Continuous Optimization Helps Discrete Video VAE Train Better
Yupeng Zhou
Zhen Li
Ziheng Ouyang
Yuming Chen
Ruoyi Du
...
Bin Fu
Yihao Liu
Peng Gao
Ming-Ming Cheng
Qibin Hou
204
1
0
13 Aug 2025
Towards Safe Imitation Learning via Potential Field-Guided Flow Matching
Haoran Ding
Anqing Duan
Zezhou Sun
Leonel Rozo
Noémie Jaquier
Dezhen Song
Yoshihiko Nakamura
140
0
0
12 Aug 2025
Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models
Wen Wang
Bozhen Fang
Chenchen Jing
Yongliang Shen
Yangyi Shen
Qiuyu Wang
Hao Ouyang
Hao Chen
Chunhua Shen
DiffM
AI4CE
208
15
0
12 Aug 2025
Preview WB-DH: Towards Whole Body Digital Human Bench for the Generation of Whole-body Talking Avatar Videos
Chaoyi Wang
Yifan Yang
Jun Pei
Lijie Xia
Jianpo Liu
Xiaobing Yuan
Xinhan Di
VGen
92
0
0
12 Aug 2025
DiffPose-Animal: A Language-Conditioned Diffusion Framework for Animal Pose Estimation
Tianyu Xiong
Dayi Tan
Wei Tian
149
0
0
12 Aug 2025
CD-TVD: Contrastive Diffusion for 3D Super-Resolution with Scarce High-Resolution Time-Varying Data
Chongke Bi
Xin Gao
Jiangkang Deng
Guan
Jun Han
DiffM
167
1
0
11 Aug 2025
LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation
Wenhui Song
Hanhui Li
Jiehui Huang
Panwen Hu
Yuhao Cheng
Long Chen
Yiqiang Yan
Xiaodan Liang
DiffM
VGen
145
2
0
11 Aug 2025
Learning an Implicit Physics Model for Image-based Fluid Simulation
Emily Yue-Ting Jia
Jiageng Mao
Zhiyuan Gao
Yajie Zhao
Yue Wang
3DH
VGen
AI4CE
79
0
0
11 Aug 2025
S^2VG: 3D Stereoscopic and Spatial Video Generation via Denoising Frame Matrix
Peng Dai
Feitong Tan
Qiangeng Xu
Yihua Huang
David Futschik
Ruofei Du
S. Fanello
Yinda Zhang
Xiaojuan Qi
VGen
137
0
0
11 Aug 2025
CObL: Toward Zero-Shot Ordinal Layering without User Prompting
Aneel Damaraju
D. Hazineh
Todd E. Zickler
BDL
124
0
0
11 Aug 2025
Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers
Xin Ma
Yaohui Wang
Genyun Jia
Xinyuan Chen
Tien-Tsin Wong
C. L. P. Chen
VGen
160
0
0
10 Aug 2025
Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation
Yue Liao
Pengfei Zhou
Siyuan Huang
Donglin Yang
Shengcong Chen
...
Jianlan Luo
Liliang Chen
Shuicheng Yan
Maoqing Yao
Maoqing Yao
VGen
LM&Ro
282
24
0
07 Aug 2025
Intention Enhanced Diffusion Model for Multimodal Pedestrian Trajectory Prediction
Yu Liu
Zhijie Liu
Xiao Ren
You-Fu Li
He Kong
72
1
0
06 Aug 2025
LayerT2V: Interactive Multi-Object Trajectory Layering for Video Generation
Kangrui Cen
Baixuan Zhao
Yi Xin
Siqi Luo
Guoquan Zheng
Xiaohong Liu
DiffM
VGen
144
0
0
06 Aug 2025
Macro-from-Micro Planning for High-Quality and Parallelized Autoregressive Long Video Generation
Xunzhi Xiang
Y. Chen
Guiyu Zhang
Zhongyu Wang
Zhe Gao
...
Haibin Huang
Yang Gao
C. Zhang
Qi Fan
Xuelong Li
DiffM
VGen
202
5
0
05 Aug 2025
Fine-Tuning Text-to-Speech Diffusion Models Using Reinforcement Learning with Human Feedback
Jingyi Chen
Ju-Seung Byun
Micha Elsner
Pichao Wang
Andrew Perrault
105
1
0
05 Aug 2025
Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis
Kaiyang Ji
Ye-ling Shi
Zichen Jin
Kangyi Chen
Yongjian Luo
Y. Ma
Jingyi Yu
Jingya Wang
155
5
0
04 Aug 2025
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference
Yuxuan Song
Zheng Zhang
Cheng-hsin Luo
Pengyang Gao
Fan Xia
...
Jingjing Liu
Wei-Ying Ma
Y. Zhang
Yonghui Wu
Hao Zhou
VLM
164
66
0
04 Aug 2025
QuaDreamer: Controllable Panoramic Video Generation for Quadruped Robots
Sheng Wu
Fei Teng
Hao Shi
Qi Jiang
Kai Luo
Kaiwei Wang
Kailun Yang
VGen
254
1
0
04 Aug 2025
DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing
Yufeng Chi
Huimin Ma
Kafeng Wang
Jianmin Li
3DGS
139
2
0
03 Aug 2025
DBLP: Noise Bridge Consistency Distillation For Efficient And Reliable Adversarial Purification
Chihan Huang
Belal Alsinglawi
Islam Al-qudah
DiffM
AAML
167
0
0
01 Aug 2025
Unraveling Hidden Representations: A Multi-Modal Layer Analysis for Better Synthetic Content Forensics
Tom Or
Omri Azencot
AAML
188
1
0
01 Aug 2025
GuidPaint: Class-Guided Image Inpainting with Diffusion Models
Qimin Wang
Xinda Liu
Guohua Geng
DiffM
234
0
0
29 Jul 2025
Reconstructing 4D Spatial Intelligence: A Survey
Yukang Cao
Jiahao Lu
Z. Huang
Zhuowei Shen
Chengfeng Zhao
...
Z. Chen
Xin Li
Wenping Wang
Yuan Liu
Ziwei Liu
VGen
351
8
0
28 Jul 2025
JWB-DH-V1: Benchmark for Joint Whole-Body Talking Avatar and Speech Generation Version 1
Xinhan Di
Kristin Qi
Pengqian Yu
DiffM
VGen
221
0
0
28 Jul 2025
Compositional Video Synthesis by Temporal Object-Centric Learning
Adil Kaan Akan
Yucel Yemez
DiffM
OCL
234
0
0
28 Jul 2025
MagicAnime: A Hierarchically Annotated, Multimodal and Multitasking Dataset with Benchmarks for Cartoon Animation Generation
Shuolin Xu
Bingyuan Wang
Zeyu Cai
Fangteng Fu
Yue Ma
Tongyi Lee
Hongchuan Yu
Zeyu Wang
VGen
171
1
0
27 Jul 2025
SonicGauss: Position-Aware Physical Sound Synthesis for 3D Gaussian Representations
Chunshi Wang
Hongxing Li
Yawei Luo
3DGS
115
1
0
26 Jul 2025
ChoreoMuse: Robust Music-to-Dance Video Generation with Style Transfer and Beat-Adherent Motion
Xuanchen Wang
Heng Wang
Weidong (Tom) Cai
226
3
0
26 Jul 2025
HumanSAM: Classifying Human-centric Forgery Videos in Human Spatial, Appearance, and Motion Anomaly
Chang Liu
Yunfan Ye
Fan Zhang
Q. Zhou
Yuchuan Luo
Zhiping Cai
253
2
0
26 Jul 2025
A Comprehensive Review of Diffusion Models in Smart Agriculture: Progress, Applications, and Challenges
Xing Hua
Haodong Chen
Qianqian Duan
Danfeng Hong
Ruijiao Li
Huiliang Shang
MedIm
432
2
0
24 Jul 2025
Unmasking Synthetic Realities in Generative AI: A Comprehensive Review of Adversarially Robust Deepfake Detection Systems
Naseem Khan
Tuan Nguyen
Amine Bermak
Issa Khalil
AAML
218
3
0
24 Jul 2025
Captain Cinema: Towards Short Movie Generation
Junfei Xiao
Ceyuan Yang
Lvmin Zhang
S. Cai
Yang Zhao
Yuwei Guo
Gordon Wetzstein
Maneesh Agrawala
Alan Yuille
Lu Jiang
DiffM
VGen
178
20
0
24 Jul 2025
Improving Multislice Electron Ptychography with a Generative Prior
Christian K. Belardi
Chia-Hao Lee
Yingheng Wang
Justin Lovelace
Kilian Q. Weinberger
David A. Muller
Daniel Schwalbe-Koda
DiffM
MedIm
294
3
0
23 Jul 2025
An h-space Based Adversarial Attack for Protection Against Few-shot Personalization
Xide Xu
Sandesh Kamath
Muhammad Atif Butt
Bogdan Raducanu
DiffM
AAML
153
0
0
23 Jul 2025
Sparse-View 3D Reconstruction: Recent Advances and Open Challenges
Tanveer Younis
Zhanglin Cheng
3DGS
201
1
0
22 Jul 2025
PUSA V1.0: Surpassing Wan-I2V with $500 Training Cost by Vectorized Timestep Adaptation
Yaofang Liu
Y. Ren
Aitor Artola
Yuxuan Hu
Xiaodong Cun
...
Raymond H. F. Chan
Suiyun Zhang
Rui Liu
Dandan Tu
Jean-Michel Morel
DiffM
VGen
189
1
0
22 Jul 2025
CHORDS: Diffusion Sampling Accelerator with Multi-core Hierarchical ODE Solvers
Jiaqi Han
Haotian Ye
Puheng Li
Minkai Xu
James Zou
Stefano Ermon
DiffM
216
0
0
21 Jul 2025
Distilling Parallel Gradients for Fast ODE Solvers of Diffusion Models
B. Zhu
Ruoyu Wang
Tong Zhao
Hanwang Zhang
Chi Zhang
DiffM
143
4
0
20 Jul 2025
Light Future: Multimodal Action Frame Prediction via InstructPix2Pix
Zesen Zhong
Duomin Zhang
Yijia Li
VGen
268
0
0
20 Jul 2025
Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey
Jiahui Zhang
Yuelei Li
Anpei Chen
Muyu Xu
Kunhao Liu
...
Hanspeter Pfister
Paul Liang
Shijian Lu
Fangneng Zhan
Fangneng Zhan
638
8
0
19 Jul 2025
VITA: Vision-to-Action Flow Matching Policy
D. Gao
Boqi Zhao
Andrew Lee
Ian Chuang
Hanchu Zhou
Hang Wang
Zhe Zhao
Junshan Zhang
Iman Soltani
VGen
214
3
0
17 Jul 2025
RODS: Robust Optimization Inspired Diffusion Sampling for Detecting and Reducing Hallucination in Generative Models
Yiqi Tian
Pengfei Jin
Mingze Yuan
Na Li
Bo Zeng
Shijie Zhao
DiffM
154
0
0
16 Jul 2025
Contrastive Conditional-Unconditional Alignment for Long-tailed Diffusion Model
Fang Chen
Alex Villa
Gongbo Liang
Xiaoyi Lu
Meng Tang
164
1
0
11 Jul 2025
Beyond Scores: Proximal Diffusion Models
Zhenghan Fang
Mateo Díaz
Sam Buchanan
Jeremias Sulam
DiffM
147
2
0
11 Jul 2025
Inference-Time Scaling of Diffusion Language Models with Particle Gibbs Sampling
Meihua Dang
Jiaqi Han
Minkai Xu
Kai Xu
Akash Srivastava
Stefano Ermon
DiffM
116
7
0
11 Jul 2025
Upsample What Matters: Region-Adaptive Latent Sampling for Accelerated Diffusion Transformers
Wongi Jeong
Kyungryeol Lee
H. Seo
Se Young Chun
203
5
0
11 Jul 2025
Identity-Preserving Text-to-Video Generation Guided by Simple yet Effective Spatial-Temporal Decoupled Representations
Yuji Wang
Moran Li
Xiaobin Hu
Ran Yi
J. Zhang
Han Feng
Weijian Cao
Yabiao Wang
Chengjie Wang
Lizhuang Ma
VGen
DiffM
294
2
0
07 Jul 2025
Discrete Diffusion Trajectory Alignment via Stepwise Decomposition
Jiaqi Han
Austin Wang
Minkai Xu
Wenda Chu
Meihua Dang
Yisong Yue
Stefano Ermon
181
4
0
07 Jul 2025
Previous
1
2
3
4
5
6
...
29
30
31
Next