Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
2204.03458
Cited By
v1
v2 (latest)
Video Diffusion Models
Neural Information Processing Systems (NeurIPS), 2022
7 April 2022
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffM
VGen
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (5 upvotes)
Papers citing
"Video Diffusion Models"
50 / 1,538 papers shown
Title
FreeGen: Feed-Forward Reconstruction-Generation Co-Training for Free-Viewpoint Driving Scene Synthesis
Shijie Chen
Peixi Peng
VGen
3DV
424
0
0
04 Dec 2025
Diffusion Fine-Tuning via Reparameterized Policy Gradient of the Soft Q-Function
Hyeongyu Kang
Jaewoo Lee
Woocheol Shin
Kiyoung Om
Jinkyoo Park
68
0
0
04 Dec 2025
BulletTime: Decoupled Control of Time and Camera Pose for Video Generation
Yiming Wang
Qihang Zhang
S. Cai
Tong Wu
Jan Ackermann
Zhengfei Kuang
Yang Zheng
Frano Rajič
Siyu Tang
Gordon Wetzstein
DiffM
VGen
140
0
0
04 Dec 2025
VideoSSM: Autoregressive Long Video Generation with Hybrid State-Space Memory
Yifei Yu
Xiaoshan Wu
Xinting Hu
Tao Hu
Yangtian Sun
...
Bo Wang
Lin Ma
Yuewen Ma
Zhongrui Wang
Xiaojuan Qi
DiffM
VGen
138
1
0
04 Dec 2025
Denoise to Track: Harnessing Video Diffusion Priors for Robust Correspondence
Tianyu Yuan
Yuanbo Yang
Lin Chen
Yao Yao
Zhuzhong Qian
DiffM
VGen
203
0
0
04 Dec 2025
Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image
Yanran Zhang
Ziyi Wang
Wenzhao Zheng
Zheng Zhu
Jie Zhou
Jiwen Lu
VGen
3DV
193
0
0
04 Dec 2025
Inference-time Stochastic Refinement of GRU-Normalizing Flow for Real-time Video Motion Transfer
Tasmiah Haque
Srinjoy Das
AI4TS
52
0
0
03 Dec 2025
AdaPower: Specializing World Foundation Models for Predictive Manipulation
Yuhang Huang
SHilong Zou
J. Zhang
Xinwang Liu
Ruizhen Hu
Kai Xu
48
0
0
03 Dec 2025
FloodDiffusion: Tailored Diffusion Forcing for Streaming Motion Generation
Yiyi Cai
Y. Wu
Kunhang Li
You Zhou
Bo Zheng
Haiyang Liu
VGen
93
0
0
03 Dec 2025
Video2Act: A Dual-System Video Diffusion Policy with Robotic Spatio-Motional Modeling
Yueru Jia
Jiaming Liu
Shengbang Liu
Rui Zhou
W. Yu
Yuyang Yan
Xiaowei Chi
Yandong Guo
Boxin Shi
Shanghang Zhang
VGen
296
1
0
02 Dec 2025
Taming Camera-Controlled Video Generation with Verifiable Geometry Reward
Zhaoqing Wang
Xiaobo Xia
Zhuolin Bie
Jinlin Liu
Dongdong Yu
Jia-Wang Bian
Changhu Wang
EGVM
VGen
145
0
0
02 Dec 2025
RULER-Bench: Probing Rule-based Reasoning Abilities of Next-level Video Generation Models for Vision Foundation Intelligence
Xuming He
Zehao Fan
Hengjia Li
Fan Zhuo
Hankun Xu
Senlin Cheng
Di Weng
Haifeng Liu
Can Ye
Boxi Wu
VGen
ELM
184
0
0
02 Dec 2025
A Diffusion Model Framework for Maximum Entropy Reinforcement Learning
Sebastian Sanokowski
Kaustubh Patil
Alois Knoll
DiffM
80
0
0
01 Dec 2025
Objects in Generated Videos Are Slower Than They Appear: Models Suffer Sub-Earth Gravity and Don't Know Galileo's Principle...for now
Varun Varma Thozhiyoor
Shivam Tripathi
Venkatesh Babu Radhakrishnan
Anand Bhattad
VGen
64
0
0
01 Dec 2025
Spatiotemporal Pyramid Flow Matching for Climate Emulation
Jeremy Irvin
Jiaqi Han
Z. Wang
Abdulaziz Alharbi
Yufei Zhao
Nomin-Erdene Bayarsaikhan
Daniele Visioni
A. Ng
Duncan Watson-Parris
AI4TS
84
0
0
01 Dec 2025
Generative Action Tell-Tales: Assessing Human Motion in Synthesized Videos
Xavier Thomas
Youngsun Lim
Ananya Srinivasan
Audrey Zheng
Deepti Ghadiyaram
EGVM
VGen
316
0
0
01 Dec 2025
SpriteHand: Real-Time Versatile Hand-Object Interaction with Autoregressive Video Generation
Zisu Li
Hengye Lyu
Jiaxin Shi
Yufeng Zeng
Mingming Fan
Hanwang Zhang
Chen Liang
VGen
152
0
0
01 Dec 2025
TalkingPose: Efficient Face and Gesture Animation with Feedback-guided Diffusion Model
Alireza Javanmardi
Pragati Jaiswal
T. Habtegebrial
Christen Millerdurai
Shaoxiang Wang
A. Pagani
Didier Stricker
DiffM
VGen
106
0
0
30 Nov 2025
What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards
Minh-Quan Le
Yuanzhi Zhu
Vicky Kalogeiton
Dimitris Samaras
EGVM
VGen
87
0
0
29 Nov 2025
DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation
Hongfei Zhang
Kanghao Chen
Zixin Zhang
Harold Haodong Chen
Yuanhuiyi Lyu
Yuqi Zhang
Shuai Yang
Kun Zhou
Yingcong Chen
DiffM
VGen
150
1
0
28 Nov 2025
Flow Straighter and Faster: Efficient One-Step Generative Modeling via MeanFlow on Rectified Trajectories
Xinxi Zhang
Shiwei Tan
Quang Nguyen
Quan Dao
Ligong Han
Xiaoxiao He
Tunyu Zhang
Alen Mrdovic
Dimitris N. Metaxas
244
0
0
28 Nov 2025
Toward Diffusible High-Dimensional Latent Spaces: A Frequency Perspective
Bolin Lai
Xudong Wang
Saketh Rambhatla
James M. Rehg
Zsolt Kira
Rohit Girdhar
Ishan Misra
DiffM
104
0
0
27 Nov 2025
ITS3D: Inference-Time Scaling for Text-Guided 3D Diffusion Models
Zhenglin Zhou
Fan Ma
Xiaobo Xia
Hehe Fan
Yi Yang
Tat-Seng Chua
DiffM
3DGS
105
0
0
27 Nov 2025
HybridWorldSim: A Scalable and Controllable High-fidelity Simulator for Autonomous Driving
Qiang Li
Yingwenqi Jiang
Tuoxi Li
Duyu Chen
Xiang Feng
...
Bingtao Gao
Xueyuan Wang
Shuchang Zhou
Xianming Liu
Ligang Liu
152
0
0
27 Nov 2025
Closed-Loop Transformers: Autoregressive Modeling as Iterative Latent Equilibrium
Akbar Anbar Jafari
G. Anbarjafari
62
0
0
26 Nov 2025
FaithFusion: Harmonizing Reconstruction and Generation via Pixel-wise Information Gain
Y. Wang
Xiaofan Li
Chi Huang
Wenhao Zhang
Hao Li
Bosheng Wang
Xun Sun
Jun Wang
DiffM
190
0
0
26 Nov 2025
Efficient Training for Human Video Generation with Entropy-Guided Prioritized Progressive Learning
Changlin Li
Jiawei Zhang
Shuhao Liu
Sihao Lin
Z. Shi
Zhihui Li
Xiaojun Chang
DiffM
VGen
258
0
0
26 Nov 2025
MotionV2V: Editing Motion in a Video
R. Burgert
Charles Herrmann
Forrester Cole
Michael S. Ryoo
Neal Wadhwa
Andrey Voynov
Nataniel Ruiz
DiffM
VGen
226
0
0
25 Nov 2025
Exo2EgoSyn: Unlocking Foundation Video Generation Models for Exocentric-to-Egocentric Video Synthesis
Mohammad Mahdi
Yuqian Fu
N. Savov
Jiancheng Pan
Danda Pani Paudel
Luc Van Gool
VGen
193
1
0
25 Nov 2025
Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout
Hidir Yesiltepe
Tuna Han Salih Meral
Adil Kaan Akan
Kaan Oktay
Pinar Yanardag
VGen
201
3
0
25 Nov 2025
STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows
Jiatao Gu
Ying Shen
Tianrong Chen
Laurent Dinh
Y. Wang
Miguel Angel Bautista
David Berthelot
Josh Susskind
Shuangfei Zhai
DiffM
VGen
294
3
0
25 Nov 2025
Learning Plug-and-play Memory for Guiding Video Diffusion Models
Selena Song
Ziming Xu
Zijun Zhang
Kun Zhou
Jiaxian Guo
Lianhui Qin
Biwei Huang
VGen
276
0
0
24 Nov 2025
MonoMSK: Monocular 3D Musculoskeletal Dynamics Estimation
Farnoosh Koleini
Hongfei Xue
Ahmed Helmy
Pu Wang
243
0
0
24 Nov 2025
View-Consistent Diffusion Representations for 3D-Consistent Video Generation
Duolikun Danier
Ge Gao
Steven McDonagh
Changjian Li
Hakan Bilen
Oisin Mac Aodha
DiffM
VGen
134
0
0
24 Nov 2025
MagicWorld: Interactive Geometry-driven Video World Exploration
Guangyuan Li
Siming Zheng
Shuolin Xu
Jinwei Chen
Bo Li
Xiaobin Hu
Lei Zhao
Peng-Tao Jiang
VGen
133
0
0
24 Nov 2025
Predicting partially observable dynamical systems via diffusion models with a multiscale inference scheme
Rudy Morel
Francesco Pio Ramunno
Jeff Shen
Alberto Bietti
Kyunghyun Cho
...
François Rozet
K. Leka
F. Lanusse
David Fouhey
Shirley Ho
DiffM
AI4CE
424
0
0
24 Nov 2025
Demystifying Diffusion Objectives: Reweighted Losses are Better Variational Bounds
Jiaxin Shi
Michalis K. Titsias
DiffM
254
0
0
24 Nov 2025
The Locally Deployable Virtual Doctor: LLM Based Human Interface for Automated Anamnesis and Database Conversion
Jan Benedikt Ruhland
Doguhan Bahcivan
Jan-Peter Sowa
A. Canbay
D. Heider
MedIm
158
0
0
23 Nov 2025
Zero-Shot Video Deraining with Video Diffusion Models
Tuomas Varanka
Juan Luis Gonzalez
Hyeongwoo Kim
Pablo Garrido
Xu Yao
DiffM
VGen
148
0
0
23 Nov 2025
EgoControl: Controllable Egocentric Video Generation via 3D Full-Body Poses
Enrico Pallotta
Sina Mokhtarzadeh Azar
Lars Doorenbos
Serdar Ozsoy
Umar Iqbal
Juergen Gall
DiffM
VGen
123
0
0
22 Nov 2025
Counterfactual World Models via Digital Twin-conditioned Video Diffusion
Yiqing Shen
Aiza Maksutova
Chenjia Li
Mathias Unberath
DiffM
VGen
165
0
0
21 Nov 2025
PostCam: Camera-Controllable Novel-View Video Generation with Query-Shared Cross-Attention
Yipeng Chen
Zhichao Ye
Zhenzhou Fang
Xinyu Chen
Xiaoyu Zhang
Jialing Liu
Nan Wang
Haomin Liu
Guofeng Zhang
DiffM
VGen
170
1
0
21 Nov 2025
Show Me: Unifying Instructional Image and Video Generation with Diffusion Models
Yujiang Pu
Zhanbo Huang
Vishnu Boddeti
Yu Kong
DiffM
VGen
108
0
0
21 Nov 2025
Flow and Depth Assisted Video Prediction with Latent Transformer
Eliyas Suleyman
Paul Henderson
Eksan Firkat
Nicolas Pugeault
146
0
0
20 Nov 2025
Generative Photographic Control for Scene-Consistent Video Cinematic Editing
Huiqiang Sun
Liao Shen
Zhan Peng
Kun Wang
Size Wu
...
Z. Huang
Xingyu Zeng
Zhiguo Cao
Wei Li
Chen Change Loy
DiffM
VGen
170
0
0
17 Nov 2025
Recurrent Autoregressive Diffusion: Global Memory Meets Local Attention
Taiye Chen
Zihan Ding
Anjian Li
Christina Zhang
Zeqi Xiao
Yisen Wang
Chi Jin
VGen
165
1
0
17 Nov 2025
Chemistry-Enhanced Diffusion-Based Framework for Small-to-Large Molecular Conformation Generation
Yifei Zhu
J. Zhang
Jiawei Peng
Mengge Li
Chao Xu
Zhenggang Lan
DiffM
124
0
0
15 Nov 2025
ProAV-DiT: A Projected Latent Diffusion Transformer for Efficient Synchronized Audio-Video Generation
Jiahui Sun
Weining Wang
Mingzhen Sun
Y. Yang
Xinxin Zhu
Jing Liu
DiffM
VGen
187
0
0
15 Nov 2025
Adaptive Begin-of-Video Tokens for Autoregressive Video Diffusion Models
Tianle Cheng
Zeyan Zhang
Kaifeng Gao
Jun Xiao
DiffM
VGen
243
0
0
15 Nov 2025
A Best-of-Both-Worlds Proof for Tsallis-INF without Fenchel Conjugates
Wei-Cheng Lee
Francesco Orabona
120
0
0
14 Nov 2025
1
2
3
4
...
29
30
31
Next