2204.03458
Cited By
v1
v2 (latest)
Video Diffusion Models
Neural Information Processing Systems (NeurIPS), 2022
7 April 2022
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffM
VGen
Papers citing "Video Diffusion Models"
50 / 1,543 papers shown
SVAD: From Single Image to 3D Avatar via Synthetic Data Generation with Video Diffusion and Data Augmentation
Yonwoo Choi
3DGS
VGen
291
1
0
08 May 2025
T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models
Xuyang Guo
Jiayan Huo
Zhenmei Shi
Zhao Song
Jiahao Zhang
Jiale Zhao
VGen
1.1K
8
0
08 May 2025
DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization
Wenchuan Wang
Mengqi Huang
Yijing Tu
Zhendong Mao
VGen
438
5
0
04 May 2025
VIDSTAMP: A Temporally-Aware Watermark for Ownership and Integrity in Video Diffusion Models
Mohammadreza Teymoorianfard
Siddarth Sitaraman
Shiqing Ma
Amir Houmansadr
WIGM
485
1
0
02 May 2025
FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis
Computer Vision and Pattern Recognition (CVPR), 2025
Jiangtong Tan
Hu Yu
Jie Huang
Jie Xiao
Feng Zhao
338
5
0
02 May 2025
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution
Antoni Bigata
Rodrigo Mira
Stella Bounareli
Michał Stypułkowski
Konstantinos Vougioukas
Stavros Petridis
Maja Pantic
341
6
0
01 May 2025
T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation
Xuyang Guo
Jiayan Huo
Zhenmei Shi
Zhao Song
Jiahao Zhang
Jiale Zhao
EGVM
VGen
PINN
488
22
0
01 May 2025
A Survey of Interactive Generative Video
Jiwen Yu
Yiran Qin
Haoxuan Che
Quande Liu
Xinyu Wang
Pengfei Wan
Di Zhang
Kun Gai
Hao Chen
Xihui Liu
VGen
435
16
0
30 Apr 2025
ReVision: Refining Video Diffusion with Explicit 3D Motion Modeling
Qihao Liu
Ju He
Qihang Yu
Liang-Chieh Chen
Alan Yuille
DiffM
VGen
515
5
0
30 Apr 2025
Direct Motion Models for Assessing Generated Videos
Kelsey R. Allen
Carl Doersch
Guangyao Zhou
Mohammed Suhail
Danny Driess
...
Thomas Kipf
Mehdi S. M. Sajjadi
Kevin P. Murphy
João Carreira
Sjoerd van Steenkiste
EGVM
DiffM
VGen
491
5
0
30 Apr 2025
ADiff4TPP: Asynchronous Diffusion Models for Temporal Point Processes
Amartya Mukherjee
Ruizhi Deng
He Zhao
Yuzhen Mao
Leonid Sigal
Frederick Tung
DiffM
AI4TS
277
0
0
29 Apr 2025
AnimateAnywhere: Rouse the Background in Human Image Animation
Xiaoyu Liu
Mingshuai Yao
Y. Zhang
Xianhui Lin
Peiran Ren
Xiaochen Li
Ming-Yu Liu
W. Zuo
3DH
DiffM
384
4
0
28 Apr 2025
Global Stress Generation and Spatiotemporal Super-Resolution Physics-Informed Operator under Dynamic Loading for Two-Phase Random Materials
Tengfei Xing
Xiaodan Ren
Jie Li
DiffM
315
0
0
26 Apr 2025
Stealing Creator's Workflow: A Creator-Inspired Agentic Framework with Iterative Feedback Loop for Improved Scientific Short-form Generation
J. Park
Maanas Taneja
Qianwen Wang
Luan Tuyen Chau
VGen
314
0
0
26 Apr 2025
We'll Fix it in Post: Improving Text-to-Video Generation with Neuro-Symbolic Feedback
Minkyu Choi
S P Sharan
Harsh Goel
Sahil Shah
Sandeep Chinchali
DiffM
VGen
425
4
0
24 Apr 2025
Synthetic Power Flow Data Generation Using Physics-Informed Denoising Diffusion Probabilistic Models
Junfei Wang
Darshana Upadhyay
Marzia Zaman
Pirathayini Srikantha
DiffM
169
1
0
24 Apr 2025
DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks
IEEE transactions on multimedia (TMM), 2025
Yinqi Li
Hong Chang
Ruibing Hou
Shiguang Shan
Xilin Chen
DiffM
327
1
0
24 Apr 2025
VideoMark: A Distortion-Free Robust Watermarking Framework for Video Diffusion Models
Xuming Hu
Haoyang Li
Jiajun Li
Yu Huang
Aiwei Liu
Qi Zheng
Junhao Chen
WIGM
VGen
510
6
0
23 Apr 2025
PMG: Progressive Motion Generation via Sparse Anchor Postures Curriculum Learning
Yingjie Xi
Jiangning Zhang
Xiaosong Yang
298
0
0
23 Apr 2025
DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment
Xuzhao Li
Chenming Wu
Zhao Yang
Zhihao Xu
Dingkang Liang
Yanzhe Zhang
Ji Wan
Jiadong Wang
VGen
410
7
0
22 Apr 2025
T2VShield: Model-Agnostic Jailbreak Defense for Text-to-Video Models
Yaning Tan
Jiayang Liu
Jiecheng Zhai
Tianmeng Fang
Rongcheng Tu
A. Liu
Xiaochun Cao
Dacheng Tao
VGen
377
12
0
22 Apr 2025
DRAGON: Distributional Rewards Optimize Diffusion Generative Models
Yatong Bai
Jonah Casebeer
Somayeh Sojoudi
Nicholas J. Bryan
DiffM
VLM
500
3
0
21 Apr 2025
DC4CR: When Cloud Removal Meets Diffusion Control in Remote Sensing
Zhenyu Yu
Mohd Yamani Idna Idris
Pei Wang
DiffM
275
0
0
21 Apr 2025
Emergence and Evolution of Interpretable Concepts in Diffusion Models
Berk Tinaz
Zalan Fabian
Mahdi Soltanolkotabi
DiffM
285
7
0
21 Apr 2025
MirrorVerse: Pushing Diffusion Models to Realistically Reflect the World
Computer Vision and Pattern Recognition (CVPR), 2025
Tao Lu
Manan Shah
R. V. Babu
299
1
0
21 Apr 2025
Solving New Tasks by Adapting Internet Video Knowledge
International Conference on Learning Representations (ICLR), 2025
Calvin Luo
Zilai Zeng
Yilun Du
Chen Sun
242
12
0
21 Apr 2025
FlowLoss: Dynamic Flow-Conditioned Loss Strategy for Video Diffusion Models
Kuanting Wu
Kei Ota
Asako Kanezaki
DiffM
VGen
367
0
0
20 Apr 2025
Entropic Time Schedulers for Generative Diffusion Models
Dejan Stancevic
Luca Ambrogioni
DiffM
OOD
344
3
0
18 Apr 2025
SkyReels-V2: Infinite-length Film Generative Model
Guibin Chen
D. Lin
Jiangping Yang
Chunze Lin
J. Zhu
...
Di Qiu
Debang Li
Zhengcong Fei
Yang Li
Yahui Zhou
DiffM
VGen
517
83
0
17 Apr 2025
The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Bingjie Gao
Xinyu Gao
Xiaoxue Wu
Yujie Zhou
Yu Qiao
Li Niu
Xinyuan Chen
Yaohui Wang
496
6
0
16 Apr 2025
LANGTRAJ: Diffusion Model and Dataset for Language-Conditioned Trajectory Simulation
Wei-Jer Chang
Weidong Zhan
Masayoshi Tomizuka
Manmohan Chandraker
Francesco Pittaluga
520
1
0
15 Apr 2025
OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding
Dianbing Xi
Jiadong Wang
Yuanzhi Liang
Xi Qiu
Yuchi Huo
Ruiqi Wang
Fangqiu Yi
Xuzhao Li
DiffM
VGen
607
12
0
15 Apr 2025
Analysis of Attention in Video Diffusion Transformers
Yuxin Wen
Jim Wu
Ajay Jain
Tom Goldstein
Ashwinee Panda
281
8
0
14 Apr 2025
On Equivariance and Fast Sampling in Video Diffusion Models Trained with Warped Noise
Chao Liu
Arash Vahdat
DiffM
VGen
402
5
0
14 Apr 2025
Scalable Motion In-betweening via Diffusion and Physics-Based Character Adaptation
Jia Qin
DiffM
VGen
236
0
0
13 Apr 2025
KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation
Xingrui Wang
Jiang-Long Liu
Liang Luo
Xiaodong Yu
Jialian Wu
Xingwu Sun
Yusheng Su
Yaoyao Liu
Zicheng Liu
Emad Barsoum
DiffM
VGen
283
4
0
13 Apr 2025
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model
Team Seawead
Ceyuan Yang
Zhijie Lin
Yang Zhao
Shanchuan Lin
...
Zuquan Song
Zhenheng Yang
Jiashi Feng
Jianchao Yang
Lu Jiang
DiffM
580
66
0
11 Apr 2025
TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Ruineng Li
Daitao Xing
Huiming Sun
Yuanzhou Ha
Jinglin Shen
C. Ho
DiffM
VGen
289
5
0
11 Apr 2025
Discriminator-Free Direct Preference Optimization for Video Diffusion
Haoran Cheng
Qide Dong
Liang Peng
Zhizhou Sha
Weiguo Feng
Jinghui Xie
Zhao Song
Shilei Wen
Xiaofei He
Boxi Wu
VGen
853
2
0
11 Apr 2025
Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
Zeren Jiang
Chuanxia Zheng
Iro Laina
Diane Larlus
Andrea Vedaldi
VGen
442
39
0
10 Apr 2025
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos
Rundong Luo
Matthew Wallingford
Ali Farhadi
Noah Snavely
Wei-Chiu Ma
VGen
425
7
0
10 Apr 2025
IGG: Image Generation Informed by Geodesic Dynamics in Deformation Spaces
Information Processing in Medical Imaging (IPMI), 2025
Nian Wu
Nivetha Jayakumar
Jiarui Xing
Miaomiao Zhang
352
1
0
09 Apr 2025
EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Diljeet Jagpal
Xi Chen
Vinay P. Namboodiri
DiffM
VGen
148
2
0
09 Apr 2025
Human Activity Recognition using RGB-Event based Sensors: A Multi-modal Heat Conduction Model and A Benchmark Dataset
Shiao Wang
Xinyu Wang
Bo Jiang
Lin Zhu
G. Li
Longji Xu
Yonghong Tian
Jin Tang
713
0
0
08 Apr 2025
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
Mengchao Wang
Qiang Wang
Fan Jiang
Yaqi Fan
Yunpeng Zhang
Yonggang Qi
Kun Zhao
Mu Xu
DiffM
VGen
216
46
0
07 Apr 2025
Multi-identity Human Image Animation with Structural Video Diffusion
Zhenzhi Wang
Yongqian Li
Yanhong Zeng
Yuwei Guo
Dahua Lin
Tianfan Xue
Bo Dai
VGen
267
5
0
05 Apr 2025
Can You Count to Nine? A Human Evaluation Benchmark for Counting Limits in Modern Text-to-Video Models
Xuyang Guo
Zekai Huang
Jiayan Huo
Yingyu Liang
Zhenmei Shi
Zhao Song
Jiahao Zhang
ALM
VGen
508
13
0
05 Apr 2025
Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets
Chuning Zhu
Raymond Yu
S. Feng
Benjamin Burchfiel
Paarth Shah
Abhishek Gupta
VGen
505
50
0
03 Apr 2025
MG-Gen: Single Image to Motion Graphics Generation
Takahiro Shirakawa
Tomoyuki Suzuki
Takuto Narumoto
Daichi Haraguchi
VGen
623
0
0
03 Apr 2025
Comprehensive Relighting: Generalizable and Consistent Monocular Human Relighting and Harmonization
Computer Vision and Pattern Recognition (CVPR), 2025
Jiadong Wang
Jingyuan Liu
Xin Sun
Krishna Kumar Singh
Zhixin Shu
...
Nanxuan Zhao
Tuanfeng Y. Wang
Simon Chen
Ulrich Neumann
Jae Shin Yoon
315
4
0
03 Apr 2025