Video Diffusion Models

Neural Information Processing Systems (NeurIPS), 2022
7 April 2022
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
    DiffM, VGen

Papers citing "Video Diffusion Models"

50 / 1,539 papers shown
LACONIC: A 3D Layout Adapter for Controllable Image Creation
Léopold Maillard
Tom Durand
Adrien Ramanana Rahary
Maks Ovsjanikov
DiffM
204
0
0
04 Jul 2025
Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation
François Rozet
Ruben Ohana
Michael McCabe
Gilles Louppe
F. Lanusse
S. Ho
DiffM
247
7
0
03 Jul 2025
DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation
Yue-Jiang Dong
Wang Zhao
Jiale Xu
Ying Shan
Song-Hai Zhang
DiffM, MDE
297
2
0
02 Jul 2025
Overcoming Dimensional Factorization Limits in Discrete Diffusion Models through Quantum Joint Distribution Learning
Chuangtao Chen
Qinglin Zhao
Mengchu Zhou
Dusit Niyato
Zhimin He
Haozhen Situ
DiffM
487
3
0
01 Jul 2025
A Survey: Learning Embodied Intelligence from Physical Simulators and World Models
Xiaoxiao Long
Qingrui Zhao
Kaiwen Zhang
Zihao Zhang
Dingrui Wang
...
Jia Pan
Qiu Shen
Ruigang Yang
X. Cao
Qionghai Dai
LM&Ro, AI4CE
303
22
0
01 Jul 2025
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models
Gaojie Lin
Jianwen Jiang
Jiaqi Yang
Zerong Zheng
Chao Liang
DiffM, VGen
1.3K
85
0
01 Jul 2025
TRACE: Temporally Reliable Anatomically-Conditioned 3D CT Generation with Enhanced Efficiency
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Minye Shao
Xingyu Miao
Haoran Duan
Zeyu Wang
Jingkun Chen
Yawen Huang
Xian Wu
Jingjing Deng
Yang Long
Yefeng Zheng
DiffM, MedIm
93
2
0
01 Jul 2025
Guided Unconditional and Conditional Generative Models for Super-Resolution and Inference of Quasi-Geostrophic Turbulence
Anantha N.S. Babu
Akhil Sadam
Pierre F.J. Lermusiaux
DiffM
158
2
0
01 Jul 2025
OmniVCus: Feedforward Subject-driven Video Customization with Multimodal Control Conditions
Yuanhao Cai
Chentao Song
Xi Chen
Jinbo Xing
Yiwei Hu
...
Tianyu Wang
Y. Zhang
Xiaokang Yang
Zhe Lin
Alan Yuille
DiffM, VGen
275
5
0
29 Jun 2025
StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation
Haodong Li
Chen Wang
Jiahui Lei
Kostas Daniilidis
Lingjie Liu
DiffM, VGen, MDE
247
3
0
25 Jun 2025
Ctrl-Z Sampling: Diffusion Sampling with Controlled Random Zigzag Explorations
Shunqi Mao
Wei Guo
Chaoyi Zhang
Jieting Long
Ke Xie
Weidong Cai
DiffM
331
1
0
25 Jun 2025
VS-Singer: Vision-Guided Stereo Singing Voice Synthesis with Consistency Schrödinger Bridge
Zijing Zhao
Kai Wang
Hao-Ming Huang
Ying Hu
Liang He
J. Yang
177
0
0
19 Jun 2025
Improving Rectified Flow with Boundary Conditions
Xixi Hu
Runlong Liao
Keyang Xu
B. Liu
Yeqing Li
Eugene Ie
Hongliang Fei
Qiang Liu
219
1
0
18 Jun 2025
Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
Anirud Aggarwal
Abhinav Shrivastava
M. Gwilliam
415
0
0
18 Jun 2025
ViSAGe: Video-to-Spatial Audio Generation
International Conference on Learning Representations (ICLR), 2025
Jaeyeon Kim
Heeseung Yun
Gunhee Kim
VGen
217
9
0
13 Jun 2025
Foundation Models in Autonomous Driving: A Survey on Scenario Generation and Scenario Analysis
Yuan Gao
Mattia Piccinini
Yuchen Zhang
Dingrui Wang
Korbinian Moller
...
Steven Peters
Andrea Stocco
Bassam Alrifaee
Marco Pavone
Johannes Betz
347
19
0
13 Jun 2025
Where and How to Perturb: On the Design of Perturbation Guidance in Diffusion and Flow Models
Donghoon Ahn
Jiwon Kang
Sanghyun Lee
Minjae Kim
Jaewon Min
Wooseok Jang
Saungwu Lee
Sayak Paul
S. Hong
Seungryong Kim
DiffM, AAML
471
0
0
12 Jun 2025
TARDIS STRIDE: A Spatio-Temporal Road Image Dataset and World Model for Autonomy
Héctor Carrión
Yutong Bai
Víctor A. Hernández Castro
Kishan Panaganti
Ayush Zenith
Matthew Trang
Tony Zhang
Pietro Perona
Jitendra Malik
VGen
280
0
0
12 Jun 2025
The Diffusion Duality
Subham S. Sahoo
Justin Deschenaux
Aaron Gokaslan
Guanghan Wang
Justin T Chiu
Volodymyr Kuleshov
DiffM
399
32
0
12 Jun 2025
Build the web for agents, not agents for the web
Xing Han Lù
Gaurav Kamath
Marius Mosbach
Siva Reddy
LLMAG, LM&Ro
346
2
0
12 Jun 2025
Geometric Regularity in Deterministic Sampling Dynamics of Diffusion-based Generative Models
Defang Chen
Zhenyu Zhou
C. Wang
Siwei Lyu
DiffM
330
1
0
11 Jun 2025
Context-aware TFL: A Universal Context-aware Contrastive Learning Framework for Temporal Forgery Localization
Qilin Yin
Wei Lu
Xiangyang Luo
Xiaochun Cao
208
1
0
10 Jun 2025
Bias Analysis in Unconditional Image Generative Models
Xiaofeng Zhang
Michelle Lin
Damien Scieur
Aaron Courville
Yash Goyal
189
0
0
10 Jun 2025
MagCache: Fast Video Generation with Magnitude-Aware Cache
Zehong Ma
Longhui Wei
Feng Wang
Shiliang Zhang
Q. Tian
262
9
0
10 Jun 2025
Enhancing Motion Dynamics of Image-to-Video Models via Adaptive Low-Pass Guidance
June Suk Choi
Kyungmin Lee
Sihyun Yu
Yisol Choi
Jinwoo Shin
Kimin Lee
DiffM, VGen
248
3
0
10 Jun 2025
FreeGave: 3D Physics Learning from Dynamic Videos by Gaussian Velocity
Computer Vision and Pattern Recognition (CVPR), 2025
Jinxi Li
Ziyang Song
Siyuan Zhou
Bo Yang
AI4CE
240
4
0
09 Jun 2025
Synthesize Privacy-Preserving High-Resolution Images via Private Textual Intermediaries
Haoxiang Wang
Zinan Lin
Da Yu
Huishuai Zhang
288
3
0
09 Jun 2025
Self-Cascaded Diffusion Models for Arbitrary-Scale Image Super-Resolution
Junseo Bang
Joonhee Lee
Kyeonghyun Lee
Haechang Lee
Dong un Kang
Se Young Chun
281
0
0
09 Jun 2025
Snap-and-tune: combining deep learning and test-time optimization for high-fidelity cardiovascular volumetric meshing
Daniel H. Pak
Shubh Thaker
Kyle Baylous
Xiaoran Zhang
Danny Bluestein
James S. Duncan
AI4CE
229
9
0
09 Jun 2025
Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces
Kevin Rojas
Yuchen Zhu
Sichen Zhu
Felix X.-F. Ye
Molei Tao
DiffM
263
11
0
09 Jun 2025
EgoM2P: Egocentric Multimodal Multitask Pretraining
Gen Li
Yutong Chen
Yiqian Wu
Kaifeng Zhao
Marc Pollefeys
Siyu Tang
EgoV, VLM
412
4
0
09 Jun 2025
FADE: Frequency-Aware Diffusion Model Factorization for Video Editing
Computer Vision and Pattern Recognition (CVPR), 2025
Yixuan Zhu
Haolin Wang
Shilin Ma
Wenliang Zhao
Yansong Tang
Lei Chen
Jie Zhou
DiffM, VGen
452
2
0
06 Jun 2025
RNE: plug-and-play diffusion inference-time control and energy-based training
Jiajun He
Jose Miguel Hernandez-Lobato
Yuanqi Du
Francisco Vargas
466
4
0
06 Jun 2025
FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing
Guangzhao Li
Yanming Yang
Chenxi Song
Chi Zhang
DiffM, VGen
276
6
0
05 Jun 2025
Video World Models with Long-term Spatial Memory
Tong Wu
Shuai Yang
Ryan Po
Yinghao Xu
Ziwei Liu
Dahua Lin
Gordon Wetzstein
VGen, KELM, VLM
327
41
0
05 Jun 2025
LumosFlow: Motion-Guided Long Video Generation
Jiahao Chen
Hangjie Yuan
Yichen Qian
Jingyun Liang
Jiazheng Xing
Pengwei Liu
Weihua Chen
Fan Wang
Bing Su
VGen
269
1
0
03 Jun 2025
Dual-Expert Consistency Model for Efficient and High-Quality Video Generation
Zhengyao Lv
Chenyang Si
Tianlin Pan
Zhaoxi Chen
Kwan-Yee K. Wong
Yu Qiao
Ziwei Liu
DiffM, VGen
352
4
0
03 Jun 2025
HaploOmni: Unified Single Transformer for Multimodal Video Understanding and Generation
Yicheng Xiao
Lin Song
Rui Yang
Cheng Cheng
Zunnan Xu
Zhaoyang Zhang
Yixiao Ge
Xiu Li
Mingyu Ding
237
6
0
03 Jun 2025
OmniV2V: Versatile Video Generation and Editing via Dynamic Content Manipulation
Sen Liang
Zhentao Yu
Zhengguang Zhou
Teng Hu
Hongmei Wang
...
Qin Lin
Yuan Zhou
Xin Li
Qinglin Lu
Zhibo Chen
DiffM, VGen, SyDa
275
6
0
02 Jun 2025
Diff2Flow: Training Flow Matching Models via Diffusion Model Alignment
Computer Vision and Pattern Recognition (CVPR), 2025
Johannes Schusterbauer
Ming Gui
Frank Fundel
Bjorn Ommer
215
10
0
02 Jun 2025
G4Seg: Generation for Inexact Segmentation Refinement with Diffusion Models
Tianjiao Zhang
Fei Zhang
Jiangchao Yao
Ya Zhang
Yanfeng Wang
DiffM
341
4
0
02 Jun 2025
DiffuseSlide: Training-Free High Frame Rate Video Generation Diffusion
Geunmin Hwang
Hyun-kyu Ko
Younghyun Kim
S. W. Lee
Eunbyung Park
VGen
229
1
0
02 Jun 2025
MoCA-Video: Motion-Aware Concept Alignment for Consistent Video Editing
Tong Zhang
Juan Carlos León Alcázar
Bernard Ghanem
DiffM, VGen
410
3
0
01 Jun 2025
Using Diffusion Ensembles to Estimate Uncertainty for End-to-End Autonomous Driving
Florian Wintel
Sigmund H. Høeg
G. Kiss
Frank Lindseth
196
2
0
31 May 2025
Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction
Chenyou Fan
Fangzheng Yan
Fuchun Sun
Jiepeng Wang
Fangqiu Yi
Zhen Wang
Xuelong Li
VGen
1.1K
2
0
30 May 2025
Interactive Video Generation via Domain Adaptation
Ishaan Rawal
Suryansh Kumar
DiffM, VGen
163
0
0
30 May 2025
Inference-Time Alignment of Diffusion Models via Evolutionary Algorithms
Purvish Jajal
Nick Eliopoulos
Benjamin Shiue-Hal Chou
George K. Thiruvathukal
James C. Davis
Yung-Hsiang Lu
187
1
0
30 May 2025
STORK: Faster Diffusion And Flow Matching Sampling By Resolving Both Stiffness And Structure-Dependence
Zheng Tan
Weizhen Wang
Andrea L. Bertozzi
Ernest K. Ryu
DiffM
184
2
0
30 May 2025
GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion
Gwanghyun Kim
Xueting Li
Ye Yuan
Koki Nagano
Tianye Li
Jan Kautz
Se Young Chun
Umar Iqbal
DiffM
207
0
0
29 May 2025
MOVi: Training-free Text-conditioned Multi-Object Video Generation
Aimon Rahman
Jiang Liu
Ze Wang
Ximeng Sun
Jialian Wu
Xiaodong Yu
Yusheng Su
Vishal M. Patel
Zicheng Liu
Emad Barsoum
DiffM, VGen
275
0
0
29 May 2025