ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.15127
  4. Cited By
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large
  Datasets

Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets

25 November 2023
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
Dominik Lorenz
Yam Levi
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
    VGen
ArXiv (abs)PDFHTMLHuggingFace (13 upvotes)Github (25943★)

Papers citing "Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets"

50 / 967 papers shown
Title
EgoM2P: Egocentric Multimodal Multitask Pretraining
EgoM2P: Egocentric Multimodal Multitask Pretraining
Gen Li
Yutong Chen
Yiqian Wu
Kaifeng Zhao
Marc Pollefeys
Siyu Tang
EgoVVLM
367
4
0
09 Jun 2025
Audio-Sync Video Generation with Multi-Stream Temporal Control
Audio-Sync Video Generation with Multi-Stream Temporal Control
Shuchen Weng
Haojie Zheng
Zheng Chang
Si Li
Boxin Shi
Xinlong Wang
DiffMVGen
181
4
0
09 Jun 2025
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement
Teng Hu
Zhentao Yu
Zhengguang Zhou
Jiangning Zhang
Yuan Zhou
Qinglin Lu
Ran Yi
VGen
192
4
0
09 Jun 2025
NOVA3D: Normal Aligned Video Diffusion Model for Single Image to 3D Generation
NOVA3D: Normal Aligned Video Diffusion Model for Single Image to 3D Generation
Yuxiao Yang
Peihao Li
Yuhong Zhang
Junzhe Lu
Xianglong He
Minghan Qin
Weitao Wang
Haoqian Wang
DiffMVGen
199
0
0
09 Jun 2025
Consistent Video Editing as Flow-Driven Image-to-Video Generation
Consistent Video Editing as Flow-Driven Image-to-Video Generation
Ge Wang
Songlin Fan
Hangxu Liu
Quanjian Song
Hewei Wang
Jinfeng Xu
DiffMVGen
221
4
0
09 Jun 2025
Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models
Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models
Sangwon Jang
Taekyung Ki
Jaehyeong Jo
Jaehong Yoon
Soo Ye Kim
Zhe Lin
Sung Ju Hwang
DiffMVGen
159
1
0
08 Jun 2025
Self-Adapting Improvement Loops for Robotic Learning
Self-Adapting Improvement Loops for Robotic Learning
Calvin Luo
Zilai Zeng
Mingxi Jia
Yilun Du
Chen Sun
131
1
0
07 Jun 2025
Identity Deepfake Threats to Biometric Authentication Systems: Public and Expert Perspectives
Identity Deepfake Threats to Biometric Authentication Systems: Public and Expert Perspectives
Shijing He
Yaxiong Lei
Zihan Zhang
Yuzhou Sun
S. Li
Chi Zhang
Juan Ye
166
2
0
07 Jun 2025
FADE: Frequency-Aware Diffusion Model Factorization for Video Editing
FADE: Frequency-Aware Diffusion Model Factorization for Video EditingComputer Vision and Pattern Recognition (CVPR), 2025
Yixuan Zhu
Haolin Wang
Shilin Ma
Wenliang Zhao
Yansong Tang
Lei Chen
Jie Zhou
DiffMVGen
408
2
0
06 Jun 2025
Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision
Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision
Yuping He
Yifei Huang
Guo Chen
Lidong Lu
Baoqi Pei
Jilan Xu
Tong Lu
Yoichi Sato
EgoV
357
2
0
06 Jun 2025
Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free Simulation
Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free SimulationComputer Vision and Pattern Recognition (CVPR), 2025
Chuhao Chen
Bushi Liu
Chen Wang
Yiming Huang
Anjun Chen
Qiao Feng
Jiatao Gu
Lingjie Liu
3DHVGenAI4CE
149
5
0
06 Jun 2025
Noise Consistency Regularization for Improved Subject-Driven Image Synthesis
Noise Consistency Regularization for Improved Subject-Driven Image Synthesis
Yao Ni
Song Wen
Piotr Koniusz
A. Cherian
180
1
0
06 Jun 2025
LLIA -- Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models
LLIA -- Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models
Haojie Yu
Zhaonian Wang
Yihan Pan
Meng Cheng
Hao Yang
Chao Wang
Tao Xie
Xiaoming Xu
Xiaoming Wei
Xunliang Cai
VGen
232
2
0
06 Jun 2025
Restereo: Diffusion stereo video generation and restoration
Restereo: Diffusion stereo video generation and restoration
Xingchang Huang
Ashish Kumar Singh
Florian Dubost
C. N. Vasconcelos
Sakar Khattar
Liang Shi
Christian Theobalt
Steven Chacko
Gurprit Singh
DiffMVGen
207
3
0
06 Jun 2025
FEAT: Full-Dimensional Efficient Attention Transformer for Medical Video GenerationInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Huihan Wang
Zhiwen Yang
Hui Zhang
Dan Zhao
Bingzheng Wei
Yan Xu
MedImViT
239
0
0
05 Jun 2025
Video World Models with Long-term Spatial Memory
Tong Wu
Shuai Yang
Ryan Po
Yinghao Xu
Ziwei Liu
Dahua Lin
Gordon Wetzstein
VGenKELMVLM
304
30
0
05 Jun 2025
EX-4D: EXtreme Viewpoint 4D Video Synthesis via Depth Watertight Mesh
EX-4D: EXtreme Viewpoint 4D Video Synthesis via Depth Watertight Mesh
Tao Hu
Haoyang Peng
Xiao Liu
Yuewen Ma
VGenMDE
158
8
0
05 Jun 2025
CamCloneMaster: Enabling Reference-based Camera Control for Video Generation
CamCloneMaster: Enabling Reference-based Camera Control for Video Generation
Yawen Luo
J. Bai
Xiaoyu Shi
Menghan Xia
Xintao Wang
Pengfei Wan
Di Zhang
Kun Gai
Tianfan Xue
DiffMVGen
182
7
0
03 Jun 2025
LumosFlow: Motion-Guided Long Video Generation
LumosFlow: Motion-Guided Long Video Generation
Jiahao Chen
Hangjie Yuan
Yichen Qian
Jingyun Liang
Jiazheng Xing
Pengwei Liu
Weihua Chen
Fan Wang
Bing Su
VGen
222
1
0
03 Jun 2025
Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences
Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences
Yunhong Lu
Qichao Wang
H. Cao
Xiaoyin Xu
Min Zhang
314
5
0
03 Jun 2025
Generative Perception of Shape and Material from Differential Motion
Generative Perception of Shape and Material from Differential Motion
Xinran Nicole Han
Ko Nishino
T. Zickler
DiffMVGen
341
0
0
03 Jun 2025
NTIRE 2025 XGC Quality Assessment Challenge: Methods and Results
NTIRE 2025 XGC Quality Assessment Challenge: Methods and Results
Xiaohong Liu
Xiongkuo Min
Qiang Hu
X. Zhang
Jie Guo
...
Lihuo He
Jia-Wei Liu
Yuting Xing
Tida Fang
Yuchun Jin
DiffM
150
23
0
03 Jun 2025
Controllable Human-centric Keyframe Interpolation with Generative Prior
Controllable Human-centric Keyframe Interpolation with Generative Prior
Z. Guo
Size Wu
Zhongang Cai
Wei Li
Chen Change Loy
DiffMVGen
171
1
0
03 Jun 2025
SViMo: Synchronized Diffusion for Video and Motion Generation in Hand-object Interaction Scenarios
SViMo: Synchronized Diffusion for Video and Motion Generation in Hand-object Interaction Scenarios
Lingwei Dang
Ruizhi Shao
Hongwen Zhang
Wei Min
Zichen Liu
Qingyao Wu
DiffMVGen
367
3
0
03 Jun 2025
OmniV2V: Versatile Video Generation and Editing via Dynamic Content Manipulation
OmniV2V: Versatile Video Generation and Editing via Dynamic Content Manipulation
Sen Liang
Zhentao Yu
Zhengguang Zhou
Teng Hu
Hongmei Wang
...
Qin Lin
Yuan Zhou
Xin Li
Qinglin Lu
Zhibo Chen
DiffMVGenSyDa
222
6
0
02 Jun 2025
WorldExplorer: Towards Generating Fully Navigable 3D Scenes
WorldExplorer: Towards Generating Fully Navigable 3D Scenes
Manuel-Andreas Schneider
Lukas Höllein
Matthias Nießner
VGen
217
6
0
02 Jun 2025
DiffuseSlide: Training-Free High Frame Rate Video Generation Diffusion
DiffuseSlide: Training-Free High Frame Rate Video Generation Diffusion
Geunmin Hwang
Hyun-kyu Ko
Younghyun Kim
S. W. Lee
Eunbyung Park
VGen
194
0
0
02 Jun 2025
Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks
Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks
Tao Yang
Ruibin Li
Yangming Shi
Yuqi Zhang
Qide Dong
Haoran Cheng
Weiguo Feng
Shilei Wen
Bingyue Peng
Lei Zhang
DiffMVGen
256
0
0
02 Jun 2025
Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning
Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning
Yijun Yang
Zhao-Yang Wang
Qiuping Liu
Shuwen Sun
Kang Wang
...
Zongwei Zhou
Alan Yuille
Lei Zhu
Yu Zhang
Jieneng Chen
163
10
0
02 Jun 2025
SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video Diffusion Transformers
SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video Diffusion Transformers
Zhengcong Fei
Hao Jiang
Di Qiu
Baoxuan Gu
Youqiang Zhang
...
Jialin Bai
Debang Li
Mingyuan Fan
Guibin Chen
Yahui Zhou
DiffMVGen
208
6
0
01 Jun 2025
IVY-FAKE: A Unified Explainable Framework and Benchmark for Image and Video AIGC Detection
IVY-FAKE: A Unified Explainable Framework and Benchmark for Image and Video AIGC Detection
Wayne Zhang
Changjiang Jiang
Zhonghao Zhang
Chenyang Si
Fengchang Yu
...
Xinbin Yuan
Yifei Bi
Ming Zhao
Zian Zhou
Caifeng Shan
311
8
0
01 Jun 2025
PromptVFX: Text-Driven Fields for Open-World 3D Gaussian Animation
PromptVFX: Text-Driven Fields for Open-World 3D Gaussian Animation
Mert Kiray
Paul Uhlenbruck
Nassir Navab
Benjamin Busam
VGen3DGSAI4CE
182
1
0
01 Jun 2025
Temporal In-Context Fine-Tuning with Temporal Reasoning for Versatile Control of Video Diffusion Models
Temporal In-Context Fine-Tuning with Temporal Reasoning for Versatile Control of Video Diffusion Models
Kinam Kim
J. Hyung
Jaegul Choo
DiffMVGen
322
3
0
01 Jun 2025
Video Signature: Implicit Watermarking for Video Diffusion Models
Video Signature: Implicit Watermarking for Video Diffusion Models
Yu Huang
Junhao Chen
Shuliang Liu
Hanqian Li
Qi Zheng
Yi R.
Fung
Yi R. Fung
Xuming Hu
DiffMWIGMVGen
403
1
0
31 May 2025
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
Yang-tian Sun
Xin Yu
Zehuan Huang
Yi-Hua Huang
Yuan-Chen Guo
Ziyi Yang
Yan-Pei Cao
Xiaojuan Qi
DiffMVGenMDE
198
4
0
30 May 2025
MUSE: Model-Agnostic Tabular Watermarking via Multi-Sample Selection
MUSE: Model-Agnostic Tabular Watermarking via Multi-Sample Selection
Liancheng Fang
Aiwei Liu
Henry Peng Zou
Yankai Chen
Hengrui Zhang
Zhongfen Deng
Philip S. Yu
214
2
0
30 May 2025
DreamDance: Animating Character Art via Inpainting Stable Gaussian Worlds
DreamDance: Animating Character Art via Inpainting Stable Gaussian Worlds
Jiaxu Zhang
Xianfang Zeng
Xin Chen
W. Zuo
Gang Yu
Guosheng Lin
Zhigang Tu
DiffM3DGSVGen
169
0
0
30 May 2025
Generating Fit Check Videos with a Handheld Camera
Generating Fit Check Videos with a Handheld Camera
B. Chen
Brian L. Curless
Ira Kemelmacher-Shlizerman
Steven M. Seitz
DiffM
181
0
0
29 May 2025
Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing
Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing
Tongtong Su
Chengyu Wang
Yanjie Liang
Dongming Lu
DiffMVGen
176
1
0
29 May 2025
Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization
Hallo4: High-Fidelity Dynamic Portrait Animation via Direct Preference Optimization
Jiahao Cui
Yan Chen
Mingwang Xu
Hanlin Shang
Yuxuan Chen
Yun Zhan
Zilong Dong
Yao Yao
Jingdong Wang
Siyu Zhu
DiffMVGen
480
8
0
29 May 2025
EquiReg: Equivariance Regularized Diffusion for Inverse Problems
EquiReg: Equivariance Regularized Diffusion for Inverse Problems
Bahareh Tolooshams
Aditi Chandrashekar
Rayhan Zirvi
Abbas Mammadov
Jiachen Yao
Chuwei Wang
Julius Berner
DiffM
202
2
0
29 May 2025
MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement
MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement
Yufan Deng
Xun Guo
Yuanyang Yin
Yizhi Wang
Yiding Yang
...
Shenghai Yuan
Angtian Wang
Bo Liu
Haibin Huang
Chongyang Ma
DiffMVGenVOS
246
4
0
29 May 2025
MOVi: Training-free Text-conditioned Multi-Object Video Generation
MOVi: Training-free Text-conditioned Multi-Object Video Generation
Aimon Rahman
Jiang Liu
Ze Wang
Ximeng Sun
Jialian Wu
Xiaodong Yu
Yusheng Su
Vishal M. Patel
Zicheng Liu
Emad Barsoum
DiffMVGen
257
0
0
29 May 2025
RoboTransfer: Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer
RoboTransfer: Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer
Liu Liu
Xiaofeng Wang
Guosheng Zhao
Keyu Li
Wenkang Qin
Jiaxiong Qiu
Zheng Hua Zhu
Guan Huang
Zhizhong Su
VGen
260
10
0
29 May 2025
GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion
GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion
Gwanghyun Kim
Xueting Li
Ye Yuan
Koki Nagano
Tianye Li
Jan Kautz
Se Young Chun
Umar Iqbal
DiffM
186
0
0
29 May 2025
PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms
PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms
Yifei Xia
Shuchen Weng
Siqi Yang
Jingqi Liu
Chengxuan Zhu
Minggui Teng
Zijian Jia
Han Jiang
Boxin Shi
DiffMVGen
264
4
0
28 May 2025
GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action Control
GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action Control
Anthony Chen
Wenzhao Zheng
Yida Wang
Xueyang Zhang
Kun Zhan
Fu Liu
Kurt Keutzer
Shanghang Zhang
337
7
0
28 May 2025
ATI: Any Trajectory Instruction for Controllable Video Generation
ATI: Any Trajectory Instruction for Controllable Video Generation
Angtian Wang
Haibin Huang
Yizhi Wang
Yiding Yang
Chongyang Ma
DiffMVGen
311
10
0
28 May 2025
EF-VI: Enhancing End-Frame Injection for Video Inbetweening
EF-VI: Enhancing End-Frame Injection for Video Inbetweening
Liuhan Chen
Xiaodong Cun
Xiaoyu Li
Xianyi He
Shenghai Yuan
Jie Chen
Mingyu Ding
Lichao Sun
VGen
243
0
0
27 May 2025
Advancing high-fidelity 3D and Texture Generation with 2.5D latents
Advancing high-fidelity 3D and Texture Generation with 2.5D latents
Xin Yang
Jiantao Lin
Yingjie Xu
Haodong Li
Yingcong Chen
3DV
242
3
0
27 May 2025
Previous
123...789...181920
Next