Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.08818
Cited By
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
18 April 2023
A. Blattmann
Robin Rombach
Huan Ling
Tim Dockhorn
Seung Wook Kim
Sanja Fidler
Karsten Kreis
3DGS
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models"
50 / 827 papers shown
Title
Improved DDIM Sampling with Moment Matching Gaussian Mixtures
Prasad Gabbur
DiffM
18
1
0
08 Nov 2023
3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features
Chenfeng Xu
Huan Ling
Sanja Fidler
Or Litany
8
14
0
07 Nov 2023
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
Shiwei Zhang
Jiayu Wang
Yingya Zhang
Kang Zhao
Hangjie Yuan
Z. Qin
Xiang Wang
Deli Zhao
Jingren Zhou
DiffM
VGen
26
198
0
07 Nov 2023
Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video
Yanqin Jiang
Li Zhang
Jin Gao
Weiming Hu
Yao Yao
VGen
27
57
0
06 Nov 2023
POS: A Prompts Optimization Suite for Augmenting Text-to-Video Generation
Shijie Ma
Huayi Xu
Mengjian Li
Weidong Geng
Yaxiong Wang
Meng Wang
DiffM
VGen
11
0
0
02 Nov 2023
Consistent Video-to-Video Transfer Using Synthetic Dataset
Jiaxin Cheng
Tianjun Xiao
Tong He
VGen
DiffM
28
14
0
01 Nov 2023
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
Xinyuan Chen
Yaohui Wang
Lingjun Zhang
Shaobin Zhuang
Xin Ma
Jiashuo Yu
Yali Wang
Dahua Lin
Yu Qiao
Ziwei Liu
VGen
DiffM
10
126
0
31 Oct 2023
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Haoxin Chen
Menghan Xia
Yin-Yin He
Yong Zhang
Xiaodong Cun
...
Yaofang Liu
Qifeng Chen
Xintao Wang
Chao-Liang Weng
Ying Shan
DiffM
21
277
0
30 Oct 2023
One Style is All you Need to Generate a Video
Sandeep Manandhar
Auguste Genovesio
VGen
19
0
0
27 Oct 2023
CVPR 2023 Text Guided Video Editing Competition
Jay Zhangjie Wu
Xiuyu Li
Difei Gao
Zhen Dong
Jinbin Bai
...
Xu Cheng
Jie Tang
Mike Zheng Shou
Kurt Keutzer
Forrest N. Iandola
28
34
0
24 Oct 2023
FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling
Haonan Qiu
Menghan Xia
Yong Zhang
Yin-Yin He
Xintao Wang
Ying Shan
Ziwei Liu
DiffM
VGen
17
88
0
23 Oct 2023
ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection
Zhongzhan Huang
Pan Zhou
Shuicheng Yan
Liang Lin
16
26
0
20 Oct 2023
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Jinbo Xing
Menghan Xia
Yong Zhang
Haoxin Chen
Wangbo Yu
Hanyuan Liu
Xintao Wang
Tien-Tsin Wong
Ying Shan
VGen
28
199
0
18 Oct 2023
EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
Yaofang Liu
Xiaodong Cun
Xuebo Liu
Xintao Wang
Yong Zhang
Haoxin Chen
Yang Liu
Tieyong Zeng
Raymond H. F. Chan
Ying Shan
VGen
EGVM
11
127
0
17 Oct 2023
LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation
Ruiqi Wu
Liangyu Chen
Tong Yang
Chunle Guo
Chongyi Li
Xiangyu Zhang
DiffM
VGen
86
52
0
16 Oct 2023
A Survey on Video Diffusion Models
Zhen Xing
Qijun Feng
Haoran Chen
Qi Dai
Hang-Rui Hu
Hang Xu
Zuxuan Wu
Yu-Gang Jiang
EGVM
VGen
55
115
0
16 Oct 2023
DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing
Jia-Wei Liu
Yan-Pei Cao
Jay Zhangjie Wu
Weijia Mao
Yuchao Gu
Rui Zhao
Jussi Keppo
Ying Shan
Mike Zheng Shou
VGen
DiffM
30
14
0
16 Oct 2023
ConsistNet: Enforcing 3D Consistency for Multi-view Images Diffusion
Jiayu Yang
Ziang Cheng
Yunfei Duan
Pan Ji
Hongdong Li
DiffM
34
53
0
16 Oct 2023
Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space
Hengrui Zhang
Jiani Zhang
Balasubramaniam Srinivasan
Zhengyuan Shen
Xiao Qin
Christos Faloutsos
Huzefa Rangwala
George Karypis
DiffM
25
80
0
14 Oct 2023
Compositional Abilities Emerge Multiplicatively: Exploring Diffusion Models on a Synthetic Task
Maya Okawa
Ekdeep Singh Lubana
Robert P. Dick
Hidenori Tanaka
CoGe
DiffM
35
43
0
13 Oct 2023
MotionDirector: Motion Customization of Text-to-Video Diffusion Models
Rui Zhao
Yuchao Gu
Jay Zhangjie Wu
David Junhao Zhang
Jia-Wei Liu
Weijia Wu
Jussi Keppo
Mike Zheng Shou
DiffM
VGen
20
103
0
12 Oct 2023
DrivingDiffusion: Layout-Guided multi-view driving scene video generation with latent diffusion model
Xiaofan Li
Yifu Zhang
Xiaoqing Ye
VGen
65
71
0
11 Oct 2023
ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation
Bo Peng
Xinyuan Chen
Yaohui Wang
Chaochao Lu
Yu Qiao
DiffM
VGen
14
7
0
11 Oct 2023
State of the Art on Diffusion Models for Visual Computing
Ryan Po
Wang Yifan
Vladislav Golyanik
Kfir Aberman
Jonathan T. Barron
...
Matthias Nießner
Bjorn Ommer
Christian Theobalt
Peter Wonka
Gordon Wetzstein
22
101
0
11 Oct 2023
Mitigating stereotypical biases in text to image generative systems
Piero Esposito
Parmida Atighehchian
Anastasis Germanidis
Deepti Ghadiyaram
25
16
0
10 Oct 2023
Learning Interactive Real-World Simulators
Mengjiao Yang
Yilun Du
Kamyar Ghasemipour
Jonathan Tompson
Leslie Kaelbling
Dale Schuurmans
Pieter Abbeel
LM&Ro
PINN
16
174
0
09 Oct 2023
FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Yuren Cong
Mengmeng Xu
Christian Simon
Shoufa Chen
Jiawei Ren
Yanping Xie
Juan-Manuel Perez-Rua
Bodo Rosenhahn
Tao Xiang
Sen He
DiffM
VGen
17
74
0
09 Oct 2023
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Lijun Yu
José Lezama
N. B. Gundavarapu
Luca Versari
Kihyuk Sohn
...
Boqing Gong
Ming-Hsuan Yang
Irfan Essa
David A. Ross
Lu Jiang
10
278
0
09 Oct 2023
Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion
Anton Razzhigaev
Arseniy Shakhmatov
Anastasia Maltseva
V.Ya. Arkhipkin
Igor Pavlov
Ilya Ryabov
Angelina Kuts
Alexander Panchenko
Andrey Kuznetsov
Denis Dimitrov
43
78
0
05 Oct 2023
Realistic Speech-to-Face Generation with Speech-Conditioned Latent Diffusion Model with Face Prior
Jinting Wang
Li Liu
Jun Wang
Hei Victor Cheng
DiffM
15
2
0
05 Oct 2023
Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day
Yifan Jiang
Hao Tang
Jen-Hao Rick Chang
Liangchen Song
Zhangyang Wang
Liangliang Cao
DiffM
20
10
0
04 Oct 2023
FashionFlow: Leveraging Diffusion Models for Dynamic Fashion Video Synthesis from Static Imagery
Tasin Islam
A. Miron
Xiaohui Liu
Yongmin Li
DiffM
26
2
0
29 Sep 2023
LLM-grounded Video Diffusion Models
Long Lian
Baifeng Shi
Semih Yavuz
Ye Liu
Boyi Li
DiffM
17
54
0
29 Sep 2023
GAIA-1: A Generative World Model for Autonomous Driving
Masane Fuchi
Lloyd Russell
Hudson Yeo
Zak Murez
Hiroto Minami
Alex Kendall
Tomohiro Takagi
Gianluca Corrado
VGen
16
215
0
29 Sep 2023
CCEdit: Creative and Controllable Video Editing via Diffusion Models
Danfeng Hong
Wenming Weng
Hao Li
Yuhui Yuan
Jing Yao
Chong Luo
Zhibo Chen
Baining Guo
DiffM
VGen
14
42
0
28 Sep 2023
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
David Junhao Zhang
Jay Zhangjie Wu
Jia-Wei Liu
Rui Zhao
L. Ran
Yuchao Gu
Difei Gao
Mike Zheng Shou
DiffM
VGen
25
214
0
27 Sep 2023
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models
Yaohui Wang
Xinyuan Chen
Xin Ma
Shangchen Zhou
Ziqi Huang
...
Chen Change Loy
Bo Dai
Dahua Lin
Yu Qiao
Ziwei Liu
VGen
DiffM
26
220
0
26 Sep 2023
VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning
Han Lin
Abhaysinh Zala
Jaemin Cho
Mohit Bansal
LM&Ro
VGen
DiffM
37
74
0
26 Sep 2023
Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator
Hanzhuo Huang
Yufan Feng
Cheng Shi
Lan Xu
Jingyi Yu
Sibei Yang
DiffM
VGen
21
64
0
25 Sep 2023
NetDiffus: Network Traffic Generation by Diffusion Models through Time-Series Imaging
Nirhoshan Sivaroopan
Dumindu Bandara
Chamara Madarasingha
Guilluame Jourjon
A. Jayasumana
Kanchana Thilakarathna
DiffM
18
17
0
23 Sep 2023
GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER
Mingzhen Sun
Weining Wang
Zihan Qin
Jiahui Sun
Si-Qing Chen
J. Liu
DiffM
14
3
0
23 Sep 2023
PIE: Simulating Disease Progression via Progressive Image Editing
Kaizhao Liang
Xu Cao
Kuei-Da Liao
Tianren Gao
Wenqian Ye
Zhengyu Chen
Jianguo Cao
Tejas Nama
Jimeng Sun
MedIm
AI4CE
14
5
0
21 Sep 2023
FreeU: Free Lunch in Diffusion U-Net
Chenyang Si
Ziqi Huang
Yuming Jiang
Ziwei Liu
DiffM
33
128
0
20 Sep 2023
Language-Conditioned Affordance-Pose Detection in 3D Point Clouds
Toan Tien Nguyen
Minh Nhat Vu
Baoru Huang
Tuan V. Vo
Vy Truong
Ngan Le
T. Vo
Bac Le
Anh Nguyen
DiffM
28
16
0
19 Sep 2023
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
Xiaofeng Wang
Zheng Hua Zhu
Guan Huang
Xinze Chen
Jiagang Zhu
Jiwen Lu
VGen
16
148
0
18 Sep 2023
Generative Image Dynamics
Zhengqi Li
Richard Tucker
Noah Snavely
Aleksander Holynski
DiffM
29
63
0
14 Sep 2023
Revisiting Energy Based Models as Policies: Ranking Noise Contrastive Estimation and Interpolating Energy Models
Sumeet Singh
Stephen Tu
Vikas Sindhwani
DiffM
15
8
0
11 Sep 2023
SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models
Shuchen Xue
Mingyang Yi
Weijian Luo
Shifeng Zhang
Jiacheng Sun
Z. Li
Zhi-Ming Ma
DiffM
27
48
0
10 Sep 2023
Create Your World: Lifelong Text-to-Image Diffusion
Gan Sun
Wenqi Liang
Jiahua Dong
Jun Li
Zhengming Ding
Yang Cong
DiffM
VLM
22
28
0
08 Sep 2023
Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Jiaxi Gu
Shicong Wang
Haoyu Zhao
Tianyi Lu
Xing Zhang
Zuxuan Wu
Songcen Xu
Wei Zhang
Yu-Gang Jiang
Hang Xu
DiffM
VGen
34
43
0
07 Sep 2023
Previous
1
2
3
...
14
15
16
17
Next