Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.08818
Cited By
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
18 April 2023
A. Blattmann
Robin Rombach
Huan Ling
Tim Dockhorn
Seung Wook Kim
Sanja Fidler
Karsten Kreis
3DGS
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models"
50 / 827 papers shown
Title
M2Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes
Sixu Yan
Zeyu Zhang
Muzhi Han
Zaijin Wang
Qi Xie
Zhitian Li
Zhehan Li
Hangxin Liu
Xinggang Wang
Song-Chun Zhu
52
4
0
15 Oct 2024
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Ziyue Li
Tianyi Zhou
MoE
66
16
0
14 Oct 2024
Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learning
Etai Littwin
Vimal Thilak
Anand Gopalakrishnan
37
8
0
14 Oct 2024
VideoAgent: Self-Improving Video Generation
Achint Soni
Sreyas Venkataraman
Abhranil Chandra
Sebastian Fischmeister
Percy Liang
Bo Dai
Sherry Yang
LM&Ro
VGen
50
7
0
14 Oct 2024
Asymptotic Analysis of Sample-averaged Q-learning
Saunak Kumar Panda
Ruiqi Liu
Yisha Xiang
OnRL
52
8
0
14 Oct 2024
Generating Intermediate Representations for Compositional Text-To-Image Generation
Ran Galun
Sagie Benaim
23
0
0
13 Oct 2024
Losing dimensions: Geometric memorization in generative diffusion
Beatrice Achilli
Enrico Ventura
Gianluigi Silvestri
Bao Pham
G. Raya
Dmitry Krotov
Carlo Lucibello
L. Ambrogioni
40
4
0
11 Oct 2024
E-Motion: Future Motion Simulation via Event Sequence Diffusion
Song Wu
Zhiyu Zhu
Junhui Hou
Guangming Shi
Jinjian Wu
DiffM
VGen
35
0
0
11 Oct 2024
Progressive Autoregressive Video Diffusion Models
Desai Xie
Zhan Xu
Yicong Hong
Hao Tan
Difan Liu
Feng Liu
Arie E. Kaufman
Yang Zhou
VGen
DiffM
56
10
0
10 Oct 2024
HARIVO: Harnessing Text-to-Image Models for Video Generation
Mingi Kwon
Seoung Wug Oh
Yang Zhou
Difan Liu
Joon-Young Lee
Haoran Cai
Baqiao Liu
Feng Liu
Youngjung Uh
VGen
38
1
0
10 Oct 2024
MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting
Ruijie Zhu
Yanzhe Liang
Hanzhi Chang
Jiacheng Deng
Jiahao Lu
Wenfei Yang
Tianzhu Zhang
Yongdong Zhang
3DGS
23
8
0
10 Oct 2024
MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion
Onkar Susladkar
Jishu Sen Gupta
Chirag Sehgal
Sparsh Mittal
Rekha Singhal
DiffM
VGen
33
0
0
10 Oct 2024
Diversity-Rewarded CFG Distillation
Geoffrey Cideron
A. Agostinelli
Johan Ferret
Sertan Girgin
Romuald Elie
Olivier Bachem
Sarah Perrin
Alexandre Ramé
34
2
0
08 Oct 2024
Manifolds, Random Matrices and Spectral Gaps: The geometric phases of generative diffusion
Enrico Ventura
Beatrice Achilli
Gianluigi Silvestri
Carlo Lucibello
L. Ambrogioni
DiffM
30
5
0
08 Oct 2024
T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design
Jiachen Li
Qian Long
Jian Zheng
Xiaofeng Gao
Robinson Piramuthu
Wenhu Chen
William Yang Wang
VGen
25
22
0
08 Oct 2024
Pyramidal Flow Matching for Efficient Video Generative Modeling
Yang Jin
Zhicheng Sun
Ningyuan Li
Kun Xu
K. Xu
...
Nan Zhuang
Quzhe Huang
Yang Song
Yadong Mu
Zhouchen Lin
VGen
66
64
0
08 Oct 2024
ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way
Jiazi Bu
Pengyang Ling
Pan Zhang
Tong Wu
Xiaoyi Dong
Yuhang Zang
Yuhang Cao
Dahua Lin
Jiaqi Wang
DiffM
VGen
28
0
0
08 Oct 2024
ViBiDSampler: Enhancing Video Interpolation Using Bidirectional Diffusion Sampler
Serin Yang
Taesung Kwon
Jong Chul Ye
VGen
DiffM
27
3
0
08 Oct 2024
GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting
Yukang Cao
Masoud Hadi
Liang Pan
Ziwei Liu
3DGS
DiffM
53
4
0
07 Oct 2024
L-C4: Language-Based Video Colorization for Creative and Consistent Color
Zheng Chang
Shuchen Weng
Huan Ouyang
Yu Li
Si Li
Boxin Shi
DiffM
VGen
VLM
28
0
0
07 Oct 2024
Learning Efficient and Effective Trajectories for Differential Equation-based Image Restoration
Zhiyu Zhu
Jinhui Hou
Hui Liu
Huanqiang Zeng
Junhui Hou
35
0
0
07 Oct 2024
ACDC: Autoregressive Coherent Multimodal Generation using Diffusion Correction
Hyungjin Chung
Dohun Lee
Jong Chul Ye
VGen
DiffM
21
2
0
07 Oct 2024
CAR: Controllable Autoregressive Modeling for Visual Generation
Ziyu Yao
Jialin Li
Yifeng Zhou
Yong Liu
Xi Jiang
Chengjie Wang
Feng Zheng
Yuexian Zou
Lei Li
DiffM
35
13
0
07 Oct 2024
VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning
Han Lin
Tushar Nagarajan
Nicolas Ballas
Mido Assran
Mojtaba Komeili
Mohit Bansal
Koustuv Sinha
AI4TS
52
3
0
04 Oct 2024
Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models
Seyedmorteza Sadat
Otmar Hilliges
Romann M. Weber
DiffM
18
8
0
03 Oct 2024
MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation
T. Pham
Tri Ton
Chang D. Yoo
36
3
0
03 Oct 2024
Loong: Generating Minute-level Long Videos with Autoregressive Language Models
Yuqing Wang
Tianwei Xiong
Daquan Zhou
Zhijie Lin
Yang Zhao
Bingyi Kang
Jiashi Feng
Xihui Liu
VGen
46
23
0
03 Oct 2024
Text2PDE: Latent Diffusion Models for Accessible Physics Simulation
Anthony Y. Zhou
Zijie Li
Michael Schneier
John R Buchanan Jr
Amir Barati Farimani
AI4CE
DiffM
52
5
0
02 Oct 2024
Replace Anyone in Videos
Xiang Wang
Shiwei Zhang
Haonan Qiu
Ruihang Chu
Zekun Li
Y. Zhang
Changxin Gao
Yuehuan Wang
Chunhua Shen
Nong Sang
VGen
DiffM
64
1
0
30 Sep 2024
Simple and Fast Distillation of Diffusion Models
Zhenyu Zhou
Defang Chen
Can Wang
Chun Chen
Siwei Lyu
DiffM
35
5
0
29 Sep 2024
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation
Shaowei Liu
Zhongzheng Ren
Saurabh Gupta
Shenlong Wang
VGen
DiffM
PINN
42
33
0
27 Sep 2024
Gradient-free Decoder Inversion in Latent Diffusion Models
Seongmin Hong
Suh Yoon Jeon
Kyeonghyun Lee
Ernest K. Ryu
S. Chun
24
0
0
27 Sep 2024
Physics-aligned Schrödinger bridge
Zeyu Li
Hongkun Dou
Shen Fang
Wang Han
Yue Deng
Lijun Yang
AI4CE
DiffM
28
0
0
26 Sep 2024
A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation
Masato Ishii
Akio Hayakawa
Takashi Shibuya
Yuki Mitsufuji
VGen
DiffM
63
4
0
26 Sep 2024
Disco4D: Disentangled 4D Human Generation and Animation from a Single Image
Hui En Pang
Shuai Liu
Zhongang Cai
Lei Yang
Tianwei Zhang
Ziwei Liu
3DGS
31
3
0
25 Sep 2024
Mitigating Covariate Shift in Imitation Learning for Autonomous Vehicles Using Latent Space Generative World Models
A. Popov
Alperen Degirmenci
David Wehr
Shashank Hegde
Ryan Oldja
...
David Nistér
Urs Muller
Ruchi Bhargava
Stan Birchfield
Nikolai Smolyanskiy
75
9
0
25 Sep 2024
Single Image, Any Face: Generalisable 3D Face Generation
Wenqing Wang
Haosen Yang
Josef Kittler
Xiatian Zhu
3DH
71
0
0
25 Sep 2024
Ctrl-GenAug: Controllable Generative Augmentation for Medical Sequence Classification
Xinrui Zhou
Yuhao Huang
Haoran Dou
Shijing Chen
Ao Chang
...
Jie Jessie Ren
Ruobing Huang
Jun Cheng
Wufeng Xue
Dong Ni
MedIm
93
0
0
25 Sep 2024
MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling
Yifang Men
Yuan Yao
Miaomiao Cui
Liefeng Bo
DiffM
29
17
0
24 Sep 2024
DepthART: Monocular Depth Estimation as Autoregressive Refinement Task
Bulat Gabdullin
Nina Konovalova
Nikolay Patakin
Dmitry Senushkin
Anton Konushin
MDE
25
0
0
23 Sep 2024
Multi-Modal Generative AI: Multi-modal LLM, Diffusion and Beyond
Hong Chen
Xin Wang
Yuwei Zhou
Bin Huang
Yipeng Zhang
Wei Feng
Houlun Chen
Zeyang Zhang
Siao Tang
Wenwu Zhu
DiffM
47
7
0
23 Sep 2024
Video-to-Audio Generation with Fine-grained Temporal Semantics
Yuchen Hu
Yu Gu
Chenxing Li
Rilin Chen
Dong Yu
VGen
DiffM
24
1
0
23 Sep 2024
JVID: Joint Video-Image Diffusion for Visual-Quality and Temporal-Consistency in Video Generation
Hadrien Reynaud
Matthew Baugh
Mischa Dombrowski
Sarah Cechnicka
Qingjie Meng
Bernhard Kainz
VLM
31
0
0
21 Sep 2024
Denoising Reuse: Exploiting Inter-frame Motion Consistency for Efficient Video Latent Generation
Chenyu Wang
Shuo Yan
Yixuan Chen
Yujiang Wang
Mingzhi Dong
...
Qin Lv
Fan Yang
Tun Lu
Ning Gu
Li Shang
DiffM
VGen
33
0
0
19 Sep 2024
HSIGene: A Foundation Model For Hyperspectral Image Generation
Li Pang
Datao Tang
Shuang Xu
Deyu Meng
Xiangyong Cao
DiffM
28
11
0
19 Sep 2024
SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation
Mingze Sun
Chen Guo
Puhua Jiang
Shiwei Mao
Yurun Chen
Ruqi Huang
44
4
0
18 Sep 2024
PainDiffusion: Learning to Express Pain
Quang Tien Dam
Tri Tung Nguyen Nguyen
D. Tran
Joo-Ho Lee
Joo-Ho Lee
VGen
30
0
0
18 Sep 2024
OSV: One Step is Enough for High-Quality Image to Video Generation
Xiaofeng Mao
Zhengkai Jiang
Fu-Yun Wang
Wenbing Zhu
Hao Chen
Mingmin Chi
Yabiao Wang
Wenhan Luo
DiffM
VGen
72
7
0
17 Sep 2024
MotionCom: Automatic and Motion-Aware Image Composition with LLM and Video Diffusion Prior
Weijing Tao
Xiaofeng Yang
Miaomiao Cui
Guosheng Lin
DiffM
26
1
0
16 Sep 2024
Think Twice Before You Act: Improving Inverse Problem Solving With MCMC
Y. Zhu
Zehao Dou
Haoxin Zheng
Yasi Zhang
Ying Nian Wu
Ruiqi Gao
DiffM
22
4
0
13 Sep 2024
Previous
1
2
3
...
5
6
7
...
15
16
17
Next