ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.08818
  4. Cited By
Align your Latents: High-Resolution Video Synthesis with Latent
  Diffusion Models

Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models

18 April 2023
A. Blattmann
Robin Rombach
Huan Ling
Tim Dockhorn
Seung Wook Kim
Sanja Fidler
Karsten Kreis
    3DGS
    VGen
ArXivPDFHTML

Papers citing "Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models"

50 / 827 papers shown
Title
My Art My Choice: Adversarial Protection Against Unruly AI
My Art My Choice: Adversarial Protection Against Unruly AI
Anthony Rhodes
Ram Bhagat
U. Ciftci
Ilke Demir
DiffM
35
4
0
06 Sep 2023
Hierarchical Masked 3D Diffusion Model for Video Outpainting
Hierarchical Masked 3D Diffusion Model for Video Outpainting
Fanda Fan
Chaoxu Guo
Litong Gong
Biao Wang
T. Ge
Yuning Jiang
Chunjie Luo
Jianfeng Zhan
DiffM
VGen
19
13
0
05 Sep 2023
Benchmarking Autoregressive Conditional Diffusion Models for Turbulent
  Flow Simulation
Benchmarking Autoregressive Conditional Diffusion Models for Turbulent Flow Simulation
Georg Kohl
Li-Wei Chen
Nils Thuerey
AI4CE
DiffM
31
22
0
04 Sep 2023
MagicProp: Diffusion-based Video Editing via Motion-aware Appearance
  Propagation
MagicProp: Diffusion-based Video Editing via Motion-aware Appearance Propagation
Hanshu Yan
Jun Hao Liew
Long Mai
Shanchuan Lin
Jiashi Feng
VGen
DiffM
19
14
0
02 Sep 2023
MVDream: Multi-view Diffusion for 3D Generation
MVDream: Multi-view Diffusion for 3D Generation
Yichun Shi
Peng Wang
Jianglong Ye
Mai Long
Kejie Li
X. Yang
20
588
0
31 Aug 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
J. Liu
62
31
0
27 Aug 2023
Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs
Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs
Hao Fei
Shengqiong Wu
Wei Ji
Hanwang Zhang
Tat-Seng Chua
VGen
DiffM
11
32
0
26 Aug 2023
Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG
  Translation
Region-Disentangled Diffusion Model for High-Fidelity PPG-to-ECG Translation
Debaditya Shome
Pritam Sarkar
Ali Etemad
DiffM
19
8
0
25 Aug 2023
StoryBench: A Multifaceted Benchmark for Continuous Story Visualization
StoryBench: A Multifaceted Benchmark for Continuous Story Visualization
Emanuele Bugliarello
Hernan Moraldo
Ruben Villegas
Mohammad Babaeizadeh
M. Saffar
Han Zhang
D. Erhan
V. Ferrari
Pieter-Jan Kindermans
P. Voigtlaender
VGen
20
10
0
22 Aug 2023
MeDM: Mediating Image Diffusion Models for Video-to-Video Translation
  with Temporal Correspondence Guidance
MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance
Ernie Chu
Tzu-Hua Huang
Shuohao Lin
Jun-Cheng Chen
DiffM
VGen
19
13
0
19 Aug 2023
SimDA: Simple Diffusion Adapter for Efficient Video Generation
SimDA: Simple Diffusion Adapter for Efficient Video Generation
Zhen Xing
Qi Dai
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGen
DiffM
24
81
0
18 Aug 2023
Diffusion Models for Image Restoration and Enhancement -- A
  Comprehensive Survey
Diffusion Models for Image Restoration and Enhancement -- A Comprehensive Survey
Xin Li
Yulin Ren
Xin Jin
Cuiling Lan
X. Wang
Wenjun Zeng
Xinchao Wang
Zhibo Chen
39
46
0
18 Aug 2023
Edit Temporal-Consistent Videos with Image Diffusion Model
Edit Temporal-Consistent Videos with Image Diffusion Model
Yuan-Zheng Wang
Yong Li
Xiaoya Zhang
Xin Liu
Anbo Dai
Antoni B. Chan
Zhen Cui
DiffM
25
6
0
17 Aug 2023
Dual-Stream Diffusion Net for Text-to-Video Generation
Dual-Stream Diffusion Net for Text-to-Video Generation
Binhui Liu
Xin Liu
Anbo Dai
Zhiyong Zeng
Dan Wang
Zhen Cui
Jian Yang
DiffM
VGen
14
9
0
16 Aug 2023
ModelScope Text-to-Video Technical Report
ModelScope Text-to-Video Technical Report
Jiuniu Wang
Hangjie Yuan
Dayou Chen
Yingya Zhang
Xiang Wang
Shiwei Zhang
VGen
DiffM
25
387
0
12 Aug 2023
DiffSynth: Latent In-Iteration Deflickering for Realistic Video
  Synthesis
DiffSynth: Latent In-Iteration Deflickering for Realistic Video Synthesis
Zhongjie Duan
Lizhou You
Chengyu Wang
Cen Chen
Ziheng Wu
Weining Qian
Jun Huang
DiffM
29
8
0
07 Aug 2023
MobileVidFactory: Automatic Diffusion-Based Social Media Video
  Generation for Mobile Devices from Text
MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text
Junchen Zhu
Huan Yang
Wenjing Wang
Huiguo He
Zixi Tuo
...
Wen-Huang Cheng
Lianli Gao
Jingkuan Song
Jianlong Fu
Jiebo Luo
DiffM
26
6
0
31 Jul 2023
VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by
  Using Diffusion Model with ControlNet
VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by Using Diffusion Model with ControlNet
Zhihao Hu
Dong Xu
DiffM
VGen
11
64
0
26 Jul 2023
AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of
  Diffusion Probabilistic Models
AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models
Jiachun Pan
Jun Hao Liew
Vincent Y. F. Tan
Jiashi Feng
Hanshu Yan
DiffM
16
9
0
20 Jul 2023
TokenFlow: Consistent Diffusion Features for Consistent Video Editing
TokenFlow: Consistent Diffusion Features for Consistent Video Editing
Michal Geyer
Omer Bar-Tal
Shai Bagon
Tali Dekel
VGen
DiffM
18
250
0
19 Jul 2023
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding
  and Generation
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
Yi Wang
Yinan He
Yizhuo Li
Kunchang Li
Jiashuo Yu
...
Ping Luo
Ziwei Liu
Yali Wang
Limin Wang
Yu Qiao
VLM
VGen
25
244
0
13 Jul 2023
Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Yin-Yin He
Menghan Xia
Haoxin Chen
Xiaodong Cun
Yuan Gong
...
Yong Zhang
Xintao Wang
Chao-Liang Weng
Ying Shan
Qifeng Chen
DiffM
VGen
12
74
0
13 Jul 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models
  without Specific Tuning
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo
Ceyuan Yang
Anyi Rao
Zhengyang Liang
Yaohui Wang
Yu Qiao
Maneesh Agrawala
Dahua Lin
Bo Dai
VGen
18
781
0
10 Jul 2023
Text-Guided Synthesis of Eulerian Cinemagraphs
Text-Guided Synthesis of Eulerian Cinemagraphs
Aniruddha Mahapatra
Aliaksandr Siarohin
Hsin-Ying Lee
Sergey Tulyakov
Junchen Zhu
DiffM
VGen
16
21
0
06 Jul 2023
SDXL: Improving Latent Diffusion Models for High-Resolution Image
  Synthesis
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
21
2,118
0
04 Jul 2023
Disentanglement in a GAN for Unconditional Speech Synthesis
Disentanglement in a GAN for Unconditional Speech Synthesis
Matthew Baas
Herman Kamper
DiffM
11
2
0
04 Jul 2023
Unsupervised Video Anomaly Detection with Diffusion Models Conditioned
  on Compact Motion Representations
Unsupervised Video Anomaly Detection with Diffusion Models Conditioned on Compact Motion Representations
Anil Osman Tur
Nicola Dall’Asen
Cigdem Beyan
Elisa Ricci
DiffM
VGen
19
14
0
04 Jul 2023
Squeezing Large-Scale Diffusion Models for Mobile
Squeezing Large-Scale Diffusion Models for Mobile
Jiwoong Choi
Minkyu Kim
Daehyun Ahn
Taesu Kim
Yulhwa Kim
Do-Hyun Jo
H. Jeon
Jae-Joon Kim
Hyungjun Kim
15
9
0
03 Jul 2023
MVDiffusion: Enabling Holistic Multi-view Image Generation with
  Correspondence-Aware Diffusion
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion
Shitao Tang
Fuyang Zhang
Jiacheng Chen
Peng Wang
Yasutaka Furukawa
18
150
0
03 Jul 2023
Solving Linear Inverse Problems Provably via Posterior Sampling with
  Latent Diffusion Models
Solving Linear Inverse Problems Provably via Posterior Sampling with Latent Diffusion Models
Litu Rout
Negin Raoof
Giannis Daras
C. Caramanis
A. Dimakis
Sanjay Shakkottai
DiffM
19
92
0
02 Jul 2023
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text
  Aligned Latent Representation
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
Zibo Zhao
Wen Liu
Xin Chen
Xi Zeng
Rui Wang
Pei Cheng
Bin-Bin Fu
Tao Chen
Gang Yu
Shenghua Gao
DiffM
17
87
0
29 Jun 2023
Federated Generative Learning with Foundation Models
Federated Generative Learning with Foundation Models
Jie M. Zhang
Xiaohua Qi
Bo-Lu Zhao
FedML
31
21
0
28 Jun 2023
GD-VDM: Generated Depth for better Diffusion-based Video Generation
GD-VDM: Generated Depth for better Diffusion-based Video Generation
Ariel Lapid
Idan Achituve
Lior Bracha
Ethan Fetaya
DiffM
VGen
79
6
0
19 Jun 2023
Relation-Aware Diffusion Model for Controllable Poster Layout Generation
Relation-Aware Diffusion Model for Controllable Poster Layout Generation
Fengheng Li
An Liu
Wei Feng
Honghe Zhu
Yaoyu Li
...
Jingjing Lv
Xin Zhu
Jun-Jun Shen
Zhangang Lin
Jingping Shao
17
21
0
15 Jun 2023
MovieFactory: Automatic Movie Creation from Text using Large Generative
  Models for Language and Images
MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images
Junchen Zhu
Huan Yang
Huiguo He
Wenjing Wang
Zixi Tuo
Wen-Huang Cheng
Lianli Gao
Jingkuan Song
Jianlong Fu
VGen
DiffM
25
39
0
12 Jun 2023
The Age of Synthetic Realities: Challenges and Opportunities
The Age of Synthetic Realities: Challenges and Opportunities
J. P. Cardenuto
Jing Yang
Rafael Padilha
Renjie Wan
Daniel Moreira
Haoliang Li
Shiqi Wang
Fernanda A. Andaló
Sébastien Marcel
Anderson de Rezende Rocha
DeLMO
37
29
0
09 Jun 2023
SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions
SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions
Yuseung Lee
Kunho Kim
Hyunjin Kim
Minhyuk Sung
DiffM
9
62
0
08 Jun 2023
Multi-modal Latent Diffusion
Multi-modal Latent Diffusion
Mustapha Bounoua
Giulio Franzese
Pietro Michiardi
DiffM
16
12
0
07 Jun 2023
ATT3D: Amortized Text-to-3D Object Synthesis
ATT3D: Amortized Text-to-3D Object Synthesis
Jonathan Lorraine
Kevin Xie
Xiaohui Zeng
Chen-Hsuan Lin
Towaki Takikawa
Nicholas Sharp
Tsung-Yi Lin
Ming-Yu Liu
Sanja Fidler
James Lucas
DiffM
14
86
0
06 Jun 2023
HeadSculpt: Crafting 3D Head Avatars with Text
HeadSculpt: Crafting 3D Head Avatars with Text
Xiaoping Han
Yukang Cao
Kai Han
Xiatian Zhu
Jiankang Deng
Yi-Zhe Song
Tao Xiang
Kwan-Yee Kenneth Wong
DiffM
16
45
0
05 Jun 2023
VideoComposer: Compositional Video Synthesis with Motion Controllability
VideoComposer: Compositional Video Synthesis with Motion Controllability
Xiang Wang
Hangjie Yuan
Shiwei Zhang
Dayou Chen
Jiuniu Wang
Yingya Zhang
Yujun Shen
Deli Zhao
Jingren Zhou
VGen
DiffM
25
315
0
03 Jun 2023
Probabilistic Adaptation of Text-to-Video Models
Probabilistic Adaptation of Text-to-Video Models
Mengjiao Yang
Yilun Du
Bo Dai
Dale Schuurmans
J. Tenenbaum
Pieter Abbeel
VGen
DiffM
29
23
0
02 Jun 2023
DeepfakeArt Challenge: A Benchmark Dataset for Generative AI Art Forgery
  and Data Poisoning Detection
DeepfakeArt Challenge: A Benchmark Dataset for Generative AI Art Forgery and Data Poisoning Detection
Hossein Aboutalebi
Daniel Mao
Rongqi Fan
Carol Xu
Chris He
Alexander Wong
AAML
12
8
0
02 Jun 2023
Make-Your-Video: Customized Video Generation Using Textual and
  Structural Guidance
Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Jinbo Xing
Menghan Xia
Yuxin Liu
Yuechen Zhang
Yong Zhang
...
Haoxin Chen
Xiaodong Cun
Xintao Wang
Ying Shan
T. Wong
VGen
DiffM
30
84
0
01 Jun 2023
A Geometric Perspective on Diffusion Models
A Geometric Perspective on Diffusion Models
Defang Chen
Zhenyu Zhou
Jianhan Mei
Chunhua Shen
Chun-Yen Chen
C. Wang
DiffM
15
18
0
31 May 2023
SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for
  Text-driven Video Editing
SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-driven Video Editing
Nazmul Karim
Umar Khalid
M. Joneidi
Chen Chen
Nazanin Rahnavard
DiffM
VGen
19
5
0
30 May 2023
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
Zeyue Xue
Guanglu Song
Qiushan Guo
Boxiao Liu
Zhuofan Zong
Yu Liu
Ping Luo
DiffM
29
132
0
29 May 2023
InstructEdit: Improving Automatic Masks for Diffusion-based Image
  Editing With User Instructions
InstructEdit: Improving Automatic Masks for Diffusion-based Image Editing With User Instructions
Qian Wang
Biao Zhang
Michael Birsak
Peter Wonka
DiffM
28
31
0
29 May 2023
Negative-prompt Inversion: Fast Image Inversion for Editing with
  Text-guided Diffusion Models
Negative-prompt Inversion: Fast Image Inversion for Editing with Text-guided Diffusion Models
Daiki Miyake
Akihiro Iohara
Yuriko Saito
Toshiyuki Tanaka
DiffM
16
110
0
26 May 2023
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion
  Models
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models
Xingqian Xu
Jiayi Guo
Zhangyang Wang
Gao Huang
Irfan Essa
Humphrey Shi
VLM
DiffM
25
57
0
25 May 2023
Previous
123...151617
Next