ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.08818
  4. Cited By
Align your Latents: High-Resolution Video Synthesis with Latent
  Diffusion Models

Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models

18 April 2023
A. Blattmann
Robin Rombach
Huan Ling
Tim Dockhorn
Seung Wook Kim
Sanja Fidler
Karsten Kreis
    3DGS
    VGen
ArXivPDFHTML

Papers citing "Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models"

50 / 827 papers shown
Title
TrailBlazer: Trajectory Control for Diffusion-Based Video Generation
TrailBlazer: Trajectory Control for Diffusion-Based Video Generation
W. Ma
J. P. Lewis
W. Kleijn
DiffM
VGen
19
34
0
31 Dec 2023
Diffusion Model with Perceptual Loss
Diffusion Model with Perceptual Loss
Shanchuan Lin
Xiao Yang
DiffM
23
15
0
30 Dec 2023
Towards Flexible, Scalable, and Adaptive Multi-Modal Conditioned Face
  Synthesis
Towards Flexible, Scalable, and Adaptive Multi-Modal Conditioned Face Synthesis
Jingjing Ren
Cheng Xu
Haoyu Chen
Xinran Qin
Lei Zhu
CVBM
DiffM
24
4
0
26 Dec 2023
GenCast: Diffusion-based ensemble forecasting for medium-range weather
GenCast: Diffusion-based ensemble forecasting for medium-range weather
Ilan Price
Alvaro Sanchez-Gonzalez
Ferran Alet
Tom R. Andersson
Andrew El-Kadi
...
Jacklynn Stott
Shakir Mohamed
Peter W. Battaglia
Rémi R. Lam
Matthew Willson
26
105
0
25 Dec 2023
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos
Xiang Wang
Shiwei Zhang
Hangjie Yuan
Zhiwu Qing
Biao Gong
Yingya Zhang
Yujun Shen
Changxin Gao
Nong Sang
DiffM
VGen
31
26
0
25 Dec 2023
VideoPoet: A Large Language Model for Zero-Shot Video Generation
VideoPoet: A Large Language Model for Zero-Shot Video Generation
Dan Kondratyuk
Lijun Yu
Xiuye Gu
José Lezama
Jonathan Huang
...
Irfan Essa
Huisheng Wang
David A. Ross
Bryan Seybold
Lu Jiang
VGen
18
237
0
21 Dec 2023
PIA: Your Personalized Image Animator via Plug-and-Play Modules in
  Text-to-Image Models
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models
Yiming Zhang
Zhening Xing
Yanhong Zeng
Youqing Fang
Kai Chen
VGen
31
27
0
21 Dec 2023
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed
  Diffusion Models
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
Huan Ling
Seung Wook Kim
Antonio Torralba
Sanja Fidler
Karsten Kreis
DiffM
3DGS
32
112
0
21 Dec 2023
StreamDiffusion: A Pipeline-level Solution for Real-time Interactive
  Generation
StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation
Akio Kodaira
Chenfeng Xu
Toshiki Hazama
Takanori Yoshimoto
Kohei Ohno
Shogo Mitsuhori
Soichi Sugano
Hanying Cho
Zhijian Liu
Kurt Keutzer
18
31
0
19 Dec 2023
InstructVideo: Instructing Video Diffusion Models with Human Feedback
InstructVideo: Instructing Video Diffusion Models with Human Feedback
Hangjie Yuan
Shiwei Zhang
Xiang Wang
Yujie Wei
Tao Feng
Yining Pan
Yingya Zhang
Ziwei Liu
Samuel Albanie
Dong Ni
VGen
24
42
0
19 Dec 2023
Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model
Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model
Lingjun Zhang
Xinyuan Chen
Yaohui Wang
Yue Lu
Yu Qiao
DiffM
11
32
0
19 Dec 2023
Continual Learning: Forget-free Winning Subnetworks for Video
  Representations
Continual Learning: Forget-free Winning Subnetworks for Video Representations
Haeyong Kang
Jaehong Yoon
Sung Ju Hwang
Chang D. Yoo
CLL
27
2
0
19 Dec 2023
MaskINT: Video Editing via Interpolative Non-autoregressive Masked
  Transformers
MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers
Haoyu Ma
Shahin Mahdizadehaghdam
Bichen Wu
Zhipeng Fan
Yuchao Gu
Wenliang Zhao
Lior Shapira
Xiaohui Xie
DiffM
VGen
12
4
0
19 Dec 2023
VidToMe: Video Token Merging for Zero-Shot Video Editing
VidToMe: Video Token Merging for Zero-Shot Video Editing
Xirui Li
Chao Ma
Xiaokang Yang
Ming-Hsuan Yang
DiffM
VGen
27
40
0
17 Dec 2023
Anomaly Score: Evaluating Generative Models and Individual Generated
  Images based on Complexity and Vulnerability
Anomaly Score: Evaluating Generative Models and Individual Generated Images based on Complexity and Vulnerability
Jaehui Hwang
Junghyuk Lee
Jong-Seok Lee
EGVM
19
2
0
17 Dec 2023
VecFusion: Vector Font Generation with Diffusion
VecFusion: Vector Font Generation with Diffusion
Vikas Thamizharasan
Difan Liu
Shantanu Agarwal
Matthew Fisher
Michael Gharbi
Oliver Wang
Alec Jacobson
E. Kalogerakis
DiffM
22
8
0
16 Dec 2023
Iterative Motion Editing with Natural Language
Iterative Motion Editing with Natural Language
Purvi Goel
Kuan-Chieh Wang
C. Karen Liu
Kayvon Fatahalian
DiffM
22
22
0
15 Dec 2023
Latent Diffusion Models with Image-Derived Annotations for Enhanced
  AI-Assisted Cancer Diagnosis in Histopathology
Latent Diffusion Models with Image-Derived Annotations for Enhanced AI-Assisted Cancer Diagnosis in Histopathology
Pedro Osório
Guillermo Jiménez-Pérez
Javier Montalt-Tordera
Jens Hooge
Guillem Duran Ballester
...
Sabrina Schroeder
K. Siudak
Julia Vienenkoetter
Bettina Lawrenz
Sadegh Mohammadi
MedIm
25
8
0
15 Dec 2023
VideoLCM: Video Latent Consistency Model
VideoLCM: Video Latent Consistency Model
Xiang Wang
Shiwei Zhang
Han Zhang
Yu Liu
Yingya Zhang
Changxin Gao
Nong Sang
VGen
DiffM
22
48
0
14 Dec 2023
Motion Flow Matching for Human Motion Synthesis and Editing
Motion Flow Matching for Human Motion Synthesis and Editing
Vincent Tao Hu
Wenzhe Yin
Pingchuan Ma
Yunlu Chen
Basura Fernando
Yuki M. Asano
E. Gavves
Pascal Mettes
Bjorn Ommer
Cees G. M. Snoek
DiffM
30
19
0
14 Dec 2023
FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head
  Models
FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models
Shivangi Aneja
Justus Thies
Angela Dai
Matthias Nießner
DiffM
VGen
24
29
0
13 Dec 2023
FreeInit: Bridging Initialization Gap in Video Diffusion Models
FreeInit: Bridging Initialization Gap in Video Diffusion Models
Tianxing Wu
Chenyang Si
Yuming Jiang
Ziqi Huang
Ziwei Liu
DiffM
VGen
30
45
0
12 Dec 2023
Boosting Latent Diffusion with Flow Matching
Boosting Latent Diffusion with Flow Matching
Johannes S. Fischer
Ming Gui
Pingchuan Ma
Nick Stracke
S. A. Baumann
Bjorn Ommer
22
20
0
12 Dec 2023
LatentMan: Generating Consistent Animated Characters using Image
  Diffusion Models
LatentMan: Generating Consistent Animated Characters using Image Diffusion Models
Abdelrahman Eldesokey
Peter Wonka
24
4
0
12 Dec 2023
Photorealistic Video Generation with Diffusion Models
Photorealistic Video Generation with Diffusion Models
Agrim Gupta
Lijun Yu
Kihyuk Sohn
Xiuye Gu
Meera Hahn
Fei-Fei Li
Irfan Essa
Lu Jiang
José Lezama
VGen
39
174
0
11 Dec 2023
Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World
  Video Super-Resolution
Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution
Shangchen Zhou
Peiqing Yang
Jianyi Wang
Yihang Luo
Chen Change Loy
VGen
99
37
0
11 Dec 2023
DiT-Head: High-Resolution Talking Head Synthesis using Diffusion
  Transformers
DiT-Head: High-Resolution Talking Head Synthesis using Diffusion Transformers
Aaron Mir
Eduardo Alonso
Esther Mondragón
DiffM
28
2
0
11 Dec 2023
Precipitation Downscaling with Spatiotemporal Video Diffusion
Precipitation Downscaling with Spatiotemporal Video Diffusion
Prakhar Srivastava
Ruihan Yang
Gavin Kerrigan
Gideon Dresdner
Jeremy McGibbon
Christopher S. Bretherton
Stephan Mandt
DiffM
29
3
0
11 Dec 2023
A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization
  Inversion for Zero-Shot Video Editing
A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing
Maomao Li
Yu Li
Tianyu Yang
Yunfei Liu
Dongxu Yue
Zhihui Lin
Dong Xu
VGen
10
8
0
10 Dec 2023
CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional
  Modeling
CMMD: Contrastive Multi-Modal Diffusion for Video-Audio Conditional Modeling
Ruihan Yang
H. Gamper
Sebastian Braun
DiffM
22
5
0
08 Dec 2023
MotionCrafter: One-Shot Motion Customization of Diffusion Models
MotionCrafter: One-Shot Motion Customization of Diffusion Models
Yuxin Zhang
Fan Tang
Nisha Huang
Haibin Huang
Chongyang Ma
Weiming Dong
Changsheng Xu
DiffM
VGen
19
14
0
08 Dec 2023
GenDeF: Learning Generative Deformation Field for Video Generation
GenDeF: Learning Generative Deformation Field for Video Generation
Wen Wang
Kecheng Zheng
Qiuyu Wang
Hao Chen
Zifan Shi
Ceyuan Yang
Yujun Shen
Chunhua Shen
VGen
DiffM
46
2
0
07 Dec 2023
GenTron: Diffusion Transformers for Image and Video Generation
GenTron: Diffusion Transformers for Image and Video Generation
Shoufa Chen
Mengmeng Xu
Jiawei Ren
Yuren Cong
Sen He
Yanping Xie
Animesh Sinha
Ping Luo
Tao Xiang
Juan-Manuel Perez-Rua
VGen
31
38
0
07 Dec 2023
Generating Illustrated Instructions
Generating Illustrated Instructions
Sachit Menon
Ishan Misra
Rohit Girdhar
DiffM
24
4
0
07 Dec 2023
Free3D: Consistent Novel View Synthesis without 3D Representation
Free3D: Consistent Novel View Synthesis without 3D Representation
Chuanxia Zheng
Andrea Vedaldi
3DV
37
48
0
07 Dec 2023
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Zhiwu Qing
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yujie Wei
Yingya Zhang
Changxin Gao
Nong Sang
VGen
DiffM
24
37
0
07 Dec 2023
DreamVideo: Composing Your Dream Videos with Customized Subject and
  Motion
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
Yujie Wei
Shiwei Zhang
Zhiwu Qing
Hangjie Yuan
Zhiheng Liu
Yu Liu
Yingya Zhang
Jingren Zhou
Hongming Shan
DiffM
VGen
11
89
0
07 Dec 2023
MEVG: Multi-event Video Generation with Text-to-Video Models
MEVG: Multi-event Video Generation with Text-to-Video Models
Gyeongrok Oh
Jaehwan Jeong
Sieun Kim
Wonmin Byeon
Jinkyu Kim
Sungwoong Kim
Sangpil Kim
VGen
DiffM
33
20
0
07 Dec 2023
Natural-language-driven Simulation Benchmark and Copilot for Efficient
  Production of Object Interactions in Virtual Road Scenes
Natural-language-driven Simulation Benchmark and Copilot for Efficient Production of Object Interactions in Virtual Road Scenes
Kairui Yang
Zihao Guo
Gengjie Lin
Haotian Dong
Die Zuo
...
Zhao Huang
Zhecheng Xu
Fupeng Li
Ziyun Bai
Di Lin
24
1
0
07 Dec 2023
FitDiff: Robust monocular 3D facial shape and reflectance estimation using Diffusion Models
FitDiff: Robust monocular 3D facial shape and reflectance estimation using Diffusion Models
Stathis Galanakis
Alexandros Lattas
Stylianos Moschoglou
S. Zafeiriou
19
2
0
07 Dec 2023
AVID: Any-Length Video Inpainting with Diffusion Model
AVID: Any-Length Video Inpainting with Diffusion Model
Zhixing Zhang
Bichen Wu
Xiaoyan Wang
Yaqiao Luo
Luxin Zhang
Yinan Zhao
Peter Vajda
Dimitris N. Metaxas
Licheng Yu
VGen
DiffM
34
33
0
06 Dec 2023
MotionCtrl: A Unified and Flexible Motion Controller for Video
  Generation
MotionCtrl: A Unified and Flexible Motion Controller for Video Generation
Zhouxia Wang
Ziyang Yuan
Xintao Wang
Tianshui Chen
Menghan Xia
Ping Luo
Ying Shan
DiffM
VGen
33
195
0
06 Dec 2023
DiffusionSat: A Generative Foundation Model for Satellite Imagery
DiffusionSat: A Generative Foundation Model for Satellite Imagery
Samar Khanna
Patrick Liu
Linqi Zhou
Chenlin Meng
Robin Rombach
Marshall Burke
David B. Lobell
Stefano Ermon
22
57
0
06 Dec 2023
XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
Xuanchi Ren
Jiahui Huang
Xiaohui Zeng
Ken Museth
Sanja Fidler
Francis Williams
18
47
0
06 Dec 2023
DreamInpainter: Text-Guided Subject-Driven Image Inpainting with
  Diffusion Models
DreamInpainter: Text-Guided Subject-Driven Image Inpainting with Diffusion Models
Shaoan Xie
Yang Zhao
Zhisheng Xiao
Kelvin C. K. Chan
Yandong Li
Yanwu Xu
Kun Zhang
Tingbo Hou
DiffM
25
26
0
05 Dec 2023
LivePhoto: Real Image Animation with Text-guided Motion Control
LivePhoto: Real Image Animation with Text-guided Motion Control
Xi Chen
Zhiheng Liu
Mengting Chen
Yutong Feng
Yu Liu
Yujun Shen
Hengshuang Zhao
VGen
DiffM
34
28
0
05 Dec 2023
MagicStick: Controllable Video Editing via Control Handle
  Transformations
MagicStick: Controllable Video Editing via Control Handle Transformations
Yue Ma
Xiaodong Cun
Yin-Yin He
Chenyang Qi
Xintao Wang
Ying Shan
Xiu Li
Qifeng Chen
VGen
14
24
0
05 Dec 2023
Fine-grained Controllable Video Generation via Object Appearance and
  Context
Fine-grained Controllable Video Generation via Object Appearance and Context
Hsin-Ping Huang
Yu-Chuan Su
Deqing Sun
Lu Jiang
Xuhui Jia
Yukun Zhu
Ming-Hsuan Yang
DiffM
VGen
13
13
0
05 Dec 2023
BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis
  via Bridging Image and Video Diffusion Models
BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models
Fengyuan Shi
Jiaxi Gu
Hang Xu
Songcen Xu
Wei Zhang
Limin Wang
VGen
DiffM
28
12
0
05 Dec 2023
Analyzing and Improving the Training Dynamics of Diffusion Models
Analyzing and Improving the Training Dynamics of Diffusion Models
Tero Karras
M. Aittala
J. Lehtinen
Janne Hellsten
Timo Aila
S. Laine
28
153
0
05 Dec 2023
Previous
123...121314151617
Next