Video Probabilistic Diffusion Models in Projected Latent Space

15 February 2023
Sihyun Yu
Kihyuk Sohn
Subin Kim
Jinwoo Shin
    VGen
    DiffM

Papers citing "Video Probabilistic Diffusion Models in Projected Latent Space"

29 / 29 papers shown
DiffusedWrinkles: A Diffusion-Based Model for Data-Driven Garment Animation
R. Vidaurre
Elena Garces
Dan Casas
DiffM
AI4CE
79
1
0
24 Mar 2025
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
Sihyun Yu
Meera Hahn
Dan Kondratyuk
Jinwoo Shin
Agrim Gupta
José Lezama
Irfan Essa
David A. Ross
Jonathan Huang
DiffM
VGen
69
0
0
18 Feb 2025
CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal Concatenation
Zheng Chong
Wenqing Zhang
Shiyue Zhang
Jun Zheng
Xiao Dong
Haoxiang Li
Yiling Wu
D. Jiang
Xiaodan Liang
DiffM
26
1
0
20 Jan 2025
InterDyn: Controllable Interactive Dynamics with Video Diffusion Models
Rick Akkerman
Haiwen Feng
M. Black
Dimitrios Tzionas
Victoria Fernandez-Abrevaya
VGen
AI4CE
100
3
0
16 Dec 2024
Video Diffusion Transformers are In-Context Learners
Zhengcong Fei
Di Qiu
Changqian Yu
Debang Li
Mingyuan Fan
VGen
DiffM
130
2
0
14 Dec 2024
I2VControl-Camera: Precise Video Camera Control with Adjustable Motion Strength
Wanquan Feng
Jiawei Liu
Pengqi Tu
Tianhao Qi
Mingzhen Sun
Tianxiang Ma
Songtao Zhao
Siyu Zhou
Qian He
VGen
47
7
0
10 Nov 2024
Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content
Qiuheng Wang
Yukai Shi
Jiarong Ou
R. J. Chen
Ke Lin
...
Mingwu Zheng
Xin Tao
Fei Yang
Pengfei Wan
Di Zhang
VGen
86
18
0
10 Oct 2024
OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
Kepan Nan
Rui Xie
Penghao Zhou
Tiehan Fan
Zhenheng Yang
Zhijie Chen
Xiang Li
Jian Yang
Ying Tai
73
68
0
02 Jul 2024
Visual Representation Learning with Stochastic Frame Prediction
Huiwon Jang
Dongyoung Kim
Junsu Kim
Jinwoo Shin
Pieter Abbeel
Younggyo Seo
29
2
0
11 Jun 2024
MaSkel: A Model for Human Whole-body X-rays Generation from Human Masking Images
Yingjie Xi
Boyuan Cheng
Jingyao Cai
Jian Jun Zhang
Xiaosong Yang
MedIm
31
0
0
13 Apr 2024
Image Synthesis with Graph Conditioning: CLIP-Guided Diffusion Models for Scene Graphs
Rameshwar Mishra
A. V. Subramanyam
DiffM
14
2
0
25 Jan 2024
Latte: Latent Diffusion Transformer for Video Generation
Xin Ma
Yaohui Wang
Gengyun Jia
Xinyuan Chen
Z. Liu
Yuan-Fang Li
Cunjian Chen
Yu Qiao
DiffM
VGen
123
233
0
05 Jan 2024
Towards Detailed Text-to-Motion Synthesis via Basic-to-Advanced Hierarchical Diffusion Model
Zhenyu Xie
Yang Wu
Xuehao Gao
Zhongqian Sun
Wei Yang
Xiaodan Liang
DiffM
22
11
0
18 Dec 2023
Photorealistic Video Generation with Diffusion Models
Agrim Gupta
Lijun Yu
Kihyuk Sohn
Xiuye Gu
Meera Hahn
Fei-Fei Li
Irfan Essa
Lu Jiang
José Lezama
VGen
36
172
0
11 Dec 2023
DiffCast: A Unified Framework via Residual Diffusion for Precipitation Nowcasting
Demin Yu
Xutao Li
Yunming Ye
Baoquan Zhang
Chuyao Luo
Kuai Dai
Rui Wang
Xunlai Chen
28
19
0
11 Dec 2023
GenDeF: Learning Generative Deformation Field for Video Generation
Wen Wang
Kecheng Zheng
Qiuyu Wang
Hao Chen
Zifan Shi
Ceyuan Yang
Yujun Shen
Chunhua Shen
VGen
DiffM
41
2
0
07 Dec 2023
Flow-Guided Diffusion for Video Inpainting
Bohai Gu
Yongsheng Yu
Hengrui Fan
Libo Zhang
VGen
DiffM
28
12
0
26 Nov 2023
FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline
V.Ya. Arkhipkin
Zein Shaheen
Viacheslav Vasilev
E. Dakhova
Andrey Kuznetsov
Denis Dimitrov
DiffM
VGen
16
5
0
22 Nov 2023
MotionDirector: Motion Customization of Text-to-Video Diffusion Models
Rui Zhao
Yuchao Gu
Jay Zhangjie Wu
David Junhao Zhang
Jia-Wei Liu
Weijia Wu
Jussi Keppo
Mike Zheng Shou
DiffM
VGen
20
103
0
12 Oct 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
J. Liu
62
31
0
27 Aug 2023
PreDiff: Precipitation Nowcasting with Latent Diffusion Models
Zhihan Gao
Xingjian Shi
Boran Han
Hongya Wang
Xiaoyong Jin
Danielle C. Maddix
Yi Zhu
Mu Li
Bernie Wang
BDL
DiffM
23
53
0
19 Jul 2023
Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Fu Lee Wang
Wenshuo Chen
Guanglu Song
Han-Jia Ye
Yu Liu
Hongsheng Li
VGen
DiffM
28
87
0
29 May 2023
Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
Qiucheng Wu
Yujian Liu
Handong Zhao
T. Bui
Zhe-nan Lin
Yang Zhang
Shiyu Chang
DiffM
37
43
0
07 Apr 2023
Flexible Diffusion Modeling of Long Videos
William Harvey
Saeid Naderiparizi
Vaden Masrani
Christian Weilbach
Frank D. Wood
DiffM
BDL
VGen
174
284
0
23 May 2022
StyleVideoGAN: A Temporal Generative Model using a Pretrained StyleGAN
Gereon Fox
A. Tewari
Mohamed A. Elgharib
Christian Theobalt
163
51
0
15 Jul 2021
VideoGPT: Video Generation using VQ-VAE and Transformers
Wilson Yan
Yunzhi Zhang
Pieter Abbeel
A. Srinivas
ViT
VGen
242
482
0
20 Apr 2021
Diffusion Probabilistic Models for 3D Point Cloud Generation
Shitong Luo
Wei Hu
3DPC
172
711
0
02 Mar 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
278
1,939
0
09 Feb 2021
Generative Spoken Language Modeling from Raw Audio
Kushal Lakhotia
Evgeny Kharitonov
Wei-Ning Hsu
Yossi Adi
Adam Polyak
...
Tu Nguyen
Jade Copet
Alexei Baevski
A. Mohamed
Emmanuel Dupoux
AuLLM
174
336
0
01 Feb 2021