Structure and Content-Guided Video Synthesis with Diffusion Models

6 February 2023
Patrick Esser
Johnathan Chiu
Parmida Atighehchian
Jonathan Granskog
Anastasis Germanidis
    DiffM
    VGen

Papers citing "Structure and Content-Guided Video Synthesis with Diffusion Models"

50 / 422 papers shown
Towards Flexible, Scalable, and Adaptive Multi-Modal Conditioned Face Synthesis
Jingjing Ren
Cheng Xu
Haoyu Chen
Xinran Qin
Lei Zhu
CVBM
DiffM
16
4
0
26 Dec 2023
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos
Xiang Wang
Shiwei Zhang
Hangjie Yuan
Zhiwu Qing
Biao Gong
Yingya Zhang
Yujun Shen
Changxin Gao
Nong Sang
DiffM
VGen
25
26
0
25 Dec 2023
Diffusion Reward: Learning Rewards via Conditional Video Diffusion
Tao Huang
Guangqi Jiang
Yanjie Ze
Huazhe Xu
VGen
26
22
0
21 Dec 2023
VideoPoet: A Large Language Model for Zero-Shot Video Generation
Dan Kondratyuk
Lijun Yu
Xiuye Gu
José Lezama
Jonathan Huang
...
Irfan Essa
Huisheng Wang
David A. Ross
Bryan Seybold
Lu Jiang
VGen
12
235
0
21 Dec 2023
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models
Yiming Zhang
Zhening Xing
Yanhong Zeng
Youqing Fang
Kai Chen
VGen
25
27
0
21 Dec 2023
Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis
Bichen Wu
Ching-Yao Chuang
Xiaoyan Wang
Yichen Jia
K. Krishnakumar
Tong Xiao
Feng Liang
Licheng Yu
Peter Vajda
DiffM
VGen
17
22
0
20 Dec 2023
RealCraft: Attention Control as A Tool for Zero-Shot Consistent Video Editing
Shutong Jin
Ruiyu Wang
Florian T. Pokorny
DiffM
VGen
76
1
0
19 Dec 2023
InstructVideo: Instructing Video Diffusion Models with Human Feedback
Hangjie Yuan
Shiwei Zhang
Xiang Wang
Yujie Wei
Tao Feng
Yining Pan
Yingya Zhang
Ziwei Liu
Samuel Albanie
Dong Ni
VGen
13
41
0
19 Dec 2023
MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers
Haoyu Ma
Shahin Mahdizadehaghdam
Bichen Wu
Zhipeng Fan
Yuchao Gu
Wenliang Zhao
Lior Shapira
Xiaohui Xie
DiffM
VGen
10
4
0
19 Dec 2023
Urban Generative Intelligence (UGI): A Foundational Platform for Agents in Embodied City Environment
Fengli Xu
Jun Zhang
Chen Gao
J. Feng
Yong Li
AI4CE
LLMAG
19
28
0
19 Dec 2023
Towards Detailed Text-to-Motion Synthesis via Basic-to-Advanced Hierarchical Diffusion Model
Zhenyu Xie
Yang Wu
Xuehao Gao
Zhongqian Sun
Wei Yang
Xiaodan Liang
DiffM
19
11
0
18 Dec 2023
VideoLCM: Video Latent Consistency Model
Xiang Wang
Shiwei Zhang
Han Zhang
Yu Liu
Yingya Zhang
Changxin Gao
Nong Sang
VGen
DiffM
17
48
0
14 Dec 2023
HeadArtist: Text-conditioned 3D Head Generation with Self Score Distillation
Hongyu Liu
Xuan Wang
Ziyu Wan
Yujun Shen
Yibing Song
Jing Liao
Qifeng Chen
DiffM
33
17
0
12 Dec 2023
PEEKABOO: Interactive Video Generation via Masked-Diffusion
Yash Jain
Anshul Nasery
Vibhav Vineet
Harkirat Singh Behl
VGen
26
30
0
12 Dec 2023
Boosting Latent Diffusion with Flow Matching
Johannes S. Fischer
Ming Gui
Pingchuan Ma
Nick Stracke
S. A. Baumann
Bjorn Ommer
17
20
0
12 Dec 2023
LatentMan: Generating Consistent Animated Characters using Image Diffusion Models
Abdelrahman Eldesokey
Peter Wonka
13
4
0
12 Dec 2023
Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution
Shangchen Zhou
Peiqing Yang
Jianyi Wang
Yihang Luo
Chen Change Loy
VGen
99
37
0
11 Dec 2023
Neutral Editing Framework for Diffusion-based Video Editing
Sunjae Yoon
Gwanhyeong Koo
Jiajing Hong
Changdong Yoo
VGen
DiffM
11
1
0
10 Dec 2023
Generating Illustrated Instructions
Sachit Menon
Ishan Misra
Rohit Girdhar
DiffM
24
4
0
07 Dec 2023
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Zhiwu Qing
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yujie Wei
Yingya Zhang
Changxin Gao
Nong Sang
VGen
DiffM
24
37
0
07 Dec 2023
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
Yujie Wei
Shiwei Zhang
Zhiwu Qing
Hangjie Yuan
Zhiheng Liu
Yu Liu
Yingya Zhang
Jingren Zhou
Hongming Shan
DiffM
VGen
11
89
0
07 Dec 2023
MEVG: Multi-event Video Generation with Text-to-Video Models
Gyeongrok Oh
Jaehwan Jeong
Sieun Kim
Wonmin Byeon
Jinkyu Kim
Sungwoong Kim
Sangpil Kim
VGen
DiffM
33
19
0
07 Dec 2023
AVID: Any-Length Video Inpainting with Diffusion Model
Zhixing Zhang
Bichen Wu
Xiaoyan Wang
Yaqiao Luo
Luxin Zhang
Yinan Zhao
Peter Vajda
Dimitris N. Metaxas
Licheng Yu
VGen
DiffM
34
33
0
06 Dec 2023
MotionCtrl: A Unified and Flexible Motion Controller for Video Generation
Zhouxia Wang
Ziyang Yuan
Xintao Wang
Tianshui Chen
Menghan Xia
Ping Luo
Ying Shan
DiffM
VGen
24
195
0
06 Dec 2023
DreamInpainter: Text-Guided Subject-Driven Image Inpainting with Diffusion Models
Shaoan Xie
Yang Zhao
Zhisheng Xiao
Kelvin C. K. Chan
Yandong Li
Yanwu Xu
Kun Zhang
Tingbo Hou
DiffM
14
26
0
05 Dec 2023
LivePhoto: Real Image Animation with Text-guided Motion Control
Xi Chen
Zhiheng Liu
Mengting Chen
Yutong Feng
Yu Liu
Yujun Shen
Hengshuang Zhao
VGen
DiffM
34
27
0
05 Dec 2023
MagicStick: Controllable Video Editing via Control Handle Transformations
Yue Ma
Xiaodong Cun
Yin-Yin He
Chenyang Qi
Xintao Wang
Ying Shan
Xiu Li
Qifeng Chen
VGen
14
24
0
05 Dec 2023
Fine-grained Controllable Video Generation via Object Appearance and Context
Hsin-Ping Huang
Yu-Chuan Su
Deqing Sun
Lu Jiang
Xuhui Jia
Yukun Zhu
Ming-Hsuan Yang
DiffM
VGen
13
13
0
05 Dec 2023
BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models
Fengyuan Shi
Jiaxi Gu
Hang Xu
Songcen Xu
Wei Zhang
Limin Wang
VGen
DiffM
22
12
0
05 Dec 2023
SAVE: Protagonist Diversification with Structure Agnostic Video Editing
Yeji Song
Wonsik Shin
Junsoo Lee
Jeesoo Kim
Nojun Kwak
DiffM
VGen
101
4
0
05 Dec 2023
VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
Yuchao Gu
Yipin Zhou
Bichen Wu
Licheng Yu
Jia-Wei Liu
Rui Zhao
Jay Zhangjie Wu
David Junhao Zhang
Mike Zheng Shou
Kevin Tang
DiffM
VGen
60
36
0
04 Dec 2023
Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models
Shengqu Cai
Duygu Ceylan
Matheus Gadelha
C. Huang
Tuanfeng Y. Wang
Gordon Wetzstein
VGen
14
16
0
03 Dec 2023
VideoBooth: Diffusion-based Video Generation with Image Prompts
Yuming Jiang
Tianxing Wu
Shuai Yang
Chenyang Si
Dahua Lin
Yu Qiao
Chen Change Loy
Ziwei Liu
DiffM
VGen
32
65
0
01 Dec 2023
VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models
Hyeonho Jeong
Geon Yeong Park
Jong Chul Ye
VGen
DiffM
106
53
0
01 Dec 2023
ART⋅V: Auto-Regressive Text-to-Video Generation with Diffusion Models
Wenming Weng
Ruoyu Feng
Yanhui Wang
Qi Dai
Chunyu Wang
...
Jianmin Bao
Yuhui Yuan
Chong Luo
Yueyi Zhang
Zhiwei Xiong
VGen
17
32
0
30 Nov 2023
One-step Diffusion with Distribution Matching Distillation
Tianwei Yin
Michael Gharbi
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
Taesung Park
DiffM
124
215
0
30 Nov 2023
Motion-Conditioned Image Animation for Video Editing
Wilson Yan
Andrew Brown
Pieter Abbeel
Rohit Girdhar
S. Azadi
DiffM
VGen
58
12
0
30 Nov 2023
CosAvatar: Consistent and Animatable Portrait Video Tuning with Text Prompt
Haiyao Xiao
Chenglai Zhong
Xuan Gao
Yudong Guo
Juyong Zhang
33
0
0
30 Nov 2023
Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving
Yu-Quan Wang
Jiawei He
Lue Fan
Hongxin Li
Yuntao Chen
Zhaoxiang Zhang
VGen
46
116
0
29 Nov 2023
VBench: Comprehensive Benchmark Suite for Video Generative Models
Ziqi Huang
Yinan He
Jiashuo Yu
Fan Zhang
Chenyang Si
...
Xinyuan Chen
Limin Wang
Dahua Lin
Yu Qiao
Ziwei Liu
VGen
59
341
0
29 Nov 2023
SmoothVideo: Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning
Liang Peng
Haoran Cheng
Zheng Yang
Ruisi Zhao
Linxuan Xia
Chaotian Song
Qinglin Lu
Boxi Wu
Wei Liu
VGen
10
2
0
29 Nov 2023
DifFlow3D: Toward Robust Uncertainty-Aware Scene Flow Estimation with Diffusion Model
Jiuming Liu
Guangming Wang
Weicai Ye
Chaokang Jiang
Jinru Han
Zhe Liu
Guofeng Zhang
Dalong Du
Hesheng Wang
21
10
0
29 Nov 2023
C3Net: Compound Conditioned ControlNet for Multimodal Content Generation
Juntao Zhang
Yuehuai Liu
Yu-Wing Tai
Chi-Keung Tang
DiffM
30
4
0
29 Nov 2023
MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing
Haoyu Zhao
Tianyi Lu
Jiaxi Gu
Xing Zhang
Qingping Zheng
Zuxuan Wu
Hang Xu
Yu-Gang Jiang
VGen
DiffM
27
10
0
29 Nov 2023
Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer
Danah Yatim
Rafail Fridman
Omer Bar-Tal
Yoni Kasten
Tali Dekel
DiffM
VGen
19
50
0
28 Nov 2023
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
Yuwei Guo
Ceyuan Yang
Anyi Rao
Maneesh Agrawala
Dahua Lin
Bo Dai
DiffM
VGen
15
113
0
28 Nov 2023
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
Liucheng Hu
Xin Gao
Peng Zhang
Ke Sun
Bang Zhang
Liefeng Bo
DiffM
VGen
28
329
0
28 Nov 2023
MotionZero: Exploiting Motion Priors for Zero-shot Text-to-Video Generation
Sitong Su
Litao Guo
Lianli Gao
Hengtao Shen
Jingkuan Song
VGen
21
4
0
28 Nov 2023
Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models
C. Rota
M. Buzzelli
J. Weijer
DiffM
23
3
0
27 Nov 2023
InterControl: Zero-shot Human Interaction Generation by Controlling Every Joint
Zhenzhi Wang
Jingbo Wang
Yixuan Li
Dahua Lin
Bo Dai
34
1
0
27 Nov 2023