Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.13077
Cited By
ControlVideo: Training-free Controllable Text-to-Video Generation
22 May 2023
Yabo Zhang
Yuxiang Wei
Dongsheng Jiang
Xiaopeng Zhang
W. Zuo
Qi Tian
VGen
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ControlVideo: Training-free Controllable Text-to-Video Generation"
48 / 198 papers shown
Title
MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers
Haoyu Ma
Shahin Mahdizadehaghdam
Bichen Wu
Zhipeng Fan
Yuchao Gu
Wenliang Zhao
Lior Shapira
Xiaohui Xie
DiffM
VGen
12
4
0
19 Dec 2023
Decoupled Textual Embeddings for Customized Image Generation
Yufei Cai
Yuxiang Wei
Zhilong Ji
Jinfeng Bai
Hu Han
Wangmeng Zuo
DiffM
15
14
0
19 Dec 2023
VideoLCM: Video Latent Consistency Model
Xiang Wang
Shiwei Zhang
Han Zhang
Yu Liu
Yingya Zhang
Changxin Gao
Nong Sang
VGen
DiffM
17
48
0
14 Dec 2023
DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior
Tianyu Huang
Yihan Zeng
Zhilu Zhang
Wan Xu
Hang Xu
Songcen Xu
Rynson W. H. Lau
Wangmeng Zuo
23
25
0
11 Dec 2023
MotionCrafter: One-Shot Motion Customization of Diffusion Models
Yuxin Zhang
Fan Tang
Nisha Huang
Haibin Huang
Chongyang Ma
Weiming Dong
Changsheng Xu
DiffM
VGen
14
14
0
08 Dec 2023
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
Yujie Wei
Shiwei Zhang
Zhiwu Qing
Hangjie Yuan
Zhiheng Liu
Yu Liu
Yingya Zhang
Jingren Zhou
Hongming Shan
DiffM
VGen
11
89
0
07 Dec 2023
Drag-A-Video: Non-rigid Video Editing with Point-based Interaction
Yao Teng
Enze Xie
Yue Wu
Haoyu Han
Zhenguo Li
Xihui Liu
DiffM
VGen
20
11
0
05 Dec 2023
MagicStick: Controllable Video Editing via Control Handle Transformations
Yue Ma
Xiaodong Cun
Yin-Yin He
Chenyang Qi
Xintao Wang
Ying Shan
Xiu Li
Qifeng Chen
VGen
14
24
0
05 Dec 2023
Fine-grained Controllable Video Generation via Object Appearance and Context
Hsin-Ping Huang
Yu-Chuan Su
Deqing Sun
Lu Jiang
Xuhui Jia
Yukun Zhu
Ming-Hsuan Yang
DiffM
VGen
13
13
0
05 Dec 2023
BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models
Fengyuan Shi
Jiaxi Gu
Hang Xu
Songcen Xu
Wei Zhang
Limin Wang
VGen
DiffM
28
12
0
05 Dec 2023
SAVE: Protagonist Diversification with Structure Agnostic Video Editing
Yeji Song
Wonsik Shin
Junsoo Lee
Jeesoo Kim
Nojun Kwak
DiffM
VGen
101
4
0
05 Dec 2023
DragVideo: Interactive Drag-style Video Editing
Yufan Deng
Ruida Wang
Yuhao Zhang
Yu-Wing Tai
Chi-Keung Tang
DiffM
VGen
19
20
0
03 Dec 2023
VideoBooth: Diffusion-based Video Generation with Image Prompts
Yuming Jiang
Tianxing Wu
Shuai Yang
Chenyang Si
Dahua Lin
Yu Qiao
Chen Change Loy
Ziwei Liu
DiffM
VGen
32
65
0
01 Dec 2023
VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models
Zhen Xing
Qi Dai
Zihao Zhang
Hui Zhang
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGen
33
17
0
30 Nov 2023
ART
⋅
\boldsymbol{\cdot}
⋅
V: Auto-Regressive Text-to-Video Generation with Diffusion Models
Wenming Weng
Ruoyu Feng
Yanhui Wang
Qi Dai
Chunyu Wang
...
Jianmin Bao
Yuhui Yuan
Chong Luo
Yueyi Zhang
Zhiwei Xiong
VGen
22
32
0
30 Nov 2023
MotionEditor: Editing Video Motion via Content-Aware Diffusion
Shuyuan Tu
Qi Dai
Zhi-Qi Cheng
Hang-Rui Hu
Xintong Han
Zuxuan Wu
Yu-Gang Jiang
DiffM
VGen
28
30
0
30 Nov 2023
MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation
Yanhui Wang
Jianmin Bao
Wenming Weng
Ruoyu Feng
Dacheng Yin
...
Yuhui Yuan
Chuanxin Tang
Xiaoyan Sun
Chong Luo
Baining Guo
DiffM
VGen
66
15
0
30 Nov 2023
Motion-Conditioned Image Animation for Video Editing
Wilson Yan
Andrew Brown
Pieter Abbeel
Rohit Girdhar
S. Azadi
DiffM
VGen
58
12
0
30 Nov 2023
VBench: Comprehensive Benchmark Suite for Video Generative Models
Ziqi Huang
Yinan He
Jiashuo Yu
Fan Zhang
Chenyang Si
...
Xinyuan Chen
Limin Wang
Dahua Lin
Yu Qiao
Ziwei Liu
VGen
62
346
0
29 Nov 2023
SmoothVideo: Smooth Video Synthesis with Noise Constraints on Diffusion Models for One-shot Video Tuning
Liang Peng
Haoran Cheng
Zheng Yang
Ruisi Zhao
Linxuan Xia
Chaotian Song
Qinglin Lu
Boxi Wu
Wei Liu
VGen
15
2
0
29 Nov 2023
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
Yuwei Guo
Ceyuan Yang
Anyi Rao
Maneesh Agrawala
Dahua Lin
Bo Dai
DiffM
VGen
18
113
0
28 Nov 2023
MotionZero:Exploiting Motion Priors for Zero-shot Text-to-Video Generation
Sitong Su
Litao Guo
Lianli Gao
Hengtao Shen
Jingkuan Song
VGen
21
4
0
28 Nov 2023
Highly Detailed and Temporal Consistent Video Stylization via Synchronized Multi-Frame Diffusion
M. Xie
Hanyuan Liu
Chengze Li
Tien-Tsin Wong
VGen
DiffM
14
0
0
24 Nov 2023
FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline
V.Ya. Arkhipkin
Zein Shaheen
Viacheslav Vasilev
E. Dakhova
Andrey Kuznetsov
Denis Dimitrov
DiffM
VGen
16
5
0
22 Nov 2023
AnimateAnything: Fine-Grained Open Domain Image Animation with Motion Guidance
Zuozhuo Dai
Zhenghao Zhang
Yao Yao
Bingxue Qiu
Siyu Zhu
Long Qin
Weizhi Wang
VGen
23
44
0
21 Nov 2023
MoVideo: Motion-Aware Video Generation with Diffusion Models
Jingyun Liang
Yuchen Fan
Kai Zhang
Radu Timofte
Luc Van Gool
Rakesh Ranjan
DiffM
VGen
28
10
0
19 Nov 2023
Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning
Rohit Girdhar
Mannat Singh
Andrew Brown
Quentin Duval
S. Azadi
Sai Saketh Rambhatla
Akbar Shah
Xi Yin
Devi Parikh
Ishan Misra
DiffM
VGen
35
189
0
17 Nov 2023
AiluRus: A Scalable ViT Framework for Dense Prediction
Jin Li
Yaoming Wang
Xiaopeng Zhang
Bowen Shi
Dongsheng Jiang
Chenglin Li
Wenrui Dai
Hongkai Xiong
Qi Tian
54
5
0
02 Nov 2023
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Haoxin Chen
Menghan Xia
Yin-Yin He
Yong Zhang
Xiaodong Cun
...
Yaofang Liu
Qifeng Chen
Xintao Wang
Chao-Liang Weng
Ying Shan
DiffM
21
277
0
30 Oct 2023
WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models
Jun-Yan He
Zhi-Qi Cheng
Chenyang Li
Jingdong Sun
Wangmeng Xiang
...
Yusen Hu
Bin Luo
Yifeng Geng
Xuansong Xie
Jingren Zhou
19
13
0
20 Oct 2023
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Jinbo Xing
Menghan Xia
Yong Zhang
Haoxin Chen
Wangbo Yu
Hanyuan Liu
Xintao Wang
Tien-Tsin Wong
Ying Shan
VGen
28
199
0
18 Oct 2023
A Survey on Video Diffusion Models
Zhen Xing
Qijun Feng
Haoran Chen
Qi Dai
Hang-Rui Hu
Hang Xu
Zuxuan Wu
Yu-Gang Jiang
EGVM
VGen
55
115
0
16 Oct 2023
LOVECon: Text-driven Training-Free Long Video Editing with ControlNet
Zhenyi Liao
Zhijie Deng
DiffM
19
7
0
15 Oct 2023
ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation
Bo Peng
Xinyuan Chen
Yaohui Wang
Chaochao Lu
Yu Qiao
DiffM
VGen
14
7
0
11 Oct 2023
HiFi-123: Towards High-fidelity One Image to 3D Content Generation
Wangbo Yu
Li-ming Yuan
Yan-Pei Cao
Xiangjun Gao
Xiaoyu Li
Wenbo Hu
Long Quan
Ying Shan
Yonghong Tian
DiffM
21
29
0
10 Oct 2023
FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Yuren Cong
Mengmeng Xu
Christian Simon
Shoufa Chen
Jiawei Ren
Yanping Xie
Juan-Manuel Perez-Rua
Bodo Rosenhahn
Tao Xiang
Sen He
DiffM
VGen
22
74
0
09 Oct 2023
VoiceExtender: Short-utterance Text-independent Speaker Verification with Guided Diffusion Model
Yayun He
Zuheng Kang
Jianzong Wang
Junqing Peng
Jing Xiao
DiffM
14
2
0
07 Oct 2023
Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models
Hyeonho Jeong
Jong Chul Ye
DiffM
VGen
23
41
0
02 Oct 2023
CCEdit: Creative and Controllable Video Editing via Diffusion Models
Danfeng Hong
Wenming Weng
Hao Li
Yuhui Yuan
Jing Yao
Chong Luo
Zhibo Chen
Baining Guo
DiffM
VGen
14
42
0
28 Sep 2023
Generative Image Dynamics
Zhengqi Li
Richard Tucker
Noah Snavely
Aleksander Holynski
DiffM
29
63
0
14 Sep 2023
MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance
Ernie Chu
Tzu-Hua Huang
Shuohao Lin
Jun-Cheng Chen
DiffM
VGen
19
13
0
19 Aug 2023
Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model
Bosheng Qin
Wentao Ye
Qifan Yu
Siliang Tang
Yueting Zhuang
DiffM
VGen
21
13
0
15 Aug 2023
BEVControl: Accurately Controlling Street-view Elements with Multi-perspective Consistency via BEV Sketch Layout
Kairui Yang
Enhui Ma
Jibing Peng
Qing-Wu Guo
Di Lin
Kaicheng Yu
DiffM
22
57
0
03 Aug 2023
Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Yin-Yin He
Menghan Xia
Haoxin Chen
Xiaodong Cun
Yuan Gong
...
Yong Zhang
Xintao Wang
Chao-Liang Weng
Ying Shan
Qifeng Chen
DiffM
VGen
14
74
0
13 Jul 2023
Muse: Text-To-Image Generation via Masked Generative Transformers
Huiwen Chang
Han Zhang
Jarred Barber
AJ Maschinot
José Lezama
...
Kevin Patrick Murphy
William T. Freeman
Michael Rubinstein
Yuanzhen Li
Dilip Krishnan
DiffM
197
517
0
02 Jan 2023
DiffDreamer: Towards Consistent Unsupervised Single-view Scene Extrapolation with Conditional Diffusion Models
Shengqu Cai
E. R. Chan
Songyou Peng
Mohamad Shahbazi
Anton Obukhov
Luc Van Gool
Gordon Wetzstein
DiffM
18
33
0
22 Nov 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
243
564
0
29 May 2022
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,764
0
24 Feb 2021
Previous
1
2
3
4