Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.13077
Cited By
ControlVideo: Training-free Controllable Text-to-Video Generation
22 May 2023
Yabo Zhang
Yuxiang Wei
Dongsheng Jiang
Xiaopeng Zhang
W. Zuo
Qi Tian
VGen
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ControlVideo: Training-free Controllable Text-to-Video Generation"
50 / 198 papers shown
Title
DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models
Junhao Xia
Chaoyang Zhang
Yecheng Zhang
Chengyang Zhou
Zhichang Wang
Bochun Liu
Dongshuo Yin
DiffM
VGen
24
0
0
11 May 2025
ProFashion: Prototype-guided Fashion Video Generation with Multiple Reference Images
Xianghao Kong
Qiaosong Qi
Yuanbin Wang
Anyi Rao
Biaolong Chen
Aixi Zhang
Si Liu
Hao Jiang
DiffM
VGen
20
0
0
10 May 2025
T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation
Xuyang Guo
Jiayan Huo
Zhenmei Shi
Zhao-quan Song
Jiahao Zhang
Jiale Zhao
EGVM
VGen
PINN
77
1
0
01 May 2025
Controllable Weather Synthesis and Removal with Video Diffusion Models
Chih-Hao Lin
Z. Wang
Ruofan Liang
Yuxuan Zhang
Sanja Fidler
Shenlong Wang
Zan Gojcic
DiffM
VGen
42
0
0
01 May 2025
AnimateAnywhere: Rouse the Background in Human Image Animation
Xiaoyu Liu
Mingshuai Yao
Y. Zhang
Xianhui Lin
Peiran Ren
X. Li
Ming-Yu Liu
W. Zuo
3DH
DiffM
65
0
0
28 Apr 2025
NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration
Haotian Dong
X. Wang
D. Lin
Yipeng Wu
Qin Chen
R. Liu
Kairui Yang
Ping Li
Qing-Wu Guo
VGen
42
0
0
25 Apr 2025
DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment
X. Li
Chenming Wu
Zhao Yang
Zhihao Xu
Dingkang Liang
Y. Zhang
Ji Wan
J. Wang
VGen
67
1
0
22 Apr 2025
Satellite to GroundScape -- Large-scale Consistent Ground View Generation from Satellite Views
Ningli Xu
R. Qin
DiffM
22
0
0
22 Apr 2025
DRAWER: Digital Reconstruction and Articulation With Environment Realism
Hongchi Xia
Entong Su
Marius Memmel
Arhan Jain
Raymond Yu
Numfor Mbiziwo-Tiapo
Ali Farhadi
Abhishek Gupta
Shenlong Wang
Wei-Chiu Ma
VGen
28
1
0
21 Apr 2025
Understanding Attention Mechanism in Video Diffusion Models
Bingyan Liu
Chengyu Wang
Tongtong Su
Huan Ten
Jun Huang
K. Guo
Kui Jia
VGen
64
0
0
16 Apr 2025
OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding
Dianbing Xi
J. Wang
Yuanzhi Liang
Xi Qiu
Yuchi Huo
R. Wang
Chi Zhang
X. Li
DiffM
VGen
65
0
0
15 Apr 2025
All Patches Matter, More Patches Better: Enhance AI-Generated Image Detection via Panoptic Patch Learning
Zheng Yang
Ruoxin Chen
Zhiyuan Yan
Ke-Yue Zhang
Xinghe Fu
...
Xiujun Shu
Taiping Yao
Junchi Yan
Shouhong Ding
Xi Li
29
0
0
02 Apr 2025
Beyond Static Scenes: Camera-controllable Background Generation for Human Motion
Mingshuai Yao
Mengting Chen
Qinye Zhou
Y. Zhang
Ming-Yu Liu
...
Chen Ju
Shuai Xiao
Qingwen Liu
Jinsong Lan
Wangmeng Zuo
DiffM
VGen
36
1
0
01 Apr 2025
JointTuner: Appearance-Motion Adaptive Joint Training for Customized Video Generation
Fangda Chen
Shanshan Zhao
Chuanfu Xu
Long Lan
VGen
37
0
0
31 Mar 2025
SketchVideo: Sketch-based Video Generation and Editing
Feng-Lin Liu
Hongbo Fu
Xintao Wang
Weicai Ye
Pengfei Wan
Di Zhang
Lin Gao
DiffM
VGen
40
0
0
30 Mar 2025
EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models
Yufei Cai
Hu Han
Yuxiang Wei
Shiguang Shan
Xilin Chen
DiffM
VGen
65
0
0
25 Mar 2025
FullDiT: Multi-Task Video Generative Foundation Model with Full Attention
Xuan Ju
Weicai Ye
Quande Liu
Qiulin Wang
Xintao Wang
Pengfei Wan
Di Zhang
Kun Gai
Qiang Xu
VGen
39
1
0
25 Mar 2025
Target-Aware Video Diffusion Models
Taeksoo Kim
Hanbyul Joo
DiffM
VGen
89
1
0
24 Mar 2025
Resource-Efficient Motion Control for Video Generation via Dynamic Mask Guidance
Sicong Feng
Jielong Yang
Li Peng
DiffM
VGen
51
0
0
24 Mar 2025
TransAnimate: Taming Layer Diffusion to Generate RGBA Video
Xuewei Chen
Zhimin Chen
Yiren Song
VGen
61
0
0
23 Mar 2025
AUTV: Creating Underwater Video Datasets with Pixel-wise Annotations
Quang-Trung Truong
Wong Yuk Kwan
Duc Thanh Nguyen
Binh-Son Hua
Sai-Kit Yeung
VGen
48
0
0
17 Mar 2025
VTON 360: High-Fidelity Virtual Try-On from Any Viewing Direction
Zijian He
Yuwei Ning
Yipeng Qin
Wangrun Wang
Sibei Yang
Liang Lin
G. Li
55
1
0
15 Mar 2025
FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality
Zhengyao Lv
Chenyang Si
Junhao Song
Zhenyu Yang
Yu Qiao
Ziwei Liu
Kwan-Yee K. Wong
VGen
DiffM
76
7
0
13 Mar 2025
DreamInsert: Zero-Shot Image-to-Video Object Insertion from A Single Image
Qi Zhao
Zhan Ma
Pan Zhou
VGen
67
0
0
13 Mar 2025
I2V3D: Controllable image-to-video generation with 3D guidance
Zhiyuan Zhang
Dongdong Chen
J. Liao
VGen
53
0
0
12 Mar 2025
Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space
Yifan Zhou
Zeqi Xiao
Shuai Yang
Xingang Pan
62
1
0
12 Mar 2025
VRMDiff: Text-Guided Video Referring Matting Generation of Diffusion
Lehan Yang
Jincen Song
Tianlong Wang
Daiqing Qi
Weili Shi
Yuheng Liu
Sheng Li
DiffM
VOS
VGen
69
0
0
11 Mar 2025
VACE: All-in-One Video Creation and Editing
Zeyinzi Jiang
Zhen Han
Chaojie Mao
J. Zhang
Yulin Pan
Yu Liu
DiffM
VGen
44
5
0
10 Mar 2025
Text2Story: Advancing Video Storytelling with Text Guidance
Taewon Kang
D. Kothandaraman
Ming C. Lin
DiffM
VGen
59
0
0
08 Mar 2025
GRADEO: Towards Human-Like Evaluation for Text-to-Video Generation via Multi-Step Reasoning
Zhun Mou
Bin Xia
Zhengchao Huang
Wenming Yang
Jiaya Jia
VGen
ELM
LRM
63
0
0
04 Mar 2025
Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think
Jie Tian
Xiaoye Qu
Zhenyi Lu
Wei Wei
Sichen Liu
Yu-Xi Cheng
DiffM
VGen
44
0
0
02 Mar 2025
C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation
Yuhao Li
Mirana Claire Angel
Salman Khan
Yu Zhu
Jinqiu Sun
Yanning Zhang
F. Khan
VGen
46
0
0
27 Feb 2025
FLAP: Fully-controllable Audio-driven Portrait Video Generation through 3D head conditioned diffusion model
Lingzhou Mu
Baiji Liu
Ruonan Zhang
Guiming Mo
Jiawei Jin
Kai Zhang
Haozhi Huang
DiffM
VGen
53
1
0
26 Feb 2025
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing
Xiangpeng Yang
Linchao Zhu
Hehe Fan
Yi Yang
DiffM
VGen
41
5
0
24 Feb 2025
SMITE: Segment Me In TimE
Amirhossein Alimohammadi
Sauradip Nag
Saeid Asgari Taghanaki
Andrea Tagliasacchi
Ghassan Hamarneh
Ali Mahdavi-Amiri
VLM
VOS
87
2
0
20 Feb 2025
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
Sili Chen
Hengkai Guo
Shengnan Zhu
Feihu Zhang
Zilong Huang
Jiashi Feng
Bingyi Kang
VLM
AuLLM
MDE
61
10
0
21 Jan 2025
TDM: Temporally-Consistent Diffusion Model for All-in-One Real-World Video Restoration
Yizhou Li
Zihua Liu
Yusuke Monno
Masatoshi Okutomi
DiffM
VGen
26
1
0
04 Jan 2025
Multi-Modality Driven LoRA for Adverse Condition Depth Estimation
Guanglei Yang
Rui Tian
Yongqiang Zhang
Zhun Zhong
Yongqiang Li
Wangmeng Zuo
26
0
0
31 Dec 2024
MAKIMA: Tuning-free Multi-Attribute Open-domain Video Editing via Mask-Guided Attention Modulation
Haoyu Zheng
Wenqiao Zhang
Zheqi Lv
Yu Zhong
Yang Dai
...
Yongliang Shen
Juncheng Billy Li
Dongping Zhang
Siliang Tang
Yueting Zhuang
DiffM
VGen
48
0
0
31 Dec 2024
MetricDepth: Enhancing Monocular Depth Estimation with Deep Metric Learning
Chunpu Liu
Guanglei Yang
Wangmeng Zuo
Tianyi Zan
MDE
41
0
0
31 Dec 2024
UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity
Jingbo Lin
Zhilu Zhang
W. J. Li
Renjing Pei
Hang Xu
Hongzhi Zhang
Wangmeng Zuo
32
0
0
28 Dec 2024
How Panel Layouts Define Manga: Insights from Visual Ablation Experiments
Siyuan Feng
Teruya Yoshinaga
Katsuhiko Hayashi
Koki Washio
Hidetaka Kamigaito
28
0
0
26 Dec 2024
GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance
Jingqiu Zhou
Lue Fan
Xuesong Chen
Linjiang Huang
Si Liu
Hongsheng Li
3DGS
31
0
0
23 Dec 2024
Retrieval Augmented Image Harmonization
Haolin Wang
Ming-Yu Liu
Zifei Yan
Chao Zhou
Longan Xiao
Wangmeng Zuo
62
0
0
18 Dec 2024
Unsupervised Region-Based Image Editing of Denoising Diffusion Models
Z. Li
Yue Song
R. Tao
Xiaohong Jia
Yao Zhao
Wei Wang
DiffM
78
0
0
17 Dec 2024
Generative Inbetweening through Frame-wise Conditions-Driven Video Generation
Tianyi Zhu
Dongwei Ren
Qilong Wang
Xiaohe Wu
W. Zuo
VGen
67
1
0
16 Dec 2024
Video Diffusion Transformers are In-Context Learners
Zhengcong Fei
Di Qiu
Changqian Yu
Debang Li
Mingyuan Fan
VGen
DiffM
142
2
0
14 Dec 2024
UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer
Delong Liu
Zhaohui Hou
Mingjie Zhan
Shihao Han
Zhicheng Zhao
Fei Su
VGen
91
0
0
12 Dec 2024
DiffSign: AI-Assisted Generation of Customizable Sign Language Videos With Enhanced Realism
Sudha Krishnamurthy
Vimal Bhat
Abhinav Jain
DiffM
63
0
0
05 Dec 2024
CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion
Kai He
Chin-Hsuan Wu
Igor Gilitschenski
DiffM
3DGS
68
0
0
02 Dec 2024
1
2
3
4
Next