Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.03011
Cited By
Structure and Content-Guided Video Synthesis with Diffusion Models
6 February 2023
Patrick Esser
Johnathan Chiu
Parmida Atighehchian
Jonathan Granskog
Anastasis Germanidis
DiffM
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Structure and Content-Guided Video Synthesis with Diffusion Models"
50 / 422 papers shown
Title
Hierarchical Patch Diffusion Models for High-Resolution Video Generation
Ivan Skorokhodov
Willi Menapace
Aliaksandr Siarohin
Sergey Tulyakov
VGen
40
10
0
12 Jun 2024
HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness
Zihui Xue
Mi Luo
Changan Chen
Kristen Grauman
DiffM
22
6
0
11 Jun 2024
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction
Zhen Xing
Qi Dai
Zejia Weng
Zuxuan Wu
Yu-Gang Jiang
VGen
39
14
0
10 Jun 2024
GAIA: Rethinking Action Quality Assessment for AI-Generated Videos
Zijian Chen
Wei Sun
Yuan Tian
Jun Jia
Zicheng Zhang
Jiarui Wang
Ru Huang
Xiongkuo Min
Guangtao Zhai
Wenjun Zhang
EGVM
45
8
0
10 Jun 2024
MotionClone: Training-Free Motion Cloning for Controllable Video Generation
Pengyang Ling
Jiazi Bu
Pan Zhang
Xiaoyi Dong
Yuhang Zang
Tong Wu
H. Chen
Jiaqi Wang
Yi Jin
VGen
DiffM
23
34
0
08 Jun 2024
Zero-Shot Video Editing through Adaptive Sliding Score Distillation
Lianghan Zhu
Yanqi Bao
Jing Huo
Jing Wu
Yu-Kun Lai
Wenbin Li
Yang Gao
VGen
23
2
0
07 Jun 2024
Ada-VE: Training-Free Consistent Video Editing Using Adaptive Motion Prior
Tanvir Mahmud
Mustafa Munir
R. Marculescu
Diana Marculescu
VGen
27
0
0
07 Jun 2024
VideoPhy: Evaluating Physical Commonsense for Video Generation
Hritik Bansal
Zongyu Lin
Tianyi Xie
Zeshun Zong
Michal Yarom
Yonatan Bitton
Chenfanfu Jiang
Yizhou Sun
Kai-Wei Chang
Aditya Grover
EGVM
VGen
32
36
0
05 Jun 2024
Searching Priors Makes Text-to-Video Synthesis Better
Haoran Cheng
Liang Peng
Linxuan Xia
Yuepeng Hu
Hengjia Li
Qinglin Lu
Xiaofei He
Boxi Wu
VGen
DiffM
23
0
0
05 Jun 2024
Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
Clement Chadebec
O. Tasar
Eyal Benaroche
Benjamin Aubin
VLM
57
8
0
04 Jun 2024
ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
Tianchen Zhao
Tongcheng Fang
Haofeng Huang
Enshu Liu
Widyadewi Soedarmadji
...
Shengen Yan
Huazhong Yang
Xuefei Ning
Xuefei Ning
Yu Wang
MQ
VGen
97
22
0
04 Jun 2024
pOps: Photo-Inspired Diffusion Operators
Elad Richardson
Yuval Alaluf
Ali Mahdavi-Amiri
Daniel Cohen-Or
VLM
29
3
0
03 Jun 2024
UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
Xiang Wang
Shiwei Zhang
Changxin Gao
Jiayu Wang
Xiaoqiang Zhou
Yingya Zhang
Luxin Yan
Nong Sang
VGen
62
29
0
03 Jun 2024
MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion
Shuyuan Tu
Qi Dai
Zihao Zhang
Sicheng Xie
Zhi-Qi Cheng
Chong Luo
Xintong Han
Zuxuan Wu
Yu-Gang Jiang
DiffM
VGen
31
10
0
30 May 2024
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark
Haoxing Chen
Yan Hong
Zizheng Huang
Zhuoer Xu
Zhangxuan Gu
...
Jun Lan
Huijia Zhu
Jianfu Zhang
Weiqiang Wang
Huaxiong Li
Mamba
80
13
0
30 May 2024
T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback
Jiachen Li
Weixi Feng
Tsu-jui Fu
Xinyi Wang
Sugato Basu
Wenhu Chen
William Yang Wang
VGen
29
27
0
29 May 2024
VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation
Qilin Wang
Zhengkai Jiang
Chengming Xu
Jiangning Zhang
Yabiao Wang
Xinyi Zhang
Yunkang Cao
Weijian Cao
Chengjie Wang
Yanwei Fu
VGen
19
9
0
28 May 2024
Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control
Zhengfei Kuang
Shengqu Cai
Hao He
Yinghao Xu
Hongsheng Li
Leonidas J. Guibas
Gordon Wetzstein
VGen
DiffM
35
30
0
27 May 2024
Memory-efficient High-resolution OCT Volume Synthesis with Cascaded Amortized Latent Diffusion Models
Kun Huang
Xiao Ma
Yuhan Zhang
Na Su
Songtao Yuan
Yong Liu
Qiang Chen
Huazhu Fu
MedIm
DiffM
30
3
0
26 May 2024
User-Friendly Customized Generation with Multi-Modal Prompts
Linhao Zhong
Yan Hong
Wentao Chen
Binglin Zhou
Yiyi Zhang
Jianfu Zhang
Liqing Zhang
DiffM
35
0
0
26 May 2024
Looking Backward: Streaming Video-to-Video Translation with Feature Banks
Feng Liang
Akio Kodaira
Chenfeng Xu
M. Tomizuka
Kurt Keutzer
Diana Marculescu
DiffM
VGen
67
7
0
24 May 2024
PoseCrafter: One-Shot Personalized Video Synthesis Following Flexible Pose Control
Yong Zhong
Min Zhao
Zebin You
Xiaofeng Yu
Changwang Zhang
Chongxuan Li
DiffM
29
6
0
23 May 2024
Text Prompting for Multi-Concept Video Customization by Autoregressive Generation
D. Kothandaraman
Kihyuk Sohn
Ruben Villegas
P. Voigtlaender
Dinesh Manocha
Mohammad Babaeizadeh
VGen
DiffM
30
2
0
22 May 2024
Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices
Nathaniel Cohen
Vladimir Kulikov
Matan Kleiner
Inbar Huberman-Spiegelglas
T. Michaeli
VGen
DiffM
24
15
0
20 May 2024
The Lost Melody: Empirical Observations on Text-to-Video Generation From A Storytelling Perspective
Andrew Shin
Yusuke Mori
Kunitake Kaneko
VGen
EGVM
16
2
0
13 May 2024
A Survey on Personalized Content Synthesis with Diffusion Models
Xu-Lu Zhang
Xiao Wei
Wengyu Zhang
Jinlin Wu
Zhaoxiang Zhang
Zhen Lei
Qing Li
Zhen Lei
Qing Li
EGVM
131
19
0
09 May 2024
Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos
Junyi Ma
Jingyi Xu
Xieyuanli Chen
Hesheng Wang
VGen
27
7
0
07 May 2024
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Zheng Zhu
Xiaofeng Wang
Wangbo Zhao
Chen Min
Nianchen Deng
...
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Jiwen Lu
Guan Huang
VGen
LM&Ro
79
35
0
06 May 2024
Video Diffusion Models: A Survey
Andrew Melnik
Michal Ljubljanac
Cong Lu
Qi Yan
Weiming Ren
Helge J. Ritter
VGen
63
12
0
06 May 2024
TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models
Teng Zhou
Yongchuan Tang
DiffM
35
2
0
30 Apr 2024
V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection
Xuanyu Zhang
You-song Xu
Runyi Li
Jiwen Yu
Weiqi Li
Zhipei Xu
Jian Andrew Zhang
VGen
36
16
0
25 Apr 2024
AudioScenic: Audio-Driven Video Scene Editing
Kaixin Shen
Ruijie Quan
Linchao Zhu
Jun Xiao
Yi Yang
VGen
DiffM
29
1
0
25 Apr 2024
Learning Long-form Video Prior via Generative Pre-Training
Jinheng Xie
Jiajun Feng
Zhaoxu Tian
Kevin Qinghong Lin
Yawen Huang
...
Nanxu Gong
Xu Zuo
Jiaqi Yang
Yefeng Zheng
Mike Zheng Shou
25
6
0
24 Apr 2024
Zero-shot High-fidelity and Pose-controllable Character Animation
Bingwen Zhu
Fanyi Wang
Tianyi Lu
Peng Liu
Jingwen Su
Jinxiu Liu
Yanhao Zhang
Zuxuan Wu
Guo-Jun Qi
Yu-Gang Jiang
DiffM
VGen
47
6
0
21 Apr 2024
GenVideo: One-shot Target-image and Shape Aware Video Editing using T2I Diffusion Models
Sai Sree Harsha
Ambareesh Revanur
Dhwanit Agarwal
Shradha Agrawal
VGen
DiffM
21
3
0
18 Apr 2024
COMBO: Compositional World Models for Embodied Multi-Agent Cooperation
Hongxin Zhang
Zeyuan Wang
Qiushi Lyu
Zheyuan Zhang
Sunli Chen
Tianmin Shu
Yilun Du
Kwonjoon Lee
Yilun Du
Chuang Gan
41
12
0
16 Apr 2024
GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis
S. Sastry
Subash Khanal
A. Dhakal
Nathan Jacobs
44
6
0
09 Apr 2024
AnimateZoo: Zero-shot Video Generation of Cross-Species Animation via Subject Alignment
Yuanfeng Xu
Yuhao Chen
Zhongzhan Huang
Zijian He
Guangrun Wang
Philip H. S. Torr
Liang Lin
VGen
18
1
0
07 Apr 2024
A Unified Diffusion Framework for Scene-aware Human Motion Estimation from Sparse Signals
Jiangnan Tang
Jingya Wang
Kaiyang Ji
Lan Xu
Jingyi Yu
Ye-ling Shi
26
5
0
07 Apr 2024
LidarDM: Generative LiDAR Simulation in a Generated World
Vlas Zyrianov
Henry Che
Zhijian Liu
Shenlong Wang
VGen
25
20
0
03 Apr 2024
Motion Inversion for Video Customization
Luozhou Wang
Guibao Shen
Yixun Liang
Xin Tao
Pengfei Wan
Di Zhang
Yijun Li
Yingcong Chen
VGen
DiffM
34
7
0
29 Mar 2024
Boosting Diffusion Models with Moving Average Sampling in Frequency Domain
Yurui Qian
Qi Cai
Yingwei Pan
Yehao Li
Ting Yao
Qibin Sun
Tao Mei
DiffM
32
18
0
26 Mar 2024
Opportunities and challenges in the application of large artificial intelligence models in radiology
Liangrui Pan
Zhenyu Zhao
Ying Lu
Kewei Tang
Liyong Fu
Qingchun Liang
Shaoliang Peng
LM&MA
MedIm
AI4CE
37
5
0
24 Mar 2024
Explorative Inbetweening of Time and Space
Haiwen Feng
Zheng Ding
Zhihao Xia
Simon Niklaus
Victoria Fernandez-Abrevaya
Michael J. Black
Xuaner Zhang
DiffM
VGen
29
5
0
21 Mar 2024
AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks
Max W.F. Ku
Cong Wei
Weiming Ren
Huan Yang
Wenhu Chen
VGen
DiffM
67
21
0
21 Mar 2024
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Roberto Henschel
Levon Khachatryan
Daniil Hayrapetyan
Hayk Poghosyan
Vahram Tadevosyan
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
DiffM
VGen
91
77
0
21 Mar 2024
FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
Shuai Yang
Yifan Zhou
Ziwei Liu
Chen Change Loy
VGen
DiffM
52
23
0
19 Mar 2024
AnimateDiff-Lightning: Cross-Model Diffusion Distillation
Shanchuan Lin
Xiao Yang
DiffM
VGen
27
18
0
19 Mar 2024
CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility
Bojia Zi
Shihao Zhao
Xianbiao Qi
Jianan Wang
Yukai Shi
Qianyu Chen
Bin Liang
Kam-Fai Wong
Lei Zhang
DiffM
VGen
24
15
0
18 Mar 2024
Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation
Axel Sauer
Frederic Boesel
Tim Dockhorn
A. Blattmann
Patrick Esser
Robin Rombach
DiffM
18
104
0
18 Mar 2024
Previous
1
2
3
4
5
6
7
8
9
Next