Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2311.04145
Cited By
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
7 November 2023
Shiwei Zhang
Jiayu Wang
Yingya Zhang
Kang Zhao
Hangjie Yuan
Z. Qin
Xiang Wang
Deli Zhao
Jingren Zhou
DiffM
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models"
50 / 155 papers shown
Title
Neuro-Symbolic Evaluation of Text-to-Video Models using Formal Verification
Sundar Sripada V. S.
Minkyu Choi
Sahil Shah
Harsh Goel
Mohammad Omama
Sandeep P. Chinchali
EGVM
105
2
0
22 Nov 2024
MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
Weijia Wu
Mingyu Liu
Zeyu Zhu
Xi Xia
Haoen Feng
Wen Wang
Kevin Qinghong Lin
Chunhua Shen
Mike Zheng Shou
DiffM
VGen
114
1
0
22 Nov 2024
StableV2V: Stablizing Shape Consistency in Video-to-Video Editing
Chang-Shu Liu
Rui Li
Kaidong Zhang
Yunwei Lan
Dong Liu
DiffM
VGen
52
3
0
17 Nov 2024
FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations
Hmrishav Bandyopadhyay
Yi-Zhe Song
DiffM
VGen
28
3
0
16 Nov 2024
I2VControl-Camera: Precise Video Camera Control with Adjustable Motion Strength
Wanquan Feng
Jiawei Liu
Pengqi Tu
Tianhao Qi
Mingzhen Sun
Tianxiang Ma
Songtao Zhao
Siyu Zhou
Qian He
VGen
47
7
0
10 Nov 2024
ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning
David Junhao Zhang
Roni Paiss
Shiran Zada
Nikhil Karnad
David E. Jacobs
Yael Pritch
Inbar Mosseri
Mike Zheng Shou
Neal Wadhwa
Nataniel Ruiz
DiffM
VGen
66
15
0
07 Nov 2024
StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration
Panwen Hu
Jin Jiang
Jianqi Chen
Mingfei Han
Shengcai Liao
Xiaojun Chang
Xiaodan Liang
VGen
DiffM
28
5
0
07 Nov 2024
TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation
Wenhao Wang
Y. Yang
VGen
40
3
0
05 Nov 2024
Fast and Memory-Efficient Video Diffusion Using Streamlined Inference
Zheng Zhan
Yushu Wu
Yifan Gong
Zichong Meng
Zhenglun Kong
Changdi Yang
Geng Yuan
Pu Zhao
Wei Niu
Yanzhi Wang
VGen
31
4
0
02 Nov 2024
Extrapolating Prospective Glaucoma Fundus Images through Diffusion Model in Irregular Longitudinal Sequences
Zhihao Zhao
Junjie Yang
Shahrooz Faghihroohi
Yinzheng Zhao
Daniel Zapp
Kai-Qi Huang
Nassir Navab
M. A. Nasseri
DiffM
MedIm
49
0
0
28 Oct 2024
FrameBridge: Improving Image-to-Video Generation with Bridge Models
Yuji Wang
Zehua Chen
Xiaoyu Chen
Jun-Jie Zhu
Jianfei Chen
DiffM
VGen
75
1
0
20 Oct 2024
DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
Yujie Wei
Shiwei Zhang
Hangjie Yuan
Xiang Wang
Haonan Qiu
...
F. Liu
Zhizhong Huang
Jiaxin Ye
Yingya Zhang
Hongming Shan
DiffM
VGen
67
14
0
17 Oct 2024
Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
Hancheng Ye
Jiakang Yuan
Renqiu Xia
Xiangchao Yan
Tao Chen
Junchi Yan
Botian Shi
Bo Zhang
DiffM
21
1
0
13 Oct 2024
TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation
Gihyun Kwon
Jong Chul Ye
DiffM
51
3
0
08 Oct 2024
Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach
Yaofang Liu
Y. Ren
Xiaodong Cun
Aitor Artola
Yang Liu
Tieyong Zeng
Raymond H. Chan
Jean-Michel Morel
VGen
DiffM
43
2
0
04 Oct 2024
AVID: Adapting Video Diffusion Models to World Models
Marc Rigter
Tarun Gupta
Agrin Hilmkil
Chao Ma
VGen
17
2
0
01 Oct 2024
FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing
Lingling Cai
Kang Zhao
Hangjie Yuan
Yingya Zhang
Shiwei Zhang
Kejie Huang
VGen
16
0
0
30 Sep 2024
Replace Anyone in Videos
Xiang Wang
Shiwei Zhang
Haonan Qiu
Ruihang Chu
Zekun Li
Y. Zhang
Changxin Gao
Yuehuan Wang
Chunhua Shen
Nong Sang
VGen
DiffM
64
1
0
30 Sep 2024
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation
Shaowei Liu
Zhongzheng Ren
Saurabh Gupta
Shenlong Wang
VGen
DiffM
PINN
37
33
0
27 Sep 2024
MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling
Yifang Men
Yuan Yao
Miaomiao Cui
Liefeng Bo
DiffM
16
16
0
24 Sep 2024
Dormant: Defending against Pose-driven Human Image Animation
Jiachen Zhou
Mingsi Wang
Tianlin Li
Guozhu Meng
Kai Chen
42
3
0
22 Sep 2024
MotionCom: Automatic and Motion-Aware Image Composition with LLM and Video Diffusion Prior
Weijing Tao
Xiaofeng Yang
Miaomiao Cui
Guosheng Lin
DiffM
26
1
0
16 Sep 2024
Training-free Long Video Generation with Chain of Diffusion Model Experts
Wenhao Li
Yichao Cao
Xiu Su
Xi Lin
Shan You
Mingkai Zheng
Yi Chen
Chang Xu
VGen
DiffM
43
0
0
24 Aug 2024
EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation
Cong Wang
Jiaxi Gu
Panwen Hu
Haoyu Zhao
Yuanfan Guo
J. N. Han
Hang Xu
Xiaodan Liang
VGen
DiffM
24
3
0
23 Aug 2024
TrackGo: A Flexible and Efficient Method for Controllable Video Generation
Haitao Zhou
Chuang Wang
Rui Nie
Jinxiao Lin
Dongdong Yu
Qian Yu
Changhu Wang
VGen
DiffM
46
14
0
21 Aug 2024
Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data
Tao Yang
Yangming Shi
Yunwen Huang
Feng Chen
Yin Zheng
Lei Zhang
DiffM
VGen
59
0
0
19 Aug 2024
UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model
Zhaowei Li
Wei Wang
Yiqing Cai
Xu Qi
Pengyu Wang
Dong Zhang
Hang Song
Botian Jiang
Zhida Huang
Tao Wang
AIFin
LRM
35
3
0
05 Aug 2024
Tora: Trajectory-oriented Diffusion Transformer for Video Generation
Zhenghao Zhang
Junchao Liao
Menghao Li
Zuozhuo Dai
Bingxue Qiu
Hao Hu
Shaowei Cai
Weizhi Wang
VGen
42
41
0
31 Jul 2024
Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions
Ashkan Taghipour
Morteza Ghahremani
Bennamoun
Aref Miri Rekavandi
Zinuo Li
Hamid Laga
F. Boussaïd
VGen
68
2
0
27 Jul 2024
Multi-sentence Video Grounding for Long Video Generation
Wei Feng
Xin Wang
Hong Chen
Zeyang Zhang
Wenwu Zhu
DiffM
24
0
0
18 Jul 2024
Towards Understanding Unsafe Video Generation
Yan Pang
Aiping Xiong
Yang Zhang
Tianhao Wang
EGVM
27
2
0
17 Jul 2024
Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models
Qinyu Yang
Haoxin Chen
Yong Zhang
Menghan Xia
Xiaodong Cun
Zhixun Su
Ying Shan
DiffM
21
1
0
14 Jul 2024
MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions
Xuan Ju
Yiming Gao
Zhaoyang Zhang
Ziyang Yuan
Xintao Wang
Ailing Zeng
Yu Xiong
Qiang Xu
Ying Shan
VGen
61
36
0
08 Jul 2024
Diffusion Model-Based Video Editing: A Survey
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Dacheng Tao
VGen
55
20
0
26 Jun 2024
Text-Animator: Controllable Visual Text Video Generation
Lin Liu
Quande Liu
Shengju Qian
Yuan Zhou
Wengang Zhou
Houqiang Li
Lingxi Xie
Qi Tian
VGen
25
1
0
25 Jun 2024
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Junbang Liang
Ruoshi Liu
Ege Ozguroglu
Sruthi Sudhakar
Achal Dave
P. Tokmakov
Shuran Song
Carl Vondrick
VGen
35
22
0
24 Jun 2024
EVALALIGN: Supervised Fine-Tuning Multimodal LLMs with Human-Aligned Data for Evaluating Text-to-Image Models
Zhiyu Tan
Xiaomeng Yang
Luozheng Qin
Mengping Yang
Cheng Zhang
Hao Li
42
7
0
24 Jun 2024
MVOC: a training-free multiple video object composition method with diffusion models
Wei Wang
Yaosen Chen
Yuegen Liu
Qi Yuan
Shubin Yang
Yanru Zhang
DiffM
60
2
0
22 Jun 2024
Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model
Min Zhao
Hongzhou Zhu
Chendong Xiang
Kaiwen Zheng
Chongxuan Li
Jun Zhu
61
8
0
22 Jun 2024
ViD-GPT: Introducing GPT-style Autoregressive Generation in Video Diffusion Models
Kaifeng Gao
Jiaxin Shi
Hanwang Zhang
Chunping Wang
Jun Xiao
DiffM
VGen
62
11
0
16 Jun 2024
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability,Reproducibility, and Practicality
Tianle Zhang
Langtian Ma
Yuchen Yan
Yuchen Zhang
Kai Wang
...
Wenqi Shao
Yang You
Yu Qiao
Ping Luo
Kaipeng Zhang
VGen
58
2
0
13 Jun 2024
Vivid-ZOO: Multi-View Video Generation with Diffusion Model
Bing Li
Cheng Zheng
Wenxuan Zhu
Jinjie Mai
Biao Zhang
Peter Wonka
Bernard Ghanem
40
16
0
12 Jun 2024
Pandora: Towards General World Model with Natural Language Actions and Video States
Jiannan Xiang
Guangyi Liu
Yi Gu
Qiyue Gao
Yuting Ning
...
Shibo Hao
Yemin Shi
Zhengzhong Liu
Eric P. Xing
Zhiting Hu
VGen
54
35
0
12 Jun 2024
CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion
Xingrui Wang
Xin Li
Zhibo Chen
DiffM
42
1
0
07 Jun 2024
SF-V: Single Forward Video Generation Model
Zhixing Zhang
Yanyu Li
Yushu Wu
Yanwu Xu
Anil Kag
...
Aliaksandr Siarohin
Junli Cao
Dimitris N. Metaxas
Sergey Tulyakov
Jian Ren
DiffM
VGen
25
1
0
06 Jun 2024
UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
Xiang Wang
Shiwei Zhang
Changxin Gao
Jiayu Wang
Xiaoqiang Zhou
Yingya Zhang
Luxin Yan
Nong Sang
VGen
62
29
0
03 Jun 2024
VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via Diffusion Transformers
Jun Zheng
Fuwei Zhao
Youjiang Xu
Xin Dong
Xiaodan Liang
VGen
DiffM
26
5
0
28 May 2024
ToonCrafter: Generative Cartoon Interpolation
Jinbo Xing
Hanyuan Liu
Menghan Xia
Yong Zhang
Xintao Wang
Ying Shan
Tien-Tsin Wong
29
26
0
28 May 2024
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
48
75
0
27 May 2024
Controllable Longer Image Animation with Diffusion Models
Qiang Wang
Minghua Liu
Junjun Hu
Fan Jiang
Mu Xu
VGen
25
0
0
27 May 2024
Previous
1
2
3
4
Next