Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2204.03458
Cited By
v1
v2 (latest)
Video Diffusion Models
Neural Information Processing Systems (NeurIPS), 2022
7 April 2022
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffM
VGen
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (5 upvotes)
Papers citing
"Video Diffusion Models"
50 / 1,543 papers shown
A Sampling-Based Domain Generalization Study with Diffusion Generative Models
Ye Zhu
Yu Wu
Duo Xu
Zhiwei Deng
Yan Yan
Olga Russakovsky
DiffM
348
1
0
13 Oct 2023
Compositional Abilities Emerge Multiplicatively: Exploring Diffusion Models on a Synthetic Task
Neural Information Processing Systems (NeurIPS), 2023
Maya Okawa
Ekdeep Singh Lubana
Robert P. Dick
Hidenori Tanaka
CoGe
DiffM
501
84
0
13 Oct 2023
Learning to Act from Actionless Videos through Dense Correspondences
International Conference on Learning Representations (ICLR), 2023
Po-Chen Ko
Jiayuan Mao
Yilun Du
Shao-Hua Sun
Josh Tenenbaum
336
158
0
12 Oct 2023
Consistent123: Improve Consistency for One Image to 3D Object Synthesis
Haohan Weng
Tianyu Yang
Jianan Wang
Yu Li
Tong Zhang
Chong Chen
Lei Zhang
DiffM
220
85
0
12 Oct 2023
Efficient Integrators for Diffusion Generative Models
International Conference on Learning Representations (ICLR), 2023
Kushagra Pandey
Maja R. Rudolph
Stephan Mandt
DiffM
191
12
0
11 Oct 2023
ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation
AAAI Conference on Artificial Intelligence (AAAI), 2023
Bo Peng
Xinyuan Chen
Yaohui Wang
Chaochao Lu
Yu Qiao
DiffM
VGen
226
7
0
11 Oct 2023
State of the Art on Diffusion Models for Visual Computing
Ryan Po
Wang Yifan
Vladislav Golyanik
Kfir Aberman
Jonathan T. Barron
...
Matthias Nießner
Bjorn Ommer
Christian Theobalt
Peter Wonka
Gordon Wetzstein
284
154
0
11 Oct 2023
Echocardiography video synthesis from end diastolic semantic map via diffusion model
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Nguyen Van Phi
Tran Minh Duc
Hieu H. Pham
Tran Quoc Long
DiffM
MedIm
VGen
182
9
0
11 Oct 2023
Latent Diffusion Model for DNA Sequence Generation
Zehui Li
Yuhao Ni
Tim August B. Huygelen
Akashaditya Das
Guoxuan Xia
Guy-Bart Stan
Yiren Zhao
180
15
0
09 Oct 2023
Learning Interactive Real-World Simulators
International Conference on Learning Representations (ICLR), 2023
Mengjiao Yang
Yilun Du
Kamyar Ghasemipour
Jonathan Tompson
Leslie Kaelbling
Dale Schuurmans
Pieter Abbeel
LM&Ro
PINN
350
335
0
09 Oct 2023
FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
International Conference on Learning Representations (ICLR), 2023
Yuren Cong
Mengmeng Xu
Christian Simon
Shoufa Chen
Jiawei Ren
Yanping Xie
Juan-Manuel Perez-Rua
Bodo Rosenhahn
Tao Xiang
Sen He
DiffM
VGen
341
144
0
09 Oct 2023
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Lijun Yu
José Lezama
N. B. Gundavarapu
Luca Versari
Kihyuk Sohn
...
Boqing Gong
Ming-Hsuan Yang
Irfan Essa
David A. Ross
Lu Jiang
440
530
0
09 Oct 2023
IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts
International Conference on Learning Representations (ICLR), 2023
Bo-Wen Zeng
Shanglin Li
Yutang Feng
Ling Yang
Hong Li
...
Conghui He
Wentao Zhang
Jianzhuang Liu
Baochang Zhang
Shuicheng Yan
DiffM
292
7
0
09 Oct 2023
VoiceExtender: Short-utterance Text-independent Speaker Verification with Guided Diffusion Model
Automatic Speech Recognition & Understanding (ASRU), 2023
Yayun He
Zuheng Kang
Jianzong Wang
Junqing Peng
Jing Xiao
DiffM
184
3
0
07 Oct 2023
Aligning Text-to-Image Diffusion Models with Reward Backpropagation
Mihir Prabhudesai
Anirudh Goyal
Deepak Pathak
Katerina Fragkiadaki
471
209
0
05 Oct 2023
Stochastic interpolants with data-dependent couplings
International Conference on Machine Learning (ICML), 2023
M. S. Albergo
Mark Goldstein
Nicholas M. Boffi
Rajesh Ranganath
Eric Vanden-Eijnden
OT
284
64
0
05 Oct 2023
MedSyn: Text-guided Anatomy-aware Synthesis of High-Fidelity 3D CT Images
IEEE Transactions on Medical Imaging (TMI), 2023
Yanwu Xu
Li Sun
Wei Peng
Shyam Visweswaran
Kayhan Batmanghelich
MedIm
DiffM
362
47
0
05 Oct 2023
Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Anton Razzhigaev
Arseniy Shakhmatov
Anastasia Maltseva
V.Ya. Arkhipkin
Igor Pavlov
Ilya Ryabov
Angelina Kuts
Sergey Petrakov
Andrey Kuznetsov
Denis Dimitrov
334
121
0
05 Oct 2023
EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models
International Conference on Learning Representations (ICLR), 2023
Yefei He
Jing Liu
Weijia Wu
Hong Zhou
Bohan Zhuang
DiffM
MQ
530
70
0
05 Oct 2023
Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models
IEEE International Conference on Computer Vision (ICCV), 2023
Huaijin Pi
Sida Peng
Minghui Yang
Xiaowei Zhou
Hujun Bao
DiffM
206
45
0
03 Oct 2023
Score-based Data Assimilation for a Two-Layer Quasi-Geostrophic Model
Sacha Lewin
Gilles Louppe
267
12
0
03 Oct 2023
Sequential Data Generation with Groupwise Diffusion Process
Sangyun Lee
Gayoung Lee
Hyunsung Kim
Junho Kim
Youngjung Uh
DiffM
331
4
0
02 Oct 2023
FashionFlow: Leveraging Diffusion Models for Dynamic Fashion Video Synthesis from Static Imagery
Tasin Islam
A. Miron
Xiaohui Liu
Yongmin Li
DiffM
286
8
0
29 Sep 2023
GAIA-1: A Generative World Model for Autonomous Driving
Masane Fuchi
Lloyd Russell
Hudson Yeo
Zak Murez
Hiroto Minami
Alex Kendall
Tomohiro Takagi
Gianluca Corrado
VGen
399
434
0
29 Sep 2023
AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation
European Conference on Computer Vision (ECCV), 2023
Shengkun Tang
Yaqing Wang
Maksim Dzhigil
Yi Liang
Yongbin Li
Dongkuan Xu
255
9
0
29 Sep 2023
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
International Journal of Computer Vision (IJCV), 2023
David Junhao Zhang
Jay Zhangjie Wu
Jia-Wei Liu
Rui Zhao
L. Ran
Yuchao Gu
Difei Gao
Mike Zheng Shou
DiffM
VGen
627
296
0
27 Sep 2023
Warfare:Breaking the Watermark Protection of AI-Generated Content
Guanlin Li
Yifei Chen
Jie Zhang
Shangwei Guo
Shangwei Guo
Tianwei Zhang
Jiwei Li
Tianwei Zhang
WIGM
285
6
0
27 Sep 2023
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models
International Journal of Computer Vision (IJCV), 2023
Yaohui Wang
Xinyuan Chen
Xin Ma
Shangchen Zhou
Ziqi Huang
...
Chen Change Loy
Bo Dai
Dahua Lin
Yu Qiao
Ziwei Liu
VGen
DiffM
251
321
0
26 Sep 2023
A Simple Text to Video Model via Transformer
Gang Chen
ViT
77
1
0
26 Sep 2023
GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER
Neural Information Processing Systems (NeurIPS), 2023
Mingzhen Sun
Weining Wang
Zihan Qin
Jiahui Sun
Si-Qing Chen
Qingbin Liu
DiffM
163
5
0
23 Sep 2023
A Diffusion-Model of Joint Interactive Navigation
Neural Information Processing Systems (NeurIPS), 2023
Matthew Niedoba
J. Lavington
Yunpeng Liu
Vasileios Lioutas
Justice Sefas
...
Dylan Green
Setareh Dabiri
Berend Zwartsenberg
Adam Scibior
Frank Wood
DiffM
247
17
0
21 Sep 2023
DreamLLM: Synergistic Multimodal Comprehension and Creation
International Conference on Learning Representations (ICLR), 2023
Runpei Dong
Chunrui Han
Yuang Peng
Zekun Qi
Zheng Ge
...
Hao-Ran Wei
Xiangwen Kong
Xiangyu Zhang
Kaisheng Ma
Li Yi
MLLM
299
275
0
20 Sep 2023
A Generative Framework for Self-Supervised Facial Representation Learning
Ruian He
Zhen Xing
Weimin Tan
Bo Yan
DiffM
343
0
0
15 Sep 2023
VRDMG: Vocal Restoration via Diffusion Posterior Sampling with Multiple Guidance
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Carlos Hernandez-Olivan
Koichi Saito
Naoki Murata
Chieh-Hsin Lai
Marco A. Martínez-Ramírez
Wei-Hsiang Liao
Yuki Mitsufuji
DiffM
179
11
0
13 Sep 2023
Adapt and Diffuse: Sample-adaptive Reconstruction via Latent Diffusion Models
International Conference on Machine Learning (ICML), 2023
Zalan Fabian
Berk Tınaz
Mahdi Soltanolkotabi
DiffM
253
7
0
12 Sep 2023
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation
International Conference on Learning Representations (ICLR), 2023
Xingchao Liu
Xiwen Zhang
Jianzhu Ma
Jian Peng
Qiang Liu
602
312
0
12 Sep 2023
SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models
Neural Information Processing Systems (NeurIPS), 2023
Shuchen Xue
Mingyang Yi
Weijian Luo
Shifeng Zhang
Jiacheng Sun
Hao Sun
Zhi-Ming Ma
DiffM
540
68
0
10 Sep 2023
Variations and Relaxations of Normalizing Flows
Keegan Kelly
Lorena Piedras
Sukrit Rao
David Samuel Roth
BDL
275
2
0
08 Sep 2023
The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion
IEEE International Conference on Computer Vision (ICCV), 2023
Yujin Jeong
Won-Wha Ryoo
Seunghyun Lee
Dabin Seo
Wonmin Byeon
Sangpil Kim
Jinkyu Kim
DiffM
175
39
0
08 Sep 2023
SMPLitex: A Generative Model and Dataset for 3D Human Texture Estimation from Single Image
British Machine Vision Conference (BMVC), 2023
Dan Casas
M. C. Trinidad
3DH
3DGS
301
30
0
04 Sep 2023
Benchmarking Autoregressive Conditional Diffusion Models for Turbulent Flow Simulation
Georg Kohl
Li-Wei Chen
Nils Thuerey
AI4CE
DiffM
353
53
0
04 Sep 2023
MagicProp: Diffusion-based Video Editing via Motion-aware Appearance Propagation
Hanshu Yan
Jun Hao Liew
Long Mai
Shanchuan Lin
Jiashi Feng
VGen
DiffM
199
19
0
02 Sep 2023
RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Fengxiang Bie
Jianlong Wu
Zhongzhu Zhou
Adam Ghanem
Minjia Zhang
...
Pareesa Ameneh Golnari
David A. Clifton
Yuxiong He
Dacheng Tao
Shuaiwen Leon Song
EGVM
256
58
0
02 Sep 2023
VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation
Xin Li
Wenqing Chu
Ye Wu
Weihang Yuan
Fanglong Liu
Tao Gui
Fu Li
Haocheng Feng
Errui Ding
Jingdong Wang
VGen
332
70
0
01 Sep 2023
StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video Generation
IEEE International Conference on Computer Vision (ICCV), 2023
Yuhan Wang
Liming Jiang
Chen Change Loy
VGen
239
18
0
31 Aug 2023
Vision-Based Traffic Accident Detection and Anticipation: A Survey
Jianwu Fang
Jiahuan Qiao
Jianru Xue
Zhengguo Li
208
69
0
30 Aug 2023
Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs
Computer Vision and Pattern Recognition (CVPR), 2023
Hao Fei
Shengqiong Wu
Wei Ji
Hanwang Zhang
Tat-Seng Chua
VGen
DiffM
220
45
0
26 Aug 2023
DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion
Se Jin Park
Joanna Hong
Minsu Kim
Y. Ro
255
4
0
23 Aug 2023
StoryBench: A Multifaceted Benchmark for Continuous Story Visualization
Neural Information Processing Systems (NeurIPS), 2023
Emanuele Bugliarello
Hernan Moraldo
Ruben Villegas
Mohammad Babaeizadeh
M. Saffar
Han Zhang
D. Erhan
V. Ferrari
Pieter-Jan Kindermans
P. Voigtlaender
VGen
338
16
0
22 Aug 2023
Convergence guarantee for consistency models
Junlong Lyu
Zhitang Chen
Shoubo Feng
DiffM
155
5
0
22 Aug 2023
Previous
1
2
3
...
23
24
25
...
29
30
31
Next
Page 24 of 31
Page
of 31
Go