Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1904.12165
Cited By
Improved Conditional VRNNs for Video Prediction
IEEE International Conference on Computer Vision (ICCV), 2019
27 April 2019
Lluis Castrejon
Nicolas Ballas
Aaron Courville
VGen
DRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Improved Conditional VRNNs for Video Prediction"
50 / 114 papers shown
Title
STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows
Jiatao Gu
Ying Shen
Tianrong Chen
Laurent Dinh
Y. Wang
Miguel Angel Bautista
David Berthelot
Josh Susskind
Shuangfei Zhai
DiffM
VGen
282
3
0
25 Nov 2025
Sequence-Adaptive Video Prediction in Continuous Streams using Diffusion Noise Optimization
Sina Mokhtarzadeh Azar
Emad Bahrami
Enrico Pallotta
Gianpiero Francesca
Radu Timofte
Juergen Gall
DiffM
108
0
0
23 Nov 2025
Video Prediction of Dynamic Physical Simulations With Pixel-Space Spatiotemporal Transformers
IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2025
Dean L. Slack
G. Hudson
T. Winterbottom
Noura Al Moubayed
130
0
0
23 Oct 2025
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos
Rundong Luo
Matthew Wallingford
Ali Farhadi
Noah Snavely
Wei-Chiu Ma
VGen
382
6
0
10 Apr 2025
Unified Arbitrary-Time Video Frame Interpolation and Prediction
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Xin Jin
Longhai Wu
Jie Chen
Ilhyun Cho
Cheul-hee Hahm
251
1
0
04 Mar 2025
Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers
Computer Vision and Pattern Recognition (CVPR), 2025
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
347
3
0
14 Jan 2025
DINO-Foresight: Looking into the Future with DINO
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
AI4CE
592
14
0
16 Dec 2024
Efficient Continuous Video Flow Model for Video Prediction
Gaurav Shrivastava
Abhinav Shrivastava
VGen
205
0
0
07 Dec 2024
Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction
Gaurav Shrivastava
Abhinav Shrivastava
VGen
DiffM
256
0
0
06 Dec 2024
Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Xiang Deng
Youxin Pang
Xiaochen Zhao
Chao Xu
Lizhen Wang
Hongjiang Xiao
Shi Yan
Hongwen Zhang
Yebin Liu
DiffM
VGen
219
3
0
31 Oct 2024
Motion Graph Unleashed: A Novel Approach to Video Prediction
Neural Information Processing Systems (NeurIPS), 2024
Yiqi Zhong
Luming Liang
Bohan Tang
Ilya Zharkov
Ulrich Neumann
297
4
0
29 Oct 2024
Masked Autoregressive Model for Weather Forecasting
Doyi Kim
Minseok Seo
Hakjin Lee
Junghoon Seo
206
1
0
30 Sep 2024
Blox-Net: Generative Design-for-Robot-Assembly Using VLM Supervision, Physics Simulation, and a Robot with Reset
IEEE International Conference on Robotics and Automation (ICRA), 2024
Andrew Goldberg
Kavish Kondap
Tianshuang Qiu
Zehan Ma
Letian Fu
Justin Kerr
Huang Huang
Kaiyuan Chen
Kuan Fang
Ken Goldberg
189
13
0
25 Sep 2024
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos
Jiahe Liu
Youran Qu
Qi Yan
Fangyin Wei
Lele Wang
Renjie Liao
VGen
EGVM
305
28
0
23 Jul 2024
The Power of Next-Frame Prediction for Learning Physical Laws
T. Winterbottom
G. Hudson
Daniel Kluvanec
Dean L. Slack
Jamie Sterling
Junjie Shentu
Chenghao Xiao
Zheming Zhou
Noura Al Moubayed
199
3
0
21 May 2024
On the Content Bias in Fréchet Video Distance
Jason S. Hoffman
Aniruddha Mahapatra
Gaurav Parmar
Jun-Yan Zhu
Jia-Bin Huang
EGVM
232
32
0
18 Apr 2024
Action-conditioned video data improves predictability
Meenakshi Sarkar
Debasish Ghose
VGen
295
0
0
08 Apr 2024
A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming
Pengyuan Zhou
Lin Wang
Zhi Liu
Yanbin Hao
Pan Hui
Sasu Tarkoma
J. Kangasharju
VGen
244
46
0
30 Jan 2024
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
...
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
VGen
933
1,923
0
25 Nov 2023
Breathing Life Into Sketches Using Text-to-Video Priors
Computer Vision and Pattern Recognition (CVPR), 2023
Rinon Gal
Yael Vinker
Yuval Alaluf
Amit H. Bermano
Daniel Cohen-Or
Ariel Shamir
Gal Chechik
VGen
DiffM
178
47
0
21 Nov 2023
Triplet Attention Transformer for Spatiotemporal Predictive Learning
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Xuesong Nie
Xi Chen
Haoyuan Jin
Zhihang Zhu
Yunfeng Yan
Donglian Qi
ViT
156
15
0
28 Oct 2023
HyperSINDy: Deep Generative Modeling of Nonlinear Stochastic Governing Equations
Mozes Jacobs
Bingni W. Brunton
Steven L. Brunton
J. Nathan Kutz
Ryan V. Raut
170
14
0
07 Oct 2023
LLM-grounded Video Diffusion Models
International Conference on Learning Representations (ICLR), 2023
Long Lian
Baifeng Shi
Semih Yavuz
Ye Liu
Boyi Li
DiffM
218
74
0
29 Sep 2023
Automatic Animation of Hair Blowing in Still Portrait Photos
IEEE International Conference on Computer Vision (ICCV), 2023
Wenpeng Xiao
Wentao Liu
Yitong Wang
Guohao Li
Bing Li
3DH
216
14
0
25 Sep 2023
Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Alexia Jolicoeur-Martineau
Kilian Fatras
Tal Kachman
302
56
0
18 Sep 2023
SimDA: Simple Diffusion Adapter for Efficient Video Generation
Computer Vision and Pattern Recognition (CVPR), 2023
Zhen Xing
Jingdong Sun
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGen
DiffM
222
105
0
18 Aug 2023
S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video Prediction
IEEE transactions on multimedia (IEEE TMM), 2023
Mohammad Adiban
Kalin Stefanov
Sabato Marco Siniscalchi
G. Salvi
237
3
0
13 Jul 2023
Action-conditioned Deep Visual Prediction with RoAM, a new Indoor Human Motion Dataset for Autonomous Robots
IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2023
Meenakshi Sarkar
V. Honkote
D. Das
D. Ghose
172
3
0
28 Jun 2023
OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning
Neural Information Processing Systems (NeurIPS), 2023
Cheng Tan
Siyuan Li
Zhangyang Gao
Wen-Cai Guan
Zedong Wang
Zicheng Liu
Lirong Wu
Stan Z. Li
AI4TS
245
90
0
20 Jun 2023
Fast Fourier Inception Networks for Occluded Video Prediction
IEEE transactions on multimedia (IEEE TMM), 2023
Ping Li
Chenhan Zhang
Xianghua Xu
180
10
0
17 Jun 2023
Video Diffusion Models with Local-Global Context Guidance
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Si-hang Yang
Lu Zhang
Yu Liu
Zhizhuo Jiang
You He
VGen
DiffM
118
18
0
05 Jun 2023
VDT: General-purpose Video Diffusion Transformers via Mask Modeling
International Conference on Learning Representations (ICLR), 2023
Haoyu Lu
Guoxing Yang
Nanyi Fei
Yuqi Huo
Zhiwu Lu
Ping Luo
Mingyu Ding
DiffM
VGen
214
97
0
22 May 2023
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2023
A. Blattmann
Robin Rombach
Huan Ling
Tim Dockhorn
Seung Wook Kim
Sanja Fidler
Karsten Kreis
3DGS
VGen
610
1,410
0
18 Apr 2023
MS-LSTM: Exploring Spatiotemporal Multiscale Representations in Video Prediction Domain
Applied Soft Computing (Appl. Soft Comput.), 2023
Zhifeng Ma
Hao Zhang
Jie Liu
396
10
0
16 Apr 2023
Explicitly Minimizing the Blur Error of Variational Autoencoders
International Conference on Learning Representations (ICLR), 2023
G. Bredell
Kyriakos Flouris
K. Chaitanya
Ertunc Erdil
E. Konukoglu
153
35
0
12 Apr 2023
Model-Based Reinforcement Learning with Isolated Imaginations
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Minting Pan
Geng Chen
Yitao Zheng
Yunbo Wang
Xiaokang Yang
321
2
0
27 Mar 2023
Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformers
Computer Vision and Pattern Recognition (CVPR), 2023
Jaehoon Yoo
Semin Kim
Doyup Lee
Chiheon Kim
Seunghoon Hong
201
6
0
20 Mar 2023
A Dynamic Multi-Scale Voxel Flow Network for Video Prediction
Computer Vision and Pattern Recognition (CVPR), 2023
Xiaotao Hu
Zhewei Huang
Ailin Huang
Jun Xu
Shuchang Zhou
VGen
214
86
0
17 Mar 2023
TKN: Transformer-based Keypoint Prediction Network For Real-time Video Prediction
Haoran Li
Pengyuan Zhou
Yi-Wen Lin
Y. Hao
Haiyong Xie
Yong Liao
ViT
AI4TS
263
1
0
17 Mar 2023
Implicit Stacked Autoregressive Model for Video Prediction
Min-seok Seo
Hakjin Lee
Do-Yeon Kim
Junghoon Seo
VGen
143
20
0
14 Mar 2023
Continual Visual Reinforcement Learning with A Life-Long World Model
Wendong Zhang
Wendong Zhang
Geng Chen
Siyu Gao
Yunbo Wang
Xiaokang Yang
Yunbo Wang
CLL
313
3
0
12 Mar 2023
Distributional Learning of Variational AutoEncoder: Application to Synthetic Data Generation
Neural Information Processing Systems (NeurIPS), 2023
SeungHwan An
Jong-June Jeon
DRL
473
12
0
22 Feb 2023
Learning from Predictions: Fusing Training and Autoregressive Inference for Long-Term Spatiotemporal Forecasts
Social Science Research Network (SSRN), 2023
Pantelis R. Vlachas
Petros Koumoutsakos
AI4TS
AI4CE
264
11
0
22 Feb 2023
Anti-aliasing Predictive Coding Network for Future Video Frame Prediction
Chaofan Ling
Wei-Hong Li
Junpei Zhong
227
0
0
13 Jan 2023
Long-horizon video prediction using a dynamic latent hierarchy
Alexey Zakharov
Qinghai Guo
Zafeirios Fountas
161
5
0
29 Dec 2022
Predictive Coding Based Multiscale Network with Encoder-Decoder LSTM for Video Prediction
Chaofan Ling
Junpei Zhong
Wei-Hong Li
278
3
0
22 Dec 2022
Video Prediction by Efficient Transformers
Image and Vision Computing (IVC), 2022
Xi Ye
Guillaume-Alexandre Bilodeau
ViT
250
45
0
12 Dec 2022
Multi-Rate VAE: Train Once, Get the Full Rate-Distortion Curve
International Conference on Learning Representations (ICLR), 2022
Juhan Bae
Michael Ruogu Zhang
Michael Ruan
Eric Wang
S. Hasegawa
Jimmy Ba
Roger C. Grosse
DRL
231
23
0
07 Dec 2022
Efficient Video Prediction via Sparsely Conditioned Flow Matching
IEEE International Conference on Computer Vision (ICCV), 2022
A. Davtyan
Sepehr Sameni
Paolo Favaro
VGen
DiffM
250
41
0
26 Nov 2022
WALDO: Future Video Synthesis using Object Layer Decomposition and Parametric Flow Prediction
IEEE International Conference on Computer Vision (ICCV), 2022
G. L. Moing
Jean Ponce
Cordelia Schmid
345
7
0
25 Nov 2022
1
2
3
Next