Improved Conditional VRNNs for Video Prediction

IEEE International Conference on Computer Vision (ICCV), 2019

27 April 2019

Aaron Courville

Papers citing "Improved Conditional VRNNs for Video Prediction"

50 / 114 papers shown

STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows Jiatao Gu Ying Shen Tianrong Chen Laurent Dinh Y. Wang Miguel Angel Bautista David Berthelot Josh Susskind Shuangfei Zhai DiffM VGen 290 3 0 25 Nov 2025
Sequence-Adaptive Video Prediction in Continuous Streams using Diffusion Noise Optimization Sina Mokhtarzadeh Azar Emad Bahrami Enrico Pallotta Gianpiero Francesca Radu Timofte Juergen Gall DiffM 116 0 0 23 Nov 2025
Video Prediction of Dynamic Physical Simulations With Pixel-Space Spatiotemporal Transformers. IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2025 Dean L. Slack G. Hudson T. Winterbottom Noura Al Moubayed 134 0 0 23 Oct 2025
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos Rundong Luo Matthew Wallingford Ali Farhadi Noah Snavely Wei-Chiu Ma VGen 402 6 0 10 Apr 2025
Unified Arbitrary-Time Video Frame Interpolation and Prediction. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025 Xin Jin Longhai Wu Jie Chen Ilhyun Cho Cheul-hee Hahm 255 1 0 04 Mar 2025
Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers. Computer Vision and Pattern Recognition (CVPR), 2025 Efstathios Karypidis Ioannis Kakogeorgiou Spyros Gidaris N. Komodakis 359 3 0 14 Jan 2025
DINO-Foresight: Looking into the Future with DINO Efstathios Karypidis Ioannis Kakogeorgiou Spyros Gidaris N. Komodakis AI4CE 596 15 0 16 Dec 2024
Efficient Continuous Video Flow Model for Video Prediction Gaurav Shrivastava Abhinav Shrivastava VGen 221 0 0 07 Dec 2024
Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction Gaurav Shrivastava Abhinav Shrivastava VGen DiffM 260 0 0 06 Dec 2024
Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024 Xiang Deng Youxin Pang Xiaochen Zhao Chao Xu Lizhen Wang Hongjiang Xiao Shi Yan Hongwen Zhang Yebin Liu DiffM VGen 231 3 0 31 Oct 2024
Motion Graph Unleashed: A Novel Approach to Video Prediction. Neural Information Processing Systems (NeurIPS), 2024 Yiqi Zhong Luming Liang Bohan Tang Ilya Zharkov Ulrich Neumann 309 4 0 29 Oct 2024
Masked Autoregressive Model for Weather Forecasting Doyi Kim Minseok Seo Hakjin Lee Junghoon Seo 210 1 0 30 Sep 2024
Blox-Net: Generative Design-for-Robot-Assembly Using VLM Supervision, Physics Simulation, and a Robot with Reset. IEEE International Conference on Robotics and Automation (ICRA), 2024 Andrew Goldberg Kavish Kondap Tianshuang Qiu Zehan Ma Letian Fu Justin Kerr Huang Huang Kaiyuan Chen Kuan Fang Ken Goldberg 193 13 0 25 Sep 2024
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos Jiahe Liu Youran Qu Qi Yan Fangyin Wei Lele Wang Renjie Liao VGen EGVM 317 28 0 23 Jul 2024
The Power of Next-Frame Prediction for Learning Physical Laws T. Winterbottom G. Hudson Daniel Kluvanec Dean L. Slack Jamie Sterling Junjie Shentu Chenghao Xiao Zheming Zhou Noura Al Moubayed 203 3 0 21 May 2024
On the Content Bias in Fréchet Video Distance Jason S. Hoffman Aniruddha Mahapatra Gaurav Parmar Jun-Yan Zhu Jia-Bin Huang EGVM 232 32 0 18 Apr 2024
Action-conditioned video data improves predictability Meenakshi Sarkar Debasish Ghose VGen 303 0 0 08 Apr 2024
A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming Pengyuan Zhou Lin Wang Zhi Liu Yanbin Hao Pan Hui Sasu Tarkoma J. Kangasharju VGen 252 46 0 30 Jan 2024
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets A. Blattmann Tim Dockhorn Sumith Kulal Daniel Mendelevitch Maciej Kilian ... Zion English Vikram S. Voleti Adam Letts Varun Jampani Robin Rombach VGen 961 1,938 0 25 Nov 2023
Breathing Life Into Sketches Using Text-to-Video Priors. Computer Vision and Pattern Recognition (CVPR), 2023 Rinon Gal Yael Vinker Yuval Alaluf Amit H. Bermano Daniel Cohen-Or Ariel Shamir Gal Chechik VGen DiffM 190 47 0 21 Nov 2023
Triplet Attention Transformer for Spatiotemporal Predictive Learning. IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023 Xuesong Nie Xi Chen Haoyuan Jin Zhihang Zhu Yunfeng Yan Donglian Qi ViT 168 15 0 28 Oct 2023
HyperSINDy: Deep Generative Modeling of Nonlinear Stochastic Governing Equations Mozes Jacobs Bingni W. Brunton Steven L. Brunton J. Nathan Kutz Ryan V. Raut 178 14 0 07 Oct 2023
LLM-grounded Video Diffusion Models. International Conference on Learning Representations (ICLR), 2023 Long Lian Baifeng Shi Semih Yavuz Ye Liu Boyi Li DiffM 230 74 0 29 Sep 2023
Automatic Animation of Hair Blowing in Still Portrait Photos. IEEE International Conference on Computer Vision (ICCV), 2023 Wenpeng Xiao Wentao Liu Yitong Wang Guohao Li Bing Li 3DH 216 14 0 25 Sep 2023
Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees. International Conference on Artificial Intelligence and Statistics (AISTATS), 2023 Alexia Jolicoeur-Martineau Kilian Fatras Tal Kachman 314 56 0 18 Sep 2023
SimDA: Simple Diffusion Adapter for Efficient Video Generation. Computer Vision and Pattern Recognition (CVPR), 2023 Zhen Xing Jingdong Sun Hang-Rui Hu Zuxuan Wu Yu-Gang Jiang VGen DiffM 250 105 0 18 Aug 2023
S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video Prediction. IEEE transactions on multimedia (IEEE TMM), 2023 Mohammad Adiban Kalin Stefanov Sabato Marco Siniscalchi G. Salvi 241 3 0 13 Jul 2023
Action-conditioned Deep Visual Prediction with RoAM, a new Indoor Human Motion Dataset for Autonomous Robots. IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2023 Meenakshi Sarkar V. Honkote D. Das D. Ghose 180 3 0 28 Jun 2023
OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning. Neural Information Processing Systems (NeurIPS), 2023 Cheng Tan Siyuan Li Zhangyang Gao Wen-Cai Guan Zedong Wang Zicheng Liu Lirong Wu Stan Z. Li AI4TS 245 90 0 20 Jun 2023
Fast Fourier Inception Networks for Occluded Video Prediction. IEEE transactions on multimedia (IEEE TMM), 2023 Ping Li Chenhan Zhang Xianghua Xu 180 11 0 17 Jun 2023
Video Diffusion Models with Local-Global Context Guidance. International Joint Conference on Artificial Intelligence (IJCAI), 2023 Si-hang Yang Lu Zhang Yu Liu Zhizhuo Jiang You He VGen DiffM 118 18 0 05 Jun 2023
VDT: General-purpose Video Diffusion Transformers via Mask Modeling. International Conference on Learning Representations (ICLR), 2023 Haoyu Lu Guoxing Yang Nanyi Fei Yuqi Huo Zhiwu Lu Ping Luo Mingyu Ding DiffM VGen 218 99 0 22 May 2023
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models. Computer Vision and Pattern Recognition (CVPR), 2023 A. Blattmann Robin Rombach Huan Ling Tim Dockhorn Seung Wook Kim Sanja Fidler Karsten Kreis 3DGS VGen 610 1,410 0 18 Apr 2023
MS-LSTM: Exploring Spatiotemporal Multiscale Representations in Video Prediction Domain. Applied Soft Computing (Appl. Soft Comput.), 2023 Zhifeng Ma Hao Zhang Jie Liu 408 10 0 16 Apr 2023
Explicitly Minimizing the Blur Error of Variational Autoencoders. International Conference on Learning Representations (ICLR), 2023 G. Bredell Kyriakos Flouris K. Chaitanya Ertunc Erdil E. Konukoglu 157 35 0 12 Apr 2023
Model-Based Reinforcement Learning with Isolated Imaginations. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023 Minting Pan Geng Chen Yitao Zheng Yunbo Wang Xiaokang Yang 325 2 0 27 Mar 2023
Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformers. Computer Vision and Pattern Recognition (CVPR), 2023 Jaehoon Yoo Semin Kim Doyup Lee Chiheon Kim Seunghoon Hong 201 6 0 20 Mar 2023
A Dynamic Multi-Scale Voxel Flow Network for Video Prediction. Computer Vision and Pattern Recognition (CVPR), 2023 Xiaotao Hu Zhewei Huang Ailin Huang Jun Xu Shuchang Zhou VGen 218 86 0 17 Mar 2023
TKN: Transformer-based Keypoint Prediction Network For Real-time Video Prediction Haoran Li Pengyuan Zhou Yi-Wen Lin Y. Hao Haiyong Xie Yong Liao ViT AI4TS 267 1 0 17 Mar 2023
Implicit Stacked Autoregressive Model for Video Prediction Min-seok Seo Hakjin Lee Do-Yeon Kim Junghoon Seo VGen 143 20 0 14 Mar 2023
Continual Visual Reinforcement Learning with A Life-Long World Model Wendong Zhang Geng Chen Siyu Gao Yunbo Wang Xiaokang Yang CLL 321 3 0 12 Mar 2023
Distributional Learning of Variational AutoEncoder: Application to Synthetic Data Generation. Neural Information Processing Systems (NeurIPS), 2023 SeungHwan An Jong-June Jeon DRL 473 12 0 22 Feb 2023
Learning from Predictions: Fusing Training and Autoregressive Inference for Long-Term Spatiotemporal Forecasts. Social Science Research Network (SSRN), 2023 Pantelis R. Vlachas Petros Koumoutsakos AI4TS AI4CE 264 11 0 22 Feb 2023
Anti-aliasing Predictive Coding Network for Future Video Frame Prediction Chaofan Ling Wei-Hong Li Junpei Zhong 231 0 0 13 Jan 2023
Long-horizon video prediction using a dynamic latent hierarchy Alexey Zakharov Qinghai Guo Zafeirios Fountas 177 5 0 29 Dec 2022
Predictive Coding Based Multiscale Network with Encoder-Decoder LSTM for Video Prediction Chaofan Ling Junpei Zhong Wei-Hong Li 286 3 0 22 Dec 2022
Video Prediction by Efficient Transformers. Image and Vision Computing (IVC), 2022 Xi Ye Guillaume-Alexandre Bilodeau ViT 258 45 0 12 Dec 2022
Multi-Rate VAE: Train Once, Get the Full Rate-Distortion Curve. International Conference on Learning Representations (ICLR), 2022 Juhan Bae Michael Ruogu Zhang Michael Ruan Eric Wang S. Hasegawa Jimmy Ba Roger C. Grosse DRL 235 23 0 07 Dec 2022
Efficient Video Prediction via Sparsely Conditioned Flow Matching. IEEE International Conference on Computer Vision (ICCV), 2022 A. Davtyan Sepehr Sameni Paolo Favaro VGen DiffM 250 42 0 26 Nov 2022
WALDO: Future Video Synthesis using Object Layer Decomposition and Parametric Flow Prediction. IEEE International Conference on Computer Vision (ICCV), 2022 G. L. Moing Jean Ponce Cordelia Schmid 349 7 0 25 Nov 2022
