Stochastic Video Generation with a Learned Prior

21 February 2018

Papers citing "Stochastic Video Generation with a Learned Prior"

50 / 95 papers shown

Title
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos Rundong Luo Matthew Wallingford Ali Farhadi Noah Snavely Wei-Chiu Ma VGen 19 0 0 10 Apr 2025
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation Sihyun Yu Meera Hahn Dan Kondratyuk Jinwoo Shin Agrim Gupta José Lezama Irfan Essa David A. Ross Jonathan Huang DiffM VGen 72 0 0 18 Feb 2025
Object-Centric Image to Video Generation with Language Guidance Angel Villar-Corrales Gjergj Plepi Sven Behnke DiffM VGen OCL 71 0 0 17 Feb 2025
MAUCell: An Adaptive Multi-Attention Framework for Video Frame Prediction Shreyam Gupta P. Agrawal Priyam Gupta 67 0 0 28 Jan 2025
Visual Representation Learning with Stochastic Frame Prediction Huiwon Jang Dongyoung Kim Junsu Kim Jinwoo Shin Pieter Abbeel Younggyo Seo 34 2 0 11 Jun 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models Jialong Wu Shaofeng Yin Ningya Feng Xu He Dong Li Jianye Hao Mingsheng Long VGen 35 23 0 24 May 2024
Action-conditioned video data improves predictability Meenakshi Sarkar Debasish Ghose VGen 33 0 0 08 Apr 2024
Breathing Life Into Sketches Using Text-to-Video Priors Rinon Gal Yael Vinker Yuval Alaluf Amit H. Bermano Daniel Cohen-Or Ariel Shamir Gal Chechik VGen DiffM 27 29 0 21 Nov 2023
Neural Foundations of Mental Simulation: Future Prediction of Latent Representations on Dynamic Scenes Aran Nayebi R. Rajalingham M. Jazayeri G. R. Yang 28 17 0 19 May 2023
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models A. Blattmann Robin Rombach Huan Ling Tim Dockhorn Seung Wook Kim Sanja Fidler Karsten Kreis 3DGS VGen 63 1,010 0 18 Apr 2023
Inductive biases in deep learning models for weather prediction Jannik Thümmel Matthias Karlbauer S. Otte C. Zarfl Georg Martius ... Thomas Scholten Ulrich Friedrich V. Wulfmeyer B. Goswami Martin Volker Butz AI4CE 33 4 0 06 Apr 2023
Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformers Jaehoon Yoo Semin Kim Doyup Lee Chiheon Kim Seunghoon Hong 21 3 0 20 Mar 2023
Predictive World Models from Real-World Partial Observations Robin Karlsson Alexander Carballo Keisuke Fujii Kento Ohtani K. Takeda 17 5 0 12 Jan 2023
Long-horizon video prediction using a dynamic latent hierarchy Alexey Zakharov Qinghai Guo Z. Fountas 19 4 0 29 Dec 2022
Motion and Context-Aware Audio-Visual Conditioned Video Prediction Yating Xu Conghui Hu G. Lee VGen 35 0 0 09 Dec 2022
Make-A-Story: Visual Memory Conditioned Consistent Story Generation Tanzila Rahman Hsin-Ying Lee Jian Ren Sergey Tulyakov Shweta Mahajan Leonid Sigal DiffM 16 68 0 23 Nov 2022
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation Tsu-jui Fu Licheng Yu Ning Zhang Cheng-Yang Fu Jong-Chyi Su William Yang Wang Sean Bell VGen 40 37 0 23 Nov 2022
Disentangling Content and Motion for Text-Based Neural Video Manipulation Levent Karacan Tolga Kerimouglu .Ismail .Inan Tolga Birdal Erkut Erdem Aykut Erdem 16 1 0 05 Nov 2022
A unified model for continuous conditional video prediction Xi Ye Guillaume-Alexandre Bilodeau AI4TS 32 7 0 11 Oct 2022
Phenaki: Variable Length Video Generation From Open Domain Textual Description Ruben Villegas Mohammad Babaeizadeh Pieter-Jan Kindermans Hernan Moraldo Han Zhang M. Saffar Santiago Castro Julius Kunze D. Erhan DiffM VGen 43 371 0 05 Oct 2022
Temporal View Synthesis of Dynamic Scenes through 3D Object Motion Estimation with Multi-Plane Images Nagabhushan Somraj Pranali Sancheti R. Soundararajan 27 4 0 19 Aug 2022
InfiniteNature-Zero: Learning Perpetual View Generation of Natural Scenes from Single Images Zhengqi Li Qianqian Wang Noah Snavely Angjoo Kanazawa VGen 22 59 0 22 Jul 2022
Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning Cheng Tan Zhangyang Gao Lirong Wu Yongjie Xu Jun-Xiong Xia Siyuan Li Stan Z. Li 25 107 0 24 Jun 2022
MaskViT: Masked Visual Pre-Training for Video Prediction Agrim Gupta Stephen Tian Yunzhi Zhang Jiajun Wu Roberto Martín-Martín Li Fei-Fei 100 110 0 23 Jun 2022
SimVP: Simpler yet Better Video Prediction Zhangyang Gao Cheng Tan Lirong Wu Stan Z. Li 23 210 0 09 Jun 2022
Cascaded Video Generation for Videos In-the-Wild Lluis Castrejon Nicolas Ballas Aaron Courville VGen 24 0 0 01 Jun 2022
SwinVRNN: A Data-Driven Ensemble Forecasting Model via Learned Distribution Perturbation Yuan Hu Lei Chen Zhibin Wang Hao Li OOD 21 45 0 26 May 2022
Action Conditioned Tactile Prediction: case study on slip prediction Willow Mandil Kiyanoush Nazari E. AmirGhalamzan 22 15 0 19 May 2022
Predicting Future Occupancy Grids in Dynamic Environment with Spatio-Temporal Learning K. S. Mann Abhishek Tomy Anshul K. Paigwar A. Renzaglia Christian Laugier 26 10 0 06 May 2022
STAU: A SpatioTemporal-Aware Unit for Video Prediction and Beyond Zheng Chang Xinfeng Zhang Shanshe Wang Siwei Ma Wen Gao 28 1 0 20 Apr 2022
Variational Heteroscedastic Volatility Model Zexuan Yin P. Barucca AI4TS 13 0 0 11 Apr 2022
Simple and Effective Synthesis of Indoor 3D Scenes Jing Yu Koh Harsh Agrawal Dhruv Batra Richard Tucker Austin Waters Honglak Lee Yinfei Yang Jason Baldridge Peter Anderson VGen 3DV 13 29 0 06 Apr 2022
FoV-Net: Field-of-View Extrapolation Using Self-Attention and Uncertainty Liqian Ma Stamatios Georgoulis Xu Jia Luc Van Gool 22 6 0 04 Apr 2022
STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction Zheng Chang Xinfeng Zhang Shanshe Wang Siwei Ma Wen Gao 13 50 0 30 Mar 2022
VPTR: Efficient Transformers for Video Prediction Xi Ye Guillaume-Alexandre Bilodeau ViT 19 18 0 29 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos Younggyo Seo Kimin Lee Stephen James Pieter Abbeel SSL OnRL 16 115 0 25 Mar 2022
Stochastic Video Prediction with Structure and Motion Adil Kaan Akan Sadra Safadoust Fatma Guney VGen 14 9 0 20 Mar 2022
MSPred: Video Prediction at Multiple Spatio-Temporal Scales with Hierarchical Recurrent Networks Angel Villar-Corrales Ani J. Karapetyan Andreas Boltres Sven Behnke 19 11 0 17 Mar 2022
Diffusion Probabilistic Modeling for Video Generation Ruihan Yang Prakhar Srivastava Stephan Mandt DiffM VGen 32 255 0 16 Mar 2022
Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks Sihyun Yu Jihoon Tack Sangwoo Mo Hyunsu Kim Junho Kim Jung-Woo Ha Jinwoo Shin DiffM VGen 18 199 0 21 Feb 2022
AA-TransUNet: Attention Augmented TransUNet For Nowcasting Tasks Yimin Yang S. Mehrkanoon ViT AI4TS 34 41 0 10 Feb 2022
Noether Networks: Meta-Learning Useful Conserved Quantities Ferran Alet Dylan D. Doblar Allan Zhou J. Tenenbaum Kenji Kawaguchi Chelsea Finn 65 26 0 06 Dec 2021
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion Chenfei Wu Jian Liang Lei Ji Fan Yang Yuejian Fang Daxin Jiang Nan Duan ViT VGen 14 292 0 24 Nov 2021
Action2video: Generating Videos of Human 3D Actions Chuan Guo X. Zuo Sen Wang Xinshuang Liu Shihao Zou Minglun Gong Li Cheng 3DH 63 22 0 12 Nov 2021
Contrastively Disentangled Sequential Variational Autoencoder M. Kiener Weiran Wang Michael Gerndt CoGe DRL 4 40 0 22 Oct 2021
LARNet: Latent Action Representation for Human Action Synthesis Naman Biyani A. J. Rana Shruti Vyas Y. S. Rawat 11 4 0 21 Oct 2021
A Hierarchical Variational Neural Uncertainty Model for Stochastic Video Prediction Moitreya Chatterjee N. Ahuja A. Cherian UQCV VGen BDL 29 17 0 06 Oct 2021
Diverse Generation from a Single Video Made Possible Niv Haim Ben Feinstein Niv Granot Assaf Shocher Shai Bagon Tali Dekel Michal Irani DiffM VGen 34 18 0 17 Sep 2021
Simple Video Generation using Neural ODEs David Kanaa Vikram S. Voleti Samira Ebrahimi Kahou Christopher Pal 17 20 0 07 Sep 2021
Conditional Temporal Variational AutoEncoder for Action Video Prediction Xiaogang Xu Yi Wang Liwei Wang Bei Yu Jiaya Jia VGen 24 5 0 12 Aug 2021