High Fidelity Video Prediction with Large Stochastic Recurrent Neural Networks

5 November 2019

Papers citing "High Fidelity Video Prediction with Large Stochastic Recurrent Neural Networks"

Title | Authors | Tags | Counts | Date
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation | Sihyun Yu, Meera Hahn, Dan Kondratyuk, Jinwoo Shin, Agrim Gupta, José Lezama, Irfan Essa, David A. Ross, Jonathan Huang | DiffM, VGen | 74 0 0 | 18 Feb 2025
MAUCell: An Adaptive Multi-Attention Framework for Video Frame Prediction | Shreyam Gupta, P. Agrawal, Priyam Gupta | | 69 0 0 | 28 Jan 2025
Video In-context Learning: Autoregressive Transformers are Zero-Shot Video Imitators | Wentao Zhang, Junliang Guo, Tianyu He, Li Zhao, Linli Xu, Jiang Bian | | 34 3 0 | 10 Jul 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models | Jialong Wu, Shaofeng Yin, Ningya Feng, Xu He, Dong Li, Jianye Hao, Mingsheng Long | VGen | 40 23 0 | 24 May 2024
Action-conditioned video data improves predictability | Meenakshi Sarkar, Debasish Ghose | VGen | 38 0 0 | 08 Apr 2024
Neural Foundations of Mental Simulation: Future Prediction of Latent Representations on Dynamic Scenes | Aran Nayebi, R. Rajalingham, M. Jazayeri, G. R. Yang | | 36 17 0 | 19 May 2023
PastNet: Introducing Physical Inductive Biases for Spatio-temporal Video Prediction | Hao Wu, Wei Xion, Fan Xu, Xian-Sheng Hua, C. L. Philip Chen, Xiansheng Hua | AI4TS | 23 27 0 | 19 May 2023
Long-horizon video prediction using a dynamic latent hierarchy | Alexey Zakharov, Qinghai Guo, Z. Fountas | | 19 4 0 | 29 Dec 2022
Phenaki: Variable Length Video Generation From Open Domain Textual Description | Ruben Villegas, Mohammad Babaeizadeh, Pieter-Jan Kindermans, Hernan Moraldo, Han Zhang, M. Saffar, Santiago Castro, Julius Kunze, D. Erhan | DiffM, VGen | 48 371 0 | 05 Oct 2022
Temporal View Synthesis of Dynamic Scenes through 3D Object Motion Estimation with Multi-Plane Images | Nagabhushan Somraj, Pranali Sancheti, R. Soundararajan | | 27 4 0 | 19 Aug 2022
InfiniteNature-Zero: Learning Perpetual View Generation of Natural Scenes from Single Images | Zhengqi Li, Qianqian Wang, Noah Snavely, Angjoo Kanazawa | VGen | 26 59 0 | 22 Jul 2022
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis | Chenfei Wu, Jian Liang, Xiaowei Hu, Zhe Gan, Jianfeng Wang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan | VGen | 12 72 0 | 20 Jul 2022
Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning | Cheng Tan, Zhangyang Gao, Lirong Wu, Yongjie Xu, Jun-Xiong Xia, Siyuan Li, Stan Z. Li | | 31 107 0 | 24 Jun 2022
MaskViT: Masked Visual Pre-Training for Video Prediction | Agrim Gupta, Stephen Tian, Yunzhi Zhang, Jiajun Wu, Roberto Martín-Martín, Li Fei-Fei | | 100 110 0 | 23 Jun 2022
MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation | Vikram S. Voleti, Alexia Jolicoeur-Martineau, Christopher Pal | DiffM, VGen | 13 290 0 | 19 May 2022
Action Conditioned Tactile Prediction: case study on slip prediction | Willow Mandil, Kiyanoush Nazari, E. AmirGhalamzan | | 27 16 0 | 19 May 2022
STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction | Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao | | 21 50 0 | 30 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos | Younggyo Seo, Kimin Lee, Stephen James, Pieter Abbeel | SSL, OnRL | 16 116 0 | 25 Mar 2022
Stochastic Video Prediction with Structure and Motion | Adil Kaan Akan, Sadra Safadoust, Fatma Guney | VGen | 21 9 0 | 20 Mar 2022
Transframer: Arbitrary Frame Prediction with Generative Models | C. Nash, João Carreira, Jacob Walker, Iain Barr, Andrew Jaegle, Mateusz Malinowski, Peter W. Battaglia | ViT | 19 37 0 | 17 Mar 2022
MSPred: Video Prediction at Multiple Spatio-Temporal Scales with Hierarchical Recurrent Networks | Angel Villar-Corrales, Ani J. Karapetyan, Andreas Boltres, Sven Behnke | | 19 11 0 | 17 Mar 2022
Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks | Sihyun Yu, Jihoon Tack, Sangwoo Mo, Hyunsu Kim, Junho Kim, Jung-Woo Ha, Jinwoo Shin | DiffM, VGen | 18 199 0 | 21 Feb 2022
LARNet: Latent Action Representation for Human Action Synthesis | Naman Biyani, A. J. Rana, Shruti Vyas, Y. S. Rawat | | 13 4 0 | 21 Oct 2021
Diverse Generation from a Single Video Made Possible | Niv Haim, Ben Feinstein, Niv Granot, Assaf Shocher, Shai Bagon, Tali Dekel, Michal Irani | DiffM, VGen | 34 18 0 | 17 Sep 2021
Physion: Evaluating Physical Prediction from Vision in Humans and Machines | Daniel M. Bear, E. Wang, Damian Mrowca, Felix Binder, Hsiau-Yu Fish Tung, ..., Li Fei-Fei, Nancy Kanwisher, J. Tenenbaum, Daniel L. K. Yamins, Judith E. Fan | OOD | 55 86 0 | 15 Jun 2021
Video Prediction Recalling Long-term Motion Context via Memory Alignment Learning | Sangmin Lee, Hak Gu Kim, Dae Hwi Choi, Hyungil Kim, Yong Man Ro | | 22 102 0 | 02 Apr 2021
Self-Supervision by Prediction for Object Discovery in Videos | Beril Besbinar, P. Frossard | SSL | 21 7 0 | 09 Mar 2021
Greedy Hierarchical Variational Autoencoders for Large-Scale Video Prediction | Bohan Wu, Suraj Nair, Roberto Martin-Martin, Li Fei-Fei, Chelsea Finn | DRL | 21 99 0 | 06 Mar 2021
Infinite Nature: Perpetual View Generation of Natural Scenes from a Single Image | Andrew Liu, Richard Tucker, Varun Jampani, A. Makadia, Noah Snavely, Angjoo Kanazawa | VGen | 31 157 0 | 17 Dec 2020
ConvTransformer: A Convolutional Transformer Network for Video Frame Synthesis | Zhouyong Liu, S. Luo, Wubin Li, Jingben Lu, Yufan Wu, Shilei Sun, Chunguo Li, Luxi Yang | ViT | 17 79 0 | 20 Nov 2020
SCOP: Scientific Control for Reliable Neural Network Pruning | Yehui Tang, Yunhe Wang, Yixing Xu, Dacheng Tao, Chunjing Xu, Chao Xu, Chang Xu | AAML | 39 166 0 | 21 Oct 2020
Stochastic Latent Residual Video Prediction | Jean-Yves Franceschi, E. Delasalles, Mickaël Chen, Sylvain Lamprier, Patrick Gallinari | VGen | 26 159 0 | 21 Feb 2020