Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1911.01655
Cited By
High Fidelity Video Prediction with Large Stochastic Recurrent Neural Networks
5 November 2019
Ruben Villegas
Arkanath Pathak
Harini Kannan
D. Erhan
Quoc V. Le
Honglak Lee
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"High Fidelity Video Prediction with Large Stochastic Recurrent Neural Networks"
32 / 32 papers shown
Title
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
Sihyun Yu
Meera Hahn
Dan Kondratyuk
Jinwoo Shin
Agrim Gupta
José Lezama
Irfan Essa
David A. Ross
Jonathan Huang
DiffM
VGen
74
0
0
18 Feb 2025
MAUCell: An Adaptive Multi-Attention Framework for Video Frame Prediction
Shreyam Gupta
P. Agrawal
Priyam Gupta
69
0
0
28 Jan 2025
Video In-context Learning: Autoregressive Transformers are Zero-Shot Video Imitators
Wentao Zhang
Junliang Guo
Tianyu He
Li Zhao
Linli Xu
Jiang Bian
34
3
0
10 Jul 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models
Jialong Wu
Shaofeng Yin
Ningya Feng
Xu He
Dong Li
Jianye Hao
Mingsheng Long
VGen
40
23
0
24 May 2024
Action-conditioned video data improves predictability
Meenakshi Sarkar
Debasish Ghose
VGen
38
0
0
08 Apr 2024
Neural Foundations of Mental Simulation: Future Prediction of Latent Representations on Dynamic Scenes
Aran Nayebi
R. Rajalingham
M. Jazayeri
G. R. Yang
36
17
0
19 May 2023
PastNet: Introducing Physical Inductive Biases for Spatio-temporal Video Prediction
Hao Wu
Wei Xion
Fan Xu
Xian-Sheng Hua
C. L. Philip Chen
Xiansheng Hua
AI4TS
23
27
0
19 May 2023
Long-horizon video prediction using a dynamic latent hierarchy
Alexey Zakharov
Qinghai Guo
Z. Fountas
19
4
0
29 Dec 2022
Phenaki: Variable Length Video Generation From Open Domain Textual Description
Ruben Villegas
Mohammad Babaeizadeh
Pieter-Jan Kindermans
Hernan Moraldo
Han Zhang
M. Saffar
Santiago Castro
Julius Kunze
D. Erhan
DiffM
VGen
48
371
0
05 Oct 2022
Temporal View Synthesis of Dynamic Scenes through 3D Object Motion Estimation with Multi-Plane Images
Nagabhushan Somraj
Pranali Sancheti
R. Soundararajan
27
4
0
19 Aug 2022
InfiniteNature-Zero: Learning Perpetual View Generation of Natural Scenes from Single Images
Zhengqi Li
Qianqian Wang
Noah Snavely
Angjoo Kanazawa
VGen
26
59
0
22 Jul 2022
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis
Chenfei Wu
Jian Liang
Xiaowei Hu
Zhe Gan
Jianfeng Wang
Lijuan Wang
Zicheng Liu
Yuejian Fang
Nan Duan
VGen
12
72
0
20 Jul 2022
Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning
Cheng Tan
Zhangyang Gao
Lirong Wu
Yongjie Xu
Jun-Xiong Xia
Siyuan Li
Stan Z. Li
31
107
0
24 Jun 2022
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta
Stephen Tian
Yunzhi Zhang
Jiajun Wu
Roberto Martín-Martín
Li Fei-Fei
100
110
0
23 Jun 2022
MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation
Vikram S. Voleti
Alexia Jolicoeur-Martineau
Christopher Pal
DiffM
VGen
13
290
0
19 May 2022
Action Conditioned Tactile Prediction: case study on slip prediction
Willow Mandil
Kiyanoush Nazari
E. AmirGhalamzan
27
16
0
19 May 2022
STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction
Zheng Chang
Xinfeng Zhang
Shanshe Wang
Siwei Ma
Wen Gao
21
50
0
30 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
16
116
0
25 Mar 2022
Stochastic Video Prediction with Structure and Motion
Adil Kaan Akan
Sadra Safadoust
Fatma Guney
VGen
21
9
0
20 Mar 2022
Transframer: Arbitrary Frame Prediction with Generative Models
C. Nash
João Carreira
Jacob Walker
Iain Barr
Andrew Jaegle
Mateusz Malinowski
Peter W. Battaglia
ViT
19
37
0
17 Mar 2022
MSPred: Video Prediction at Multiple Spatio-Temporal Scales with Hierarchical Recurrent Networks
Angel Villar-Corrales
Ani J. Karapetyan
Andreas Boltres
Sven Behnke
19
11
0
17 Mar 2022
Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks
Sihyun Yu
Jihoon Tack
Sangwoo Mo
Hyunsu Kim
Junho Kim
Jung-Woo Ha
Jinwoo Shin
DiffM
VGen
18
199
0
21 Feb 2022
LARNet: Latent Action Representation for Human Action Synthesis
Naman Biyani
A. J. Rana
Shruti Vyas
Y. S. Rawat
13
4
0
21 Oct 2021
Diverse Generation from a Single Video Made Possible
Niv Haim
Ben Feinstein
Niv Granot
Assaf Shocher
Shai Bagon
Tali Dekel
Michal Irani
DiffM
VGen
34
18
0
17 Sep 2021
Physion: Evaluating Physical Prediction from Vision in Humans and Machines
Daniel M. Bear
E. Wang
Damian Mrowca
Felix Binder
Hsiau-Yu Fish Tung
...
Li Fei-Fei
Nancy Kanwisher
J. Tenenbaum
Daniel L. K. Yamins
Judith E. Fan
OOD
55
86
0
15 Jun 2021
Video Prediction Recalling Long-term Motion Context via Memory Alignment Learning
Sangmin Lee
Hak Gu Kim
Dae Hwi Choi
Hyungil Kim
Yong Man Ro
22
102
0
02 Apr 2021
Self-Supervision by Prediction for Object Discovery in Videos
Beril Besbinar
P. Frossard
SSL
21
7
0
09 Mar 2021
Greedy Hierarchical Variational Autoencoders for Large-Scale Video Prediction
Bohan Wu
Suraj Nair
Roberto Martin-Martin
Li Fei-Fei
Chelsea Finn
DRL
21
99
0
06 Mar 2021
Infinite Nature: Perpetual View Generation of Natural Scenes from a Single Image
Andrew Liu
Richard Tucker
Varun Jampani
A. Makadia
Noah Snavely
Angjoo Kanazawa
VGen
31
157
0
17 Dec 2020
ConvTransformer: A Convolutional Transformer Network for Video Frame Synthesis
Zhouyong Liu
S. Luo
Wubin Li
Jingben Lu
Yufan Wu
Shilei Sun
Chunguo Li
Luxi Yang
ViT
17
79
0
20 Nov 2020
SCOP: Scientific Control for Reliable Neural Network Pruning
Yehui Tang
Yunhe Wang
Yixing Xu
Dacheng Tao
Chunjing Xu
Chao Xu
Chang Xu
AAML
39
166
0
21 Oct 2020
Stochastic Latent Residual Video Prediction
Jean-Yves Franceschi
E. Delasalles
Mickaël Chen
Sylvain Lamprier
Patrick Gallinari
VGen
26
159
0
21 Feb 2020
1