ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.07687
  4. Cited By
Stochastic Video Generation with a Learned Prior

Stochastic Video Generation with a Learned Prior

21 February 2018
Emily L. Denton
Rob Fergus
    VGen
ArXivPDFHTML

Papers citing "Stochastic Video Generation with a Learned Prior"

50 / 95 papers shown
Title
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos
Rundong Luo
Matthew Wallingford
Ali Farhadi
Noah Snavely
Wei-Chiu Ma
VGen
19
0
0
10 Apr 2025
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
Sihyun Yu
Meera Hahn
Dan Kondratyuk
Jinwoo Shin
Agrim Gupta
José Lezama
Irfan Essa
David A. Ross
Jonathan Huang
DiffM
VGen
72
0
0
18 Feb 2025
Object-Centric Image to Video Generation with Language Guidance
Object-Centric Image to Video Generation with Language Guidance
Angel Villar-Corrales
Gjergj Plepi
Sven Behnke
DiffM
VGen
OCL
71
0
0
17 Feb 2025
MAUCell: An Adaptive Multi-Attention Framework for Video Frame Prediction
MAUCell: An Adaptive Multi-Attention Framework for Video Frame Prediction
Shreyam Gupta
P. Agrawal
Priyam Gupta
67
0
0
28 Jan 2025
Visual Representation Learning with Stochastic Frame Prediction
Visual Representation Learning with Stochastic Frame Prediction
Huiwon Jang
Dongyoung Kim
Junsu Kim
Jinwoo Shin
Pieter Abbeel
Younggyo Seo
34
2
0
11 Jun 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models
iVideoGPT: Interactive VideoGPTs are Scalable World Models
Jialong Wu
Shaofeng Yin
Ningya Feng
Xu He
Dong Li
Jianye Hao
Mingsheng Long
VGen
35
23
0
24 May 2024
Action-conditioned video data improves predictability
Action-conditioned video data improves predictability
Meenakshi Sarkar
Debasish Ghose
VGen
33
0
0
08 Apr 2024
Breathing Life Into Sketches Using Text-to-Video Priors
Breathing Life Into Sketches Using Text-to-Video Priors
Rinon Gal
Yael Vinker
Yuval Alaluf
Amit H. Bermano
Daniel Cohen-Or
Ariel Shamir
Gal Chechik
VGen
DiffM
27
29
0
21 Nov 2023
Neural Foundations of Mental Simulation: Future Prediction of Latent
  Representations on Dynamic Scenes
Neural Foundations of Mental Simulation: Future Prediction of Latent Representations on Dynamic Scenes
Aran Nayebi
R. Rajalingham
M. Jazayeri
G. R. Yang
28
17
0
19 May 2023
Align your Latents: High-Resolution Video Synthesis with Latent
  Diffusion Models
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
A. Blattmann
Robin Rombach
Huan Ling
Tim Dockhorn
Seung Wook Kim
Sanja Fidler
Karsten Kreis
3DGS
VGen
63
1,010
0
18 Apr 2023
Inductive biases in deep learning models for weather prediction
Inductive biases in deep learning models for weather prediction
Jannik Thümmel
Matthias Karlbauer
S. Otte
C. Zarfl
Georg Martius
...
Thomas Scholten
Ulrich Friedrich
V. Wulfmeyer
B. Goswami
Martin Volker Butz
AI4CE
33
4
0
06 Apr 2023
Towards End-to-End Generative Modeling of Long Videos with
  Memory-Efficient Bidirectional Transformers
Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformers
Jaehoon Yoo
Semin Kim
Doyup Lee
Chiheon Kim
Seunghoon Hong
21
3
0
20 Mar 2023
Predictive World Models from Real-World Partial Observations
Predictive World Models from Real-World Partial Observations
Robin Karlsson
Alexander Carballo
Keisuke Fujii
Kento Ohtani
K. Takeda
17
5
0
12 Jan 2023
Long-horizon video prediction using a dynamic latent hierarchy
Long-horizon video prediction using a dynamic latent hierarchy
Alexey Zakharov
Qinghai Guo
Z. Fountas
19
4
0
29 Dec 2022
Motion and Context-Aware Audio-Visual Conditioned Video Prediction
Motion and Context-Aware Audio-Visual Conditioned Video Prediction
Yating Xu
Conghui Hu
G. Lee
VGen
35
0
0
09 Dec 2022
Make-A-Story: Visual Memory Conditioned Consistent Story Generation
Make-A-Story: Visual Memory Conditioned Consistent Story Generation
Tanzila Rahman
Hsin-Ying Lee
Jian Ren
Sergey Tulyakov
Shweta Mahajan
Leonid Sigal
DiffM
16
68
0
23 Nov 2022
Tell Me What Happened: Unifying Text-guided Video Completion via
  Multimodal Masked Video Generation
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation
Tsu-jui Fu
Licheng Yu
Ning Zhang
Cheng-Yang Fu
Jong-Chyi Su
William Yang Wang
Sean Bell
VGen
40
37
0
23 Nov 2022
Disentangling Content and Motion for Text-Based Neural Video
  Manipulation
Disentangling Content and Motion for Text-Based Neural Video Manipulation
Levent Karacan
Tolga Kerimouglu
.Ismail .Inan
Tolga Birdal
Erkut Erdem
Aykut Erdem
16
1
0
05 Nov 2022
A unified model for continuous conditional video prediction
A unified model for continuous conditional video prediction
Xi Ye
Guillaume-Alexandre Bilodeau
AI4TS
32
7
0
11 Oct 2022
Phenaki: Variable Length Video Generation From Open Domain Textual
  Description
Phenaki: Variable Length Video Generation From Open Domain Textual Description
Ruben Villegas
Mohammad Babaeizadeh
Pieter-Jan Kindermans
Hernan Moraldo
Han Zhang
M. Saffar
Santiago Castro
Julius Kunze
D. Erhan
DiffM
VGen
43
371
0
05 Oct 2022
Temporal View Synthesis of Dynamic Scenes through 3D Object Motion
  Estimation with Multi-Plane Images
Temporal View Synthesis of Dynamic Scenes through 3D Object Motion Estimation with Multi-Plane Images
Nagabhushan Somraj
Pranali Sancheti
R. Soundararajan
27
4
0
19 Aug 2022
InfiniteNature-Zero: Learning Perpetual View Generation of Natural
  Scenes from Single Images
InfiniteNature-Zero: Learning Perpetual View Generation of Natural Scenes from Single Images
Zhengqi Li
Qianqian Wang
Noah Snavely
Angjoo Kanazawa
VGen
22
59
0
22 Jul 2022
Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive
  Learning
Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning
Cheng Tan
Zhangyang Gao
Lirong Wu
Yongjie Xu
Jun-Xiong Xia
Siyuan Li
Stan Z. Li
25
107
0
24 Jun 2022
MaskViT: Masked Visual Pre-Training for Video Prediction
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta
Stephen Tian
Yunzhi Zhang
Jiajun Wu
Roberto Martín-Martín
Li Fei-Fei
100
110
0
23 Jun 2022
SimVP: Simpler yet Better Video Prediction
SimVP: Simpler yet Better Video Prediction
Zhangyang Gao
Cheng Tan
Lirong Wu
Stan Z. Li
23
210
0
09 Jun 2022
Cascaded Video Generation for Videos In-the-Wild
Cascaded Video Generation for Videos In-the-Wild
Lluis Castrejon
Nicolas Ballas
Aaron Courville
VGen
24
0
0
01 Jun 2022
SwinVRNN: A Data-Driven Ensemble Forecasting Model via Learned
  Distribution Perturbation
SwinVRNN: A Data-Driven Ensemble Forecasting Model via Learned Distribution Perturbation
Yuan Hu
Lei Chen
Zhibin Wang
Hao Li
OOD
21
45
0
26 May 2022
Action Conditioned Tactile Prediction: case study on slip prediction
Action Conditioned Tactile Prediction: case study on slip prediction
Willow Mandil
Kiyanoush Nazari
E. AmirGhalamzan
22
15
0
19 May 2022
Predicting Future Occupancy Grids in Dynamic Environment with
  Spatio-Temporal Learning
Predicting Future Occupancy Grids in Dynamic Environment with Spatio-Temporal Learning
K. S. Mann
Abhishek Tomy
Anshul K. Paigwar
A. Renzaglia
Christian Laugier
26
10
0
06 May 2022
STAU: A SpatioTemporal-Aware Unit for Video Prediction and Beyond
STAU: A SpatioTemporal-Aware Unit for Video Prediction and Beyond
Zheng Chang
Xinfeng Zhang
Shanshe Wang
Siwei Ma
Wen Gao
28
1
0
20 Apr 2022
Variational Heteroscedastic Volatility Model
Variational Heteroscedastic Volatility Model
Zexuan Yin
P. Barucca
AI4TS
13
0
0
11 Apr 2022
Simple and Effective Synthesis of Indoor 3D Scenes
Simple and Effective Synthesis of Indoor 3D Scenes
Jing Yu Koh
Harsh Agrawal
Dhruv Batra
Richard Tucker
Austin Waters
Honglak Lee
Yinfei Yang
Jason Baldridge
Peter Anderson
VGen
3DV
13
29
0
06 Apr 2022
FoV-Net: Field-of-View Extrapolation Using Self-Attention and
  Uncertainty
FoV-Net: Field-of-View Extrapolation Using Self-Attention and Uncertainty
Liqian Ma
Stamatios Georgoulis
Xu Jia
Luc Van Gool
22
6
0
04 Apr 2022
STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution
  Video Prediction
STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction
Zheng Chang
Xinfeng Zhang
Shanshe Wang
Siwei Ma
Wen Gao
13
50
0
30 Mar 2022
VPTR: Efficient Transformers for Video Prediction
VPTR: Efficient Transformers for Video Prediction
Xi Ye
Guillaume-Alexandre Bilodeau
ViT
19
18
0
29 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
16
115
0
25 Mar 2022
Stochastic Video Prediction with Structure and Motion
Stochastic Video Prediction with Structure and Motion
Adil Kaan Akan
Sadra Safadoust
Fatma Guney
VGen
14
9
0
20 Mar 2022
MSPred: Video Prediction at Multiple Spatio-Temporal Scales with
  Hierarchical Recurrent Networks
MSPred: Video Prediction at Multiple Spatio-Temporal Scales with Hierarchical Recurrent Networks
Angel Villar-Corrales
Ani J. Karapetyan
Andreas Boltres
Sven Behnke
19
11
0
17 Mar 2022
Diffusion Probabilistic Modeling for Video Generation
Diffusion Probabilistic Modeling for Video Generation
Ruihan Yang
Prakhar Srivastava
Stephan Mandt
DiffM
VGen
32
255
0
16 Mar 2022
Generating Videos with Dynamics-aware Implicit Generative Adversarial
  Networks
Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks
Sihyun Yu
Jihoon Tack
Sangwoo Mo
Hyunsu Kim
Junho Kim
Jung-Woo Ha
Jinwoo Shin
DiffM
VGen
18
199
0
21 Feb 2022
AA-TransUNet: Attention Augmented TransUNet For Nowcasting Tasks
AA-TransUNet: Attention Augmented TransUNet For Nowcasting Tasks
Yimin Yang
S. Mehrkanoon
ViT
AI4TS
34
41
0
10 Feb 2022
Noether Networks: Meta-Learning Useful Conserved Quantities
Noether Networks: Meta-Learning Useful Conserved Quantities
Ferran Alet
Dylan D. Doblar
Allan Zhou
J. Tenenbaum
Kenji Kawaguchi
Chelsea Finn
65
26
0
06 Dec 2021
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion
Chenfei Wu
Jian Liang
Lei Ji
Fan Yang
Yuejian Fang
Daxin Jiang
Nan Duan
ViT
VGen
14
292
0
24 Nov 2021
Action2video: Generating Videos of Human 3D Actions
Action2video: Generating Videos of Human 3D Actions
Chuan Guo
X. Zuo
Sen Wang
Xinshuang Liu
Shihao Zou
Minglun Gong
Li Cheng
3DH
63
22
0
12 Nov 2021
Contrastively Disentangled Sequential Variational Autoencoder
Contrastively Disentangled Sequential Variational Autoencoder
M. Kiener
Weiran Wang
Michael Gerndt
CoGe
DRL
4
40
0
22 Oct 2021
LARNet: Latent Action Representation for Human Action Synthesis
LARNet: Latent Action Representation for Human Action Synthesis
Naman Biyani
A. J. Rana
Shruti Vyas
Y. S. Rawat
11
4
0
21 Oct 2021
A Hierarchical Variational Neural Uncertainty Model for Stochastic Video
  Prediction
A Hierarchical Variational Neural Uncertainty Model for Stochastic Video Prediction
Moitreya Chatterjee
N. Ahuja
A. Cherian
UQCV
VGen
BDL
29
17
0
06 Oct 2021
Diverse Generation from a Single Video Made Possible
Diverse Generation from a Single Video Made Possible
Niv Haim
Ben Feinstein
Niv Granot
Assaf Shocher
Shai Bagon
Tali Dekel
Michal Irani
DiffM
VGen
34
18
0
17 Sep 2021
Simple Video Generation using Neural ODEs
Simple Video Generation using Neural ODEs
David Kanaa
Vikram S. Voleti
Samira Ebrahimi Kahou
Christopher Pal
17
20
0
07 Sep 2021
Conditional Temporal Variational AutoEncoder for Action Video Prediction
Conditional Temporal Variational AutoEncoder for Action Video Prediction
Xiaogang Xu
Yi Wang
Liwei Wang
Bei Yu
Jiaya Jia
VGen
24
5
0
12 Aug 2021
12
Next