Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1804.01523
Cited By
Stochastic Adversarial Video Prediction
4 April 2018
Alex X. Lee
Richard Y. Zhang
F. Ebert
Pieter Abbeel
Chelsea Finn
Sergey Levine
DRL
VGen
GAN
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Stochastic Adversarial Video Prediction"
50 / 83 papers shown
Title
Ultrasound Image-to-Video Synthesis via Latent Dynamic Diffusion Models
Tingxiu Chen
Yilei Shi
Zixuan Zheng
Bingcong Yan
Jingliang Hu
Xiao Xiang Zhu
Lichao Mou
VGen
MedIm
49
3
0
19 Mar 2025
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
Sihyun Yu
Meera Hahn
Dan Kondratyuk
Jinwoo Shin
Agrim Gupta
José Lezama
Irfan Essa
David A. Ross
Jonathan Huang
DiffM
VGen
72
0
0
18 Feb 2025
MAUCell: An Adaptive Multi-Attention Framework for Video Frame Prediction
Shreyam Gupta
P. Agrawal
Priyam Gupta
67
0
0
28 Jan 2025
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Junbang Liang
Ruoshi Liu
Ege Ozguroglu
Sruthi Sudhakar
Achal Dave
P. Tokmakov
Shuran Song
Carl Vondrick
VGen
40
22
0
24 Jun 2024
Action-conditioned video data improves predictability
Meenakshi Sarkar
Debasish Ghose
VGen
33
0
0
08 Apr 2024
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Willi Menapace
Aliaksandr Siarohin
Ivan Skorokhodov
Ekaterina Deyneka
Tsai-Shien Chen
...
Yuwei Fang
A. Stoliar
Elisa Ricci
Jian Ren
Sergey Tulyakov
VGen
38
56
0
22 Feb 2024
DiffCast: A Unified Framework via Residual Diffusion for Precipitation Nowcasting
Demin Yu
Xutao Li
Yunming Ye
Baoquan Zhang
Chuyao Luo
Kuai Dai
Rui Wang
Xunlai Chen
28
20
0
11 Dec 2023
Event-based Continuous Color Video Decompression from Single Frames
ZiYun Wang
Friedhelm Hamann
Kenneth Chaney
Wen Jiang
Martin Braschler
Kostas Daniilidis
38
5
0
30 Nov 2023
FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline
V.Ya. Arkhipkin
Zein Shaheen
Viacheslav Vasilev
E. Dakhova
Andrey Kuznetsov
Denis Dimitrov
DiffM
VGen
21
5
0
22 Nov 2023
Structured World Models from Human Videos
Russell Mendonca
Shikhar Bahl
Deepak Pathak
LM&Ro
21
85
0
21 Aug 2023
SkyGPT: Probabilistic Short-term Solar Forecasting Using Synthetic Sky Videos from Physics-constrained VideoGPT
Yuhao Nie
E. Zelikman
Andea Scott
Quentin Paletta
A. Brandt
26
3
0
20 Jun 2023
3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging Scenes
Haotian Xue
Antonio Torralba
J. Tenenbaum
Daniel L. K. Yamins
Yunzhu Li
H. Tung
PINN
VGen
AI4CE
46
8
0
22 Apr 2023
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
A. Blattmann
Robin Rombach
Huan Ling
Tim Dockhorn
Seung Wook Kim
Sanja Fidler
Karsten Kreis
3DGS
VGen
72
1,010
0
18 Apr 2023
Multi-modal learning for geospatial vegetation forecasting
V. Benson
Claire Robin
C. Requena-Mesa
Lazaro Alonso
Nuno Carvalhais
José A. Cortés
Zhihan Gao
Nora Linscheid
M. Weynants
Markus Reichstein
24
11
0
28 Mar 2023
Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformers
Jaehoon Yoo
Semin Kim
Doyup Lee
Chiheon Kim
Seunghoon Hong
21
3
0
20 Mar 2023
Long-horizon video prediction using a dynamic latent hierarchy
Alexey Zakharov
Qinghai Guo
Z. Fountas
19
4
0
29 Dec 2022
Towards Smooth Video Composition
Qihang Zhang
Ceyuan Yang
Yujun Shen
Yinghao Xu
Bolei Zhou
VGen
31
14
0
14 Dec 2022
Motion and Context-Aware Audio-Visual Conditioned Video Prediction
Yating Xu
Conghui Hu
G. Lee
VGen
35
0
0
09 Dec 2022
Temporal View Synthesis of Dynamic Scenes through 3D Object Motion Estimation with Multi-Plane Images
Nagabhushan Somraj
Pranali Sancheti
R. Soundararajan
27
4
0
19 Aug 2022
Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning
Cheng Tan
Zhangyang Gao
Lirong Wu
Yongjie Xu
Jun-Xiong Xia
Siyuan Li
Stan Z. Li
25
107
0
24 Jun 2022
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta
Stephen Tian
Yunzhi Zhang
Jiajun Wu
Roberto Martín-Martín
Li Fei-Fei
100
110
0
23 Jun 2022
SimVP: Simpler yet Better Video Prediction
Zhangyang Gao
Cheng Tan
Lirong Wu
Stan Z. Li
23
210
0
09 Jun 2022
Cascaded Video Generation for Videos In-the-Wild
Lluis Castrejon
Nicolas Ballas
Aaron Courville
VGen
24
0
0
01 Jun 2022
Action Conditioned Tactile Prediction: case study on slip prediction
Willow Mandil
Kiyanoush Nazari
E. AmirGhalamzan
22
15
0
19 May 2022
Synthetic Data -- what, why and how?
James Jordon
Lukasz Szpruch
F. Houssiau
M. Bottarelli
Giovanni Cherubin
Carsten Maple
Samuel N. Cohen
Adrian Weller
35
109
0
06 May 2022
STAU: A SpatioTemporal-Aware Unit for Video Prediction and Beyond
Zheng Chang
Xinfeng Zhang
Shanshe Wang
Siwei Ma
Wen Gao
28
1
0
20 Apr 2022
Video Diffusion Models
Jonathan Ho
Tim Salimans
Alexey A. Gritsenko
William Chan
Mohammad Norouzi
David J. Fleet
DiffM
VGen
27
1,503
0
07 Apr 2022
Traffic4cast at NeurIPS 2021 -- Temporal and Spatial Few-Shot Transfer Learning in Gridded Geo-Spatial Processes
Christian Eichenberger
M. Neun
Henry Martin
Pedro Herruzo
M. Spanring
...
Fei Tang
A. Gruca
Michael K Kopp
David P. Kreil
Sepp Hochreiter
38
18
0
31 Mar 2022
STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction
Zheng Chang
Xinfeng Zhang
Shanshe Wang
Siwei Ma
Wen Gao
13
50
0
30 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
16
115
0
25 Mar 2022
Stochastic Video Prediction with Structure and Motion
Adil Kaan Akan
Sadra Safadoust
Fatma Guney
VGen
19
9
0
20 Mar 2022
Diffusion Probabilistic Modeling for Video Generation
Ruihan Yang
Prakhar Srivastava
Stephan Mandt
DiffM
VGen
32
255
0
16 Mar 2022
Playable Environments: Video Manipulation in Space and Time
Willi Menapace
Stéphane Lathuilière
Aliaksandr Siarohin
Christian Theobalt
Sergey Tulyakov
Vladislav Golyanik
Elisa Ricci
VGen
19
22
0
03 Mar 2022
Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks
Sihyun Yu
Jihoon Tack
Sangwoo Mo
Hyunsu Kim
Junho Kim
Jung-Woo Ha
Jinwoo Shin
DiffM
VGen
18
199
0
21 Feb 2022
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion
Chenfei Wu
Jian Liang
Lei Ji
Fan Yang
Yuejian Fang
Daxin Jiang
Nan Duan
ViT
VGen
14
292
0
24 Nov 2021
LARNet: Latent Action Representation for Human Action Synthesis
Naman Biyani
A. J. Rana
Shruti Vyas
Y. S. Rawat
11
4
0
21 Oct 2021
Taming Visually Guided Sound Generation
Vladimir E. Iashin
Esa Rahtu
VLM
28
120
0
17 Oct 2021
Diverse Generation from a Single Video Made Possible
Niv Haim
Ben Feinstein
Niv Granot
Assaf Shocher
Shai Bagon
Tali Dekel
Michal Irani
DiffM
VGen
34
18
0
17 Sep 2021
A Framework for Multisensory Foresight for Embodied Agents
Xiaohui Chen
Ramtin Hosseini
K. Panetta
Jivko Sinapov
10
3
0
15 Sep 2021
Simple Video Generation using Neural ODEs
David Kanaa
Vikram S. Voleti
Samira Ebrahimi Kahou
Christopher Pal
19
20
0
07 Sep 2021
Conditional Temporal Variational AutoEncoder for Action Video Prediction
Xiaogang Xu
Yi Wang
Liwei Wang
Bei Yu
Jiaya Jia
VGen
24
5
0
12 Aug 2021
Video Generation from Text Employing Latent Path Construction for Temporal Modeling
Amir Mazaheri
M. Shah
20
8
0
29 Jul 2021
Insights from Generative Modeling for Neural Video Compression
Ruihan Yang
Yibo Yang
Joseph Marino
Stephan Mandt
VGen
27
15
0
28 Jul 2021
Physion: Evaluating Physical Prediction from Vision in Humans and Machines
Daniel M. Bear
E. Wang
Damian Mrowca
Felix Binder
Hsiau-Yu Fish Tung
...
Li Fei-Fei
Nancy Kanwisher
J. Tenenbaum
Daniel L. K. Yamins
Judith E. Fan
OOD
45
86
0
15 Jun 2021
Hierarchical Video Generation for Complex Data
Lluis Castrejon
Nicolas Ballas
Aaron Courville
VGen
12
4
0
04 Jun 2021
EarthNet2021: A large-scale dataset and challenge for Earth surface forecasting as a guided video prediction task
C. Requena-Mesa
V. Benson
Markus Reichstein
J. Runge
Joachim Denzler
66
50
0
16 Apr 2021
Video Prediction Recalling Long-term Motion Context via Memory Alignment Learning
Sangmin Lee
Hak Gu Kim
Dae Hwi Choi
Hyungil Kim
Yong Man Ro
20
102
0
02 Apr 2021
Self-Supervision by Prediction for Object Discovery in Videos
Beril Besbinar
P. Frossard
SSL
19
7
0
09 Mar 2021
Deep Generative Modelling: A Comparative Review of VAEs, GANs, Normalizing Flows, Energy-Based and Autoregressive Models
Sam Bond-Taylor
Adam Leach
Yang Long
Chris G. Willcocks
VLM
TPM
36
477
0
08 Mar 2021
DMotion: Robotic Visuomotor Control with Unsupervised Forward Model Learned from Videos
Haoqi Yuan
Ruihai Wu
Andrew Zhao
Hanwang Zhang
Zihan Ding
Hao Dong
19
3
0
07 Mar 2021
1
2
Next