1804.01523
Cited By
Stochastic Adversarial Video Prediction
4 April 2018
Alex X. Lee
Richard Y. Zhang
F. Ebert
Pieter Abbeel
Chelsea Finn
Sergey Levine
DRL
VGen
GAN
Papers citing "Stochastic Adversarial Video Prediction"
50 / 279 papers shown
Inference-time Stochastic Refinement of GRU-Normalizing Flow for Real-time Video Motion Transfer
Tasmiah Haque
Srinjoy Das
AI4TS
28
0
0
03 Dec 2025
Sequence-Adaptive Video Prediction in Continuous Streams using Diffusion Noise Optimization
Sina Mokhtarzadeh Azar
Emad Bahrami
Enrico Pallotta
Gianpiero Francesca
Radu Timofte
Juergen Gall
DiffM
108
0
0
23 Nov 2025
Ego-centric Predictive Model Conditioned on Hand Trajectories
Binjie Zhang
Mike Zheng Shou
EgoV
286
0
0
27 Aug 2025
Video Generators are Robot Policies
Junbang Liang
P. Tokmakov
Ruoshi Liu
Sruthi Sudhakar
Paarth Shah
Rares Andrei Ambrus
Carl Vondrick
VGen
267
11
0
01 Aug 2025
Video World Models with Long-term Spatial Memory
Tong Wu
Shuai Yang
Ryan Po
Yinghao Xu
Ziwei Liu
Dahua Lin
Gordon Wetzstein
VGen
KELM
VLM
316
37
0
05 Jun 2025
Ultrasound Image-to-Video Synthesis via Latent Dynamic Diffusion Models
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Tingxiu Chen
Yilei Shi
Zixuan Zheng
Bingcong Yan
Jingliang Hu
Xiao Xiang Zhu
Lichao Mou
VGen
MedIm
293
14
0
19 Mar 2025
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
Sihyun Yu
Meera Hahn
Dan Kondratyuk
Jinwoo Shin
Agrim Gupta
José Lezama
Irfan Essa
David A. Ross
Jonathan Huang
DiffM
VGen
661
5
0
18 Feb 2025
MAUCell: An Adaptive Multi-Attention Framework for Video Frame Prediction
Shreyam Gupta
P. Agrawal
Priyam Gupta
250
1
0
28 Jan 2025
Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers
Computer Vision and Pattern Recognition (CVPR), 2025
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
343
3
0
14 Jan 2025
Can Generative Video Models Help Pose Estimation?
Computer Vision and Pattern Recognition (CVPR), 2024
Ruojin Cai
Jason Y. Zhang
Philipp Henzler
Zhengqi Li
Noah Snavely
Ricardo Martín Brualla
VGen
192
6
0
20 Dec 2024
DINO-Foresight: Looking into the Future with DINO
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
AI4CE
584
14
0
16 Dec 2024
From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2024
Tianwei Yin
Qiang Zhang
Richard Zhang
William T. Freeman
F. Durand
Eli Shechtman
Xun Huang
VGen
DiffM
564
11
0
10 Dec 2024
Efficient Continuous Video Flow Model for Video Prediction
Gaurav Shrivastava
Abhinav Shrivastava
VGen
205
0
0
07 Dec 2024
Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction
Gaurav Shrivastava
Abhinav Shrivastava
VGen
DiffM
256
0
0
06 Dec 2024
Lightweight Stochastic Video Prediction via Hybrid Warping
Visual Communications and Image Processing (VCIP), 2024
Kazuki Kotoyori
Shota Hirose
Heming Sun
J. Katto
288
0
0
04 Dec 2024
Motion Graph Unleashed: A Novel Approach to Video Prediction
Neural Information Processing Systems (NeurIPS), 2024
Yiqi Zhong
Luming Liang
Bohan Tang
Ilya Zharkov
Ulrich Neumann
289
4
0
29 Oct 2024
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos
Jiahe Liu
Youran Qu
Qi Yan
Fangyin Wei
Lele Wang
Renjie Liao
VGen
EGVM
305
28
0
23 Jul 2024
Learning Granular Media Avalanche Behavior for Indirectly Manipulating Obstacles on a Granular Slope
Haodi Hu
Feifei Qian
Daniel Seita
242
3
0
02 Jul 2024
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Junbang Liang
Ruoshi Liu
Ege Ozguroglu
Sruthi Sudhakar
Achal Dave
P. Tokmakov
Shuran Song
Carl Vondrick
VGen
223
55
0
24 Jun 2024
Video Generation with Learned Action Prior
Meenakshi Sarkar
Devansh Bhardwaj
Debasish Ghose
VGen
GAN
295
0
0
20 Jun 2024
Tell Me What's Next: Textual Foresight for Generic UI Representations
Andrea Burns
Kate Saenko
Bryan A. Plummer
LM&Ro
AI4TS
265
7
0
12 Jun 2024
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction
Zhen Xing
Jingdong Sun
Zejia Weng
Zuxuan Wu
Yu-Gang Jiang
VGen
284
21
0
10 Jun 2024
Timely Communications for Remote Inference
Md Kamran Chowdhury Shisher
Yin Sun
I-Hong Hou
234
22
0
25 Apr 2024
VideoGigaGAN: Towards Detail-rich Video Super-Resolution
Yiran Xu
Taesung Park
Richard Zhang
Yang Zhou
Eli Shechtman
Feng Liu
Jia-Bin Huang
Difan Liu
SupR
320
25
0
18 Apr 2024
State-space Decomposition Model for Video Prediction Considering Long-term Motion Trend
Fei Cui
Jiaojiao Fang
Xiaojiang Wu
Zelong Lai
Mengke Yang
Menghan Jia
Guizhong Liu
109
0
0
17 Apr 2024
Action-conditioned video data improves predictability
Meenakshi Sarkar
Debasish Ghose
VGen
295
0
0
08 Apr 2024
VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting
Yujin Tang
Peijie Dong
Zhenheng Tang
Xiaowen Chu
Junwei Liang
Mamba
316
49
0
25 Mar 2024
Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
Sihyun Yu
Weili Nie
De-An Huang
Boyi Li
Jinwoo Shin
A. Anandkumar
VGen
DiffM
250
24
0
21 Mar 2024
Probabilistic Forecasting with Stochastic Interpolants and Föllmer Processes
Yifan Chen
Mark Goldstein
Mengjian Hua
M. S. Albergo
Nicholas M. Boffi
Eric Vanden-Eijnden
AI4TS
332
35
0
20 Mar 2024
Towards Scene Graph Anticipation
European Conference on Computer Vision (ECCV), 2024
Rohith Peddi
Saksham Singh
Saurabh
Parag Singla
Vibhav Gogate
350
7
0
07 Mar 2024
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Willi Menapace
Aliaksandr Siarohin
Ivan Skorokhodov
Ekaterina Deyneka
Tsai-Shien Chen
...
Yuwei Fang
A. Stoliar
Elisa Ricci
Jian Ren
Sergey Tulyakov
VGen
303
96
0
22 Feb 2024
Sign Language Production with Latent Motion Transformer
Pan Xie
Taiying Peng
Yao Du
Qipeng Zhang
SLR
187
10
0
20 Dec 2023
STDiff: Spatio-temporal Diffusion for Continuous Stochastic Video Prediction
Xi Ye
Guillaume-Alexandre Bilodeau
VGen
DiffM
208
18
0
11 Dec 2023
DiffCast: A Unified Framework via Residual Diffusion for Precipitation Nowcasting
Demin Yu
Xutao Li
Yunming Ye
Baoquan Zhang
Chuyao Luo
Kuai Dai
Rui Wang
Xunlai Chen
344
56
0
11 Dec 2023
Event-based Continuous Color Video Decompression from Single Frames
ZiYun Wang
Friedhelm Hamann
Kenneth Chaney
Wen Jiang
Martin Braschler
Kostas Daniilidis
501
11
0
30 Nov 2023
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
...
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
VGen
913
1,923
0
25 Nov 2023
FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline
V.Ya. Arkhipkin
Zein Shaheen
Viacheslav Vasilev
E. Dakhova
Andrey Kuznetsov
Denis Dimitrov
DiffM
VGen
287
7
0
22 Nov 2023
Pair-wise Layer Attention with Spatial Masking for Video Prediction
Ping Li
Chenhan Zhang
Zheng Yang
Xianghua Xu
Mingli Song
196
0
0
19 Nov 2023
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
European Conference on Computer Vision (ECCV), 2023
Jinbo Xing
Menghan Xia
Yong Zhang
Haoxin Chen
Wangbo Yu
Hanyuan Liu
Xintao Wang
Tien-Tsin Wong
Ying Shan
VGen
276
403
0
18 Oct 2023
Generative Image Dynamics
Computer Vision and Pattern Recognition (CVPR), 2023
Zhengqi Li
Richard Tucker
Noah Snavely
Aleksander Holynski
DiffM
330
91
0
14 Sep 2023
MMVP: Motion-Matrix-based Video Prediction
IEEE International Conference on Computer Vision (ICCV), 2023
Yiqi Zhong
Luming Liang
Ilya Zharkov
Ulrich Neumann
187
30
0
30 Aug 2023
Structured World Models from Human Videos
Russell Mendonca
Shikhar Bahl
Deepak Pathak
LM&Ro
224
138
0
21 Aug 2023
SwinLSTM: Improving Spatiotemporal Prediction Accuracy using Swin Transformer and LSTM
IEEE International Conference on Computer Vision (ICCV), 2023
Song Tang
Chuang Li
Pufen Zhang
R. Tang
AI4TS
176
96
0
19 Aug 2023
SimDA: Simple Diffusion Adapter for Efficient Video Generation
Computer Vision and Pattern Recognition (CVPR), 2023
Zhen Xing
Jingdong Sun
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGen
DiffM
218
105
0
18 Aug 2023
S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video Prediction
IEEE Transactions on Multimedia (IEEE TMM), 2023
Mohammad Adiban
Kalin Stefanov
Sabato Marco Siniscalchi
G. Salvi
237
3
0
13 Jul 2023
Action-conditioned Deep Visual Prediction with RoAM, a new Indoor Human Motion Dataset for Autonomous Robots
IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2023
Meenakshi Sarkar
V. Honkote
D. Das
D. Ghose
172
3
0
28 Jun 2023
SkyGPT: Probabilistic Short-term Solar Forecasting Using Synthetic Sky Videos from Physics-constrained VideoGPT
Yuhao Nie
E. Zelikman
Andea Scott
Quentin Paletta
A. Brandt
301
4
0
20 Jun 2023
Fast Fourier Inception Networks for Occluded Video Prediction
IEEE Transactions on Multimedia (IEEE TMM), 2023
Ping Li
Chenhan Zhang
Xianghua Xu
180
10
0
17 Jun 2023
DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles
Tal Daniel
Aviv Tamar
DiffM
226
13
0
09 Jun 2023
Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video Generation
AAAI Conference on Artificial Intelligence (AAAI), 2023
A. Davtyan
Paolo Favaro
VGen
243
7
0
06 Jun 2023