ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1804.01523
  4. Cited By
Stochastic Adversarial Video Prediction

Stochastic Adversarial Video Prediction

4 April 2018
Alex X. Lee
Richard Y. Zhang
F. Ebert
Pieter Abbeel
Chelsea Finn
Sergey Levine
    DRLVGenGAN
ArXiv (abs)PDFHTML

Papers citing "Stochastic Adversarial Video Prediction"

50 / 279 papers shown
Inference-time Stochastic Refinement of GRU-Normalizing Flow for Real-time Video Motion Transfer
Inference-time Stochastic Refinement of GRU-Normalizing Flow for Real-time Video Motion Transfer
Tasmiah Haque
Srinjoy Das
AI4TS
68
0
0
03 Dec 2025
Sequence-Adaptive Video Prediction in Continuous Streams using Diffusion Noise Optimization
Sequence-Adaptive Video Prediction in Continuous Streams using Diffusion Noise Optimization
Sina Mokhtarzadeh Azar
Emad Bahrami
Enrico Pallotta
Gianpiero Francesca
Radu Timofte
Juergen Gall
DiffM
121
0
0
23 Nov 2025
Ego-centric Predictive Model Conditioned on Hand Trajectories
Ego-centric Predictive Model Conditioned on Hand Trajectories
Binjie Zhang
Mike Zheng Shou
EgoV
296
0
0
27 Aug 2025
Video Generators are Robot Policies
Video Generators are Robot Policies
Junbang Liang
P. Tokmakov
Ruoshi Liu
Sruthi Sudhakar
Paarth Shah
Rares Andrei Ambrus
Carl Vondrick
VGen
284
15
0
01 Aug 2025
Video World Models with Long-term Spatial Memory
Tong Wu
Shuai Yang
Ryan Po
Yinghao Xu
Ziwei Liu
Dahua Lin
Gordon Wetzstein
VGenKELMVLM
328
41
0
05 Jun 2025
Ultrasound Image-to-Video Synthesis via Latent Dynamic Diffusion Models
Ultrasound Image-to-Video Synthesis via Latent Dynamic Diffusion ModelsInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Tingxiu Chen
Yilei Shi
Zixuan Zheng
Bingcong Yan
Jingliang Hu
Xiao Xiang Zhu
Lichao Mou
VGenMedIm
313
14
0
19 Mar 2025
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
Sihyun Yu
Meera Hahn
Dan Kondratyuk
Jinwoo Shin
Agrim Gupta
José Lezama
Irfan Essa
David A. Ross
Jonathan Huang
DiffMVGen
695
6
0
18 Feb 2025
MAUCell: An Adaptive Multi-Attention Framework for Video Frame Prediction
MAUCell: An Adaptive Multi-Attention Framework for Video Frame Prediction
Shreyam Gupta
P. Agrawal
Priyam Gupta
288
2
0
28 Jan 2025
Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers
Advancing Semantic Future Prediction through Multimodal Visual Sequence TransformersComputer Vision and Pattern Recognition (CVPR), 2025
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
375
3
0
14 Jan 2025
Can Generative Video Models Help Pose Estimation?
Can Generative Video Models Help Pose Estimation?Computer Vision and Pattern Recognition (CVPR), 2024
Ruojin Cai
Jason Y. Zhang
Philipp Henzler
Zhengqi Li
Noah Snavely
Ricardo Martín Brualla
VGen
210
6
0
20 Dec 2024
DINO-Foresight: Looking into the Future with DINO
DINO-Foresight: Looking into the Future with DINO
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
AI4CE
616
15
0
16 Dec 2024
From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
From Slow Bidirectional to Fast Autoregressive Video Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2024
Tianwei Yin
Qiang Zhang
Richard Zhang
William T. Freeman
F. Durand
Eli Shechtman
Xun Huang
VGenDiffM
592
11
0
10 Dec 2024
Efficient Continuous Video Flow Model for Video Prediction
Efficient Continuous Video Flow Model for Video Prediction
Gaurav Shrivastava
Abhinav Shrivastava
VGen
239
0
0
07 Dec 2024
Continuous Video Process: Modeling Videos as Continuous
  Multi-Dimensional Processes for Video Prediction
Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction
Gaurav Shrivastava
Abhinav Shrivastava
VGenDiffM
280
0
0
06 Dec 2024
Lightweight Stochastic Video Prediction via Hybrid Warping
Lightweight Stochastic Video Prediction via Hybrid WarpingVisual Communications and Image Processing (VCIP), 2024
Kazuki Kotoyori
Shota Hirose
Heming Sun
J. Katto
339
0
0
04 Dec 2024
Motion Graph Unleashed: A Novel Approach to Video Prediction
Motion Graph Unleashed: A Novel Approach to Video PredictionNeural Information Processing Systems (NeurIPS), 2024
Yiqi Zhong
Luming Liang
Bohan Tang
Ilya Zharkov
Ulrich Neumann
310
4
0
29 Oct 2024
Fréchet Video Motion Distance: A Metric for Evaluating Motion
  Consistency in Videos
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos
Jiahe Liu
Youran Qu
Qi Yan
Fangyin Wei
Lele Wang
Renjie Liao
VGenEGVM
335
28
0
23 Jul 2024
Learning Granular Media Avalanche Behavior for Indirectly Manipulating
  Obstacles on a Granular Slope
Learning Granular Media Avalanche Behavior for Indirectly Manipulating Obstacles on a Granular Slope
Haodi Hu
Feifei Qian
Daniel Seita
269
3
0
02 Jul 2024
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Junbang Liang
Ruoshi Liu
Ege Ozguroglu
Sruthi Sudhakar
Achal Dave
P. Tokmakov
Shuran Song
Carl Vondrick
VGen
235
58
0
24 Jun 2024
Video Generation with Learned Action Prior
Video Generation with Learned Action Prior
Meenakshi Sarkar
Devansh Bhardwaj
Debasish Ghose
VGenGAN
306
0
0
20 Jun 2024
Tell Me What's Next: Textual Foresight for Generic UI Representations
Tell Me What's Next: Textual Foresight for Generic UI Representations
Andrea Burns
Kate Saenko
Bryan A. Plummer
LM&RoAI4TS
282
7
0
12 Jun 2024
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video
  Prediction
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction
Zhen Xing
Jingdong Sun
Zejia Weng
Zuxuan Wu
Yu-Gang Jiang
VGen
295
23
0
10 Jun 2024
Timely Communications for Remote Inference
Timely Communications for Remote Inference
Md Kamran Chowdhury Shisher
Yin Sun
I-Hong Hou
244
23
0
25 Apr 2024
VideoGigaGAN: Towards Detail-rich Video Super-Resolution
VideoGigaGAN: Towards Detail-rich Video Super-Resolution
Yiran Xu
Taesung Park
Richard Zhang
Yang Zhou
Eli Shechtman
Feng Liu
Jia-Bin Huang
Difan Liu
SupR
323
25
0
18 Apr 2024
State-space Decomposition Model for Video Prediction Considering
  Long-term Motion Trend
State-space Decomposition Model for Video Prediction Considering Long-term Motion Trend
Fei Cui
Jiaojiao Fang
Xiaojiang Wu
Zelong Lai
Mengke Yang
Menghan Jia
Guizhong Liu
144
0
0
17 Apr 2024
Action-conditioned video data improves predictability
Action-conditioned video data improves predictability
Meenakshi Sarkar
Debasish Ghose
VGen
328
0
0
08 Apr 2024
VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate
  Spatiotemporal Forecasting
VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting
Yujin Tang
Peijie Dong
Zhenheng Tang
Xiaowen Chu
Junwei Liang
Mamba
332
52
0
25 Mar 2024
Efficient Video Diffusion Models via Content-Frame Motion-Latent
  Decomposition
Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
Sihyun Yu
Weili Nie
De-An Huang
Boyi Li
Jinwoo Shin
A. Anandkumar
VGenDiffM
277
25
0
21 Mar 2024
Probabilistic Forecasting with Stochastic Interpolants and Föllmer
  Processes
Probabilistic Forecasting with Stochastic Interpolants and Föllmer Processes
Yifan Chen
Mark Goldstein
Mengjian Hua
M. S. Albergo
Nicholas M. Boffi
Eric Vanden-Eijnden
AI4TS
353
35
0
20 Mar 2024
Towards Scene Graph Anticipation
Towards Scene Graph AnticipationEuropean Conference on Computer Vision (ECCV), 2024
Rohith Peddi
Saksham Singh
Saurabh
Parag Singla
Vibhav Gogate
370
8
0
07 Mar 2024
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video
  Synthesis
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Willi Menapace
Aliaksandr Siarohin
Ivan Skorokhodov
Ekaterina Deyneka
Tsai-Shien Chen
...
Yuwei Fang
A. Stoliar
Elisa Ricci
Jian Ren
Sergey Tulyakov
VGen
340
100
0
22 Feb 2024
Sign Language Production with Latent Motion Transformer
Sign Language Production with Latent Motion Transformer
Pan Xie
Taiying Peng
Yao Du
Qipeng Zhang
SLR
207
10
0
20 Dec 2023
STDiff: Spatio-temporal Diffusion for Continuous Stochastic Video
  Prediction
STDiff: Spatio-temporal Diffusion for Continuous Stochastic Video Prediction
Xi Ye
Guillaume-Alexandre Bilodeau
VGenDiffM
217
18
0
11 Dec 2023
DiffCast: A Unified Framework via Residual Diffusion for Precipitation
  Nowcasting
DiffCast: A Unified Framework via Residual Diffusion for Precipitation Nowcasting
Demin Yu
Xutao Li
Yunming Ye
Baoquan Zhang
Chuyao Luo
Kuai Dai
Rui Wang
Xunlai Chen
372
58
0
11 Dec 2023
Event-based Continuous Color Video Decompression from Single Frames
Event-based Continuous Color Video Decompression from Single Frames
ZiYun Wang
Friedhelm Hamann
Kenneth Chaney
Wen Jiang
Martin Braschler
Kostas Daniilidis
545
11
0
30 Nov 2023
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large
  Datasets
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
...
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
VGen
979
1,974
0
25 Nov 2023
FusionFrames: Efficient Architectural Aspects for Text-to-Video
  Generation Pipeline
FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline
V.Ya. Arkhipkin
Zein Shaheen
Viacheslav Vasilev
E. Dakhova
Andrey Kuznetsov
Denis Dimitrov
DiffMVGen
301
8
0
22 Nov 2023
Pair-wise Layer Attention with Spatial Masking for Video Prediction
Pair-wise Layer Attention with Spatial Masking for Video Prediction
Ping Li
Chenhan Zhang
Zheng Yang
Xianghua Xu
Mingli Song
212
0
0
19 Nov 2023
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
DynamiCrafter: Animating Open-domain Images with Video Diffusion PriorsEuropean Conference on Computer Vision (ECCV), 2023
Jinbo Xing
Menghan Xia
Yong Zhang
Haoxin Chen
Wangbo Yu
Hanyuan Liu
Xintao Wang
Tien-Tsin Wong
Ying Shan
VGen
310
415
0
18 Oct 2023
Generative Image Dynamics
Generative Image DynamicsComputer Vision and Pattern Recognition (CVPR), 2023
Zhengqi Li
Richard Tucker
Noah Snavely
Aleksander Holynski
DiffM
358
93
0
14 Sep 2023
MMVP: Motion-Matrix-based Video Prediction
MMVP: Motion-Matrix-based Video PredictionIEEE International Conference on Computer Vision (ICCV), 2023
Yiqi Zhong
Luming Liang
Ilya Zharkov
Ulrich Neumann
235
30
0
30 Aug 2023
Structured World Models from Human Videos
Structured World Models from Human Videos
Russell Mendonca
Shikhar Bahl
Deepak Pathak
LM&Ro
244
140
0
21 Aug 2023
SwinLSTM:Improving Spatiotemporal Prediction Accuracy using Swin
  Transformer and LSTM
SwinLSTM:Improving Spatiotemporal Prediction Accuracy using Swin Transformer and LSTMIEEE International Conference on Computer Vision (ICCV), 2023
Song Tang
Chuang Li
Pufen Zhang
R. Tang
AI4TS
205
98
0
19 Aug 2023
SimDA: Simple Diffusion Adapter for Efficient Video Generation
SimDA: Simple Diffusion Adapter for Efficient Video GenerationComputer Vision and Pattern Recognition (CVPR), 2023
Zhen Xing
Jingdong Sun
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGenDiffM
268
105
0
18 Aug 2023
S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized
  Variational Autoencoder for Video Prediction
S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video PredictionIEEE transactions on multimedia (IEEE TMM), 2023
Mohammad Adiban
Kalin Stefanov
Sabato Marco Siniscalchi
G. Salvi
249
3
0
13 Jul 2023
Action-conditioned Deep Visual Prediction with RoAM, a new Indoor Human
  Motion Dataset for Autonomous Robots
Action-conditioned Deep Visual Prediction with RoAM, a new Indoor Human Motion Dataset for Autonomous RobotsIEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2023
Meenakshi Sarkar
V. Honkote
D. Das
D. Ghose
187
3
0
28 Jun 2023
SkyGPT: Probabilistic Short-term Solar Forecasting Using Synthetic Sky
  Videos from Physics-constrained VideoGPT
SkyGPT: Probabilistic Short-term Solar Forecasting Using Synthetic Sky Videos from Physics-constrained VideoGPT
Yuhao Nie
E. Zelikman
Andea Scott
Quentin Paletta
A. Brandt
333
4
0
20 Jun 2023
Fast Fourier Inception Networks for Occluded Video Prediction
Fast Fourier Inception Networks for Occluded Video PredictionIEEE transactions on multimedia (IEEE TMM), 2023
Ping Li
Chenhan Zhang
Xianghua Xu
185
11
0
17 Jun 2023
DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic
  Latent Particles
DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles
Tal Daniel
Aviv Tamar
DiffM
255
13
0
09 Jun 2023
Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object
  Video Generation
Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video GenerationAAAI Conference on Artificial Intelligence (AAAI), 2023
A. Davtyan
Paolo Favaro
VGen
282
7
0
06 Jun 2023
123456
Next
Page 1 of 6