Stochastic Adversarial Video Prediction

4 April 2018
Alex X. Lee
Richard Y. Zhang
F. Ebert
Pieter Abbeel
Chelsea Finn
Sergey Levine
DRL, VGen, GAN

Papers citing "Stochastic Adversarial Video Prediction"

50 / 278 papers shown
Sequence-Adaptive Video Prediction in Continuous Streams using Diffusion Noise Optimization
Sina Mokhtarzadeh Azar
Emad Bahrami
Enrico Pallotta
Gianpiero Francesca
Radu Timofte
Juergen Gall
DiffM
88
0
0
23 Nov 2025
Ego-centric Predictive Model Conditioned on Hand Trajectories
Binjie Zhang
Mike Zheng Shou
EgoV
254
0
0
27 Aug 2025
Video Generators are Robot Policies
Junbang Liang
P. Tokmakov
Ruoshi Liu
Sruthi Sudhakar
Paarth Shah
Rares Andrei Ambrus
Carl Vondrick
VGen
215
10
0
01 Aug 2025
Video World Models with Long-term Spatial Memory
Tong Wu
Shuai Yang
Ryan Po
Yinghao Xu
Ziwei Liu
Dahua Lin
Gordon Wetzstein
VGen, KELM, VLM
288
29
0
05 Jun 2025
Ultrasound Image-to-Video Synthesis via Latent Dynamic Diffusion Models
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Tingxiu Chen
Yilei Shi
Zixuan Zheng
Bingcong Yan
Jingliang Hu
Xiao Xiang Zhu
Lichao Mou
VGen, MedIm
265
13
0
19 Mar 2025
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
Sihyun Yu
Meera Hahn
Dan Kondratyuk
Jinwoo Shin
Agrim Gupta
José Lezama
Irfan Essa
David A. Ross
Jonathan Huang
DiffM, VGen
597
5
0
18 Feb 2025
MAUCell: An Adaptive Multi-Attention Framework for Video Frame Prediction
Shreyam Gupta
P. Agrawal
Priyam Gupta
246
1
0
28 Jan 2025
Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers
Computer Vision and Pattern Recognition (CVPR), 2025
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
299
5
0
14 Jan 2025
Can Generative Video Models Help Pose Estimation?
Computer Vision and Pattern Recognition (CVPR), 2024
Ruojin Cai
Jason Y. Zhang
Philipp Henzler
Zhengqi Li
Noah Snavely
Ricardo Martín Brualla
VGen
162
6
0
20 Dec 2024
DINO-Foresight: Looking into the Future with DINO
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
AI4CE
516
14
0
16 Dec 2024
From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2024
Tianwei Yin
Qiang Zhang
Richard Zhang
William T. Freeman
F. Durand
Eli Shechtman
Xun Huang
VGen, DiffM
452
11
0
10 Dec 2024
Efficient Continuous Video Flow Model for Video Prediction
Gaurav Shrivastava
Abhinav Shrivastava
VGen
197
0
0
07 Dec 2024
Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction
Gaurav Shrivastava
Abhinav Shrivastava
VGen, DiffM
240
0
0
06 Dec 2024
Lightweight Stochastic Video Prediction via Hybrid Warping
Visual Communications and Image Processing (VCIP), 2024
Kazuki Kotoyori
Shota Hirose
Heming Sun
J. Katto
224
0
0
04 Dec 2024
Motion Graph Unleashed: A Novel Approach to Video Prediction
Neural Information Processing Systems (NeurIPS), 2024
Yiqi Zhong
Luming Liang
Bohan Tang
Ilya Zharkov
Ulrich Neumann
265
3
0
29 Oct 2024
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos
Jiahe Liu
Youran Qu
Qi Yan
Fangyin Wei
Lele Wang
Renjie Liao
VGen, EGVM
277
28
0
23 Jul 2024
Learning Granular Media Avalanche Behavior for Indirectly Manipulating Obstacles on a Granular Slope
Haodi Hu
Feifei Qian
Daniel Seita
214
3
0
02 Jul 2024
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Junbang Liang
Ruoshi Liu
Ege Ozguroglu
Sruthi Sudhakar
Achal Dave
P. Tokmakov
Shuran Song
Carl Vondrick
VGen
213
54
0
24 Jun 2024
Video Generation with Learned Action Prior
Meenakshi Sarkar
Devansh Bhardwaj
Debasish Ghose
VGen, GAN
275
0
0
20 Jun 2024
Tell Me What's Next: Textual Foresight for Generic UI Representations
Andrea Burns
Kate Saenko
Bryan A. Plummer
LM&Ro, AI4TS
245
7
0
12 Jun 2024
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction
Zhen Xing
Jingdong Sun
Zejia Weng
Zuxuan Wu
Yu-Gang Jiang
VGen
248
21
0
10 Jun 2024
Timely Communications for Remote Inference
Md Kamran Chowdhury Shisher
Yin Sun
I-Hong Hou
206
22
0
25 Apr 2024
VideoGigaGAN: Towards Detail-rich Video Super-Resolution
Yiran Xu
Taesung Park
Richard Zhang
Yang Zhou
Eli Shechtman
Feng Liu
Jia-Bin Huang
Difan Liu
SupR
304
25
0
18 Apr 2024
State-space Decomposition Model for Video Prediction Considering Long-term Motion Trend
Fei Cui
Jiaojiao Fang
Xiaojiang Wu
Zelong Lai
Mengke Yang
Menghan Jia
Guizhong Liu
105
0
0
17 Apr 2024
Action-conditioned video data improves predictability
Meenakshi Sarkar
Debasish Ghose
VGen
259
0
0
08 Apr 2024
VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting
Yujin Tang
Peijie Dong
Zhenheng Tang
Xiaowen Chu
Junwei Liang
Mamba
288
45
0
25 Mar 2024
Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
Sihyun Yu
Weili Nie
De-An Huang
Boyi Li
Jinwoo Shin
A. Anandkumar
VGen, DiffM
226
24
0
21 Mar 2024
Probabilistic Forecasting with Stochastic Interpolants and Föllmer Processes
Yifan Chen
Mark Goldstein
Mengjian Hua
M. S. Albergo
Nicholas M. Boffi
Eric Vanden-Eijnden
AI4TS
312
35
0
20 Mar 2024
Towards Scene Graph Anticipation
European Conference on Computer Vision (ECCV), 2024
Rohith Peddi
Saksham Singh
Saurabh
Parag Singla
Vibhav Gogate
304
7
0
07 Mar 2024
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Willi Menapace
Aliaksandr Siarohin
Ivan Skorokhodov
Ekaterina Deyneka
Tsai-Shien Chen
...
Yuwei Fang
A. Stoliar
Elisa Ricci
Jian Ren
Sergey Tulyakov
VGen
271
94
0
22 Feb 2024
Sign Language Production with Latent Motion Transformer
Pan Xie
Taiying Peng
Yao Du
Qipeng Zhang
SLR
159
10
0
20 Dec 2023
STDiff: Spatio-temporal Diffusion for Continuous Stochastic Video Prediction
Xi Ye
Guillaume-Alexandre Bilodeau
VGen, DiffM
192
17
0
11 Dec 2023
DiffCast: A Unified Framework via Residual Diffusion for Precipitation Nowcasting
Demin Yu
Xutao Li
Yunming Ye
Baoquan Zhang
Chuyao Luo
Kuai Dai
Rui Wang
Xunlai Chen
308
50
0
11 Dec 2023
Event-based Continuous Color Video Decompression from Single Frames
ZiYun Wang
Friedhelm Hamann
Kenneth Chaney
Wen Jiang
Martin Braschler
Kostas Daniilidis
463
10
0
30 Nov 2023
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
...
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
VGen
797
1,869
0
25 Nov 2023
FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline
V.Ya. Arkhipkin
Zein Shaheen
Viacheslav Vasilev
E. Dakhova
Andrey Kuznetsov
Denis Dimitrov
DiffM, VGen
263
7
0
22 Nov 2023
Pair-wise Layer Attention with Spatial Masking for Video Prediction
Ping Li
Chenhan Zhang
Zheng Yang
Xianghua Xu
Mingli Song
192
0
0
19 Nov 2023
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
European Conference on Computer Vision (ECCV), 2023
Jinbo Xing
Menghan Xia
Yong Zhang
Haoxin Chen
Wangbo Yu
Hanyuan Liu
Xintao Wang
Tien-Tsin Wong
Ying Shan
VGen
252
392
0
18 Oct 2023
Generative Image Dynamics
Computer Vision and Pattern Recognition (CVPR), 2023
Zhengqi Li
Richard Tucker
Noah Snavely
Aleksander Holynski
DiffM
282
88
0
14 Sep 2023
MMVP: Motion-Matrix-based Video Prediction
IEEE International Conference on Computer Vision (ICCV), 2023
Yiqi Zhong
Luming Liang
Ilya Zharkov
Ulrich Neumann
157
30
0
30 Aug 2023
Structured World Models from Human Videos
Russell Mendonca
Shikhar Bahl
Deepak Pathak
LM&Ro
204
136
0
21 Aug 2023
SwinLSTM: Improving Spatiotemporal Prediction Accuracy using Swin Transformer and LSTM
IEEE International Conference on Computer Vision (ICCV), 2023
Song Tang
Chuang Li
Pufen Zhang
R. Tang
AI4TS
152
93
0
19 Aug 2023
SimDA: Simple Diffusion Adapter for Efficient Video Generation
Computer Vision and Pattern Recognition (CVPR), 2023
Zhen Xing
Jingdong Sun
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGen, DiffM
182
103
0
18 Aug 2023
S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video Prediction
IEEE Transactions on Multimedia (IEEE TMM), 2023
Mohammad Adiban
Kalin Stefanov
Sabato Marco Siniscalchi
G. Salvi
213
3
0
13 Jul 2023
Action-conditioned Deep Visual Prediction with RoAM, a new Indoor Human Motion Dataset for Autonomous Robots
IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2023
Meenakshi Sarkar
V. Honkote
D. Das
D. Ghose
152
3
0
28 Jun 2023
SkyGPT: Probabilistic Short-term Solar Forecasting Using Synthetic Sky Videos from Physics-constrained VideoGPT
Yuhao Nie
E. Zelikman
Andea Scott
Quentin Paletta
A. Brandt
293
4
0
20 Jun 2023
Fast Fourier Inception Networks for Occluded Video Prediction
IEEE Transactions on Multimedia (IEEE TMM), 2023
Ping Li
Chenhan Zhang
Xianghua Xu
152
10
0
17 Jun 2023
DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles
Tal Daniel
Aviv Tamar
DiffM
210
12
0
09 Jun 2023
Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video Generation
AAAI Conference on Artificial Intelligence (AAAI), 2023
A. Davtyan
Paolo Favaro
VGen
235
7
0
06 Jun 2023
Video Diffusion Models with Local-Global Context Guidance
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Si-hang Yang
Lu Zhang
Yu Liu
Zhizhuo Jiang
You He
VGen, DiffM
118
18
0
05 Jun 2023