ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.12165
  4. Cited By
Improved Conditional VRNNs for Video Prediction

Improved Conditional VRNNs for Video Prediction

IEEE International Conference on Computer Vision (ICCV), 2019
27 April 2019
Lluis Castrejon
Nicolas Ballas
Aaron Courville
    VGenDRL
ArXiv (abs)PDFHTML

Papers citing "Improved Conditional VRNNs for Video Prediction"

50 / 114 papers shown
Title
STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows
STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows
Jiatao Gu
Ying Shen
Tianrong Chen
Laurent Dinh
Y. Wang
Miguel Angel Bautista
David Berthelot
Josh Susskind
Shuangfei Zhai
DiffMVGen
282
3
0
25 Nov 2025
Sequence-Adaptive Video Prediction in Continuous Streams using Diffusion Noise Optimization
Sequence-Adaptive Video Prediction in Continuous Streams using Diffusion Noise Optimization
Sina Mokhtarzadeh Azar
Emad Bahrami
Enrico Pallotta
Gianpiero Francesca
Radu Timofte
Juergen Gall
DiffM
108
0
0
23 Nov 2025
Video Prediction of Dynamic Physical Simulations With Pixel-Space Spatiotemporal Transformers
Video Prediction of Dynamic Physical Simulations With Pixel-Space Spatiotemporal TransformersIEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2025
Dean L. Slack
G. Hudson
T. Winterbottom
Noura Al Moubayed
130
0
0
23 Oct 2025
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos
Rundong Luo
Matthew Wallingford
Ali Farhadi
Noah Snavely
Wei-Chiu Ma
VGen
382
6
0
10 Apr 2025
Unified Arbitrary-Time Video Frame Interpolation and PredictionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Xin Jin
Longhai Wu
Jie Chen
Ilhyun Cho
Cheul-hee Hahm
251
1
0
04 Mar 2025
Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers
Advancing Semantic Future Prediction through Multimodal Visual Sequence TransformersComputer Vision and Pattern Recognition (CVPR), 2025
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
347
3
0
14 Jan 2025
DINO-Foresight: Looking into the Future with DINO
DINO-Foresight: Looking into the Future with DINO
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
AI4CE
592
14
0
16 Dec 2024
Efficient Continuous Video Flow Model for Video Prediction
Efficient Continuous Video Flow Model for Video Prediction
Gaurav Shrivastava
Abhinav Shrivastava
VGen
205
0
0
07 Dec 2024
Continuous Video Process: Modeling Videos as Continuous
  Multi-Dimensional Processes for Video Prediction
Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction
Gaurav Shrivastava
Abhinav Shrivastava
VGenDiffM
256
0
0
06 Dec 2024
Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided
  Mixture-of-Experts
Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-ExpertsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Xiang Deng
Youxin Pang
Xiaochen Zhao
Chao Xu
Lizhen Wang
Hongjiang Xiao
Shi Yan
Hongwen Zhang
Yebin Liu
DiffMVGen
219
3
0
31 Oct 2024
Motion Graph Unleashed: A Novel Approach to Video Prediction
Motion Graph Unleashed: A Novel Approach to Video PredictionNeural Information Processing Systems (NeurIPS), 2024
Yiqi Zhong
Luming Liang
Bohan Tang
Ilya Zharkov
Ulrich Neumann
297
4
0
29 Oct 2024
Masked Autoregressive Model for Weather Forecasting
Masked Autoregressive Model for Weather Forecasting
Doyi Kim
Minseok Seo
Hakjin Lee
Junghoon Seo
206
1
0
30 Sep 2024
Blox-Net: Generative Design-for-Robot-Assembly Using VLM Supervision,
  Physics Simulation, and a Robot with Reset
Blox-Net: Generative Design-for-Robot-Assembly Using VLM Supervision, Physics Simulation, and a Robot with ResetIEEE International Conference on Robotics and Automation (ICRA), 2024
Andrew Goldberg
Kavish Kondap
Tianshuang Qiu
Zehan Ma
Letian Fu
Justin Kerr
Huang Huang
Kaiyuan Chen
Kuan Fang
Ken Goldberg
189
13
0
25 Sep 2024
Fréchet Video Motion Distance: A Metric for Evaluating Motion
  Consistency in Videos
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos
Jiahe Liu
Youran Qu
Qi Yan
Fangyin Wei
Lele Wang
Renjie Liao
VGenEGVM
305
28
0
23 Jul 2024
The Power of Next-Frame Prediction for Learning Physical Laws
The Power of Next-Frame Prediction for Learning Physical Laws
T. Winterbottom
G. Hudson
Daniel Kluvanec
Dean L. Slack
Jamie Sterling
Junjie Shentu
Chenghao Xiao
Zheming Zhou
Noura Al Moubayed
199
3
0
21 May 2024
On the Content Bias in Fréchet Video Distance
On the Content Bias in Fréchet Video Distance
Jason S. Hoffman
Aniruddha Mahapatra
Gaurav Parmar
Jun-Yan Zhu
Jia-Bin Huang
EGVM
232
32
0
18 Apr 2024
Action-conditioned video data improves predictability
Action-conditioned video data improves predictability
Meenakshi Sarkar
Debasish Ghose
VGen
295
0
0
08 Apr 2024
A Survey on Generative AI and LLM for Video Generation, Understanding,
  and Streaming
A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming
Pengyuan Zhou
Lin Wang
Zhi Liu
Yanbin Hao
Pan Hui
Sasu Tarkoma
J. Kangasharju
VGen
244
46
0
30 Jan 2024
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large
  Datasets
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
...
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
VGen
933
1,923
0
25 Nov 2023
Breathing Life Into Sketches Using Text-to-Video Priors
Breathing Life Into Sketches Using Text-to-Video PriorsComputer Vision and Pattern Recognition (CVPR), 2023
Rinon Gal
Yael Vinker
Yuval Alaluf
Amit H. Bermano
Daniel Cohen-Or
Ariel Shamir
Gal Chechik
VGenDiffM
178
47
0
21 Nov 2023
Triplet Attention Transformer for Spatiotemporal Predictive Learning
Triplet Attention Transformer for Spatiotemporal Predictive LearningIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Xuesong Nie
Xi Chen
Haoyuan Jin
Zhihang Zhu
Yunfeng Yan
Donglian Qi
ViT
156
15
0
28 Oct 2023
HyperSINDy: Deep Generative Modeling of Nonlinear Stochastic Governing
  Equations
HyperSINDy: Deep Generative Modeling of Nonlinear Stochastic Governing Equations
Mozes Jacobs
Bingni W. Brunton
Steven L. Brunton
J. Nathan Kutz
Ryan V. Raut
170
14
0
07 Oct 2023
LLM-grounded Video Diffusion Models
LLM-grounded Video Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2023
Long Lian
Baifeng Shi
Semih Yavuz
Ye Liu
Boyi Li
DiffM
218
74
0
29 Sep 2023
Automatic Animation of Hair Blowing in Still Portrait Photos
Automatic Animation of Hair Blowing in Still Portrait PhotosIEEE International Conference on Computer Vision (ICCV), 2023
Wenpeng Xiao
Wentao Liu
Yitong Wang
Guohao Li
Bing Li
3DH
216
14
0
25 Sep 2023
Generating and Imputing Tabular Data via Diffusion and Flow-based
  Gradient-Boosted Trees
Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted TreesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Alexia Jolicoeur-Martineau
Kilian Fatras
Tal Kachman
302
56
0
18 Sep 2023
SimDA: Simple Diffusion Adapter for Efficient Video Generation
SimDA: Simple Diffusion Adapter for Efficient Video GenerationComputer Vision and Pattern Recognition (CVPR), 2023
Zhen Xing
Jingdong Sun
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGenDiffM
222
105
0
18 Aug 2023
S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized
  Variational Autoencoder for Video Prediction
S-HR-VQVAE: Sequential Hierarchical Residual Learning Vector Quantized Variational Autoencoder for Video PredictionIEEE transactions on multimedia (IEEE TMM), 2023
Mohammad Adiban
Kalin Stefanov
Sabato Marco Siniscalchi
G. Salvi
237
3
0
13 Jul 2023
Action-conditioned Deep Visual Prediction with RoAM, a new Indoor Human
  Motion Dataset for Autonomous Robots
Action-conditioned Deep Visual Prediction with RoAM, a new Indoor Human Motion Dataset for Autonomous RobotsIEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2023
Meenakshi Sarkar
V. Honkote
D. Das
D. Ghose
172
3
0
28 Jun 2023
OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive
  Learning
OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive LearningNeural Information Processing Systems (NeurIPS), 2023
Cheng Tan
Siyuan Li
Zhangyang Gao
Wen-Cai Guan
Zedong Wang
Zicheng Liu
Lirong Wu
Stan Z. Li
AI4TS
245
90
0
20 Jun 2023
Fast Fourier Inception Networks for Occluded Video Prediction
Fast Fourier Inception Networks for Occluded Video PredictionIEEE transactions on multimedia (IEEE TMM), 2023
Ping Li
Chenhan Zhang
Xianghua Xu
180
10
0
17 Jun 2023
Video Diffusion Models with Local-Global Context Guidance
Video Diffusion Models with Local-Global Context GuidanceInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Si-hang Yang
Lu Zhang
Yu Liu
Zhizhuo Jiang
You He
VGenDiffM
118
18
0
05 Jun 2023
VDT: General-purpose Video Diffusion Transformers via Mask Modeling
VDT: General-purpose Video Diffusion Transformers via Mask ModelingInternational Conference on Learning Representations (ICLR), 2023
Haoyu Lu
Guoxing Yang
Nanyi Fei
Yuqi Huo
Zhiwu Lu
Ping Luo
Mingyu Ding
DiffMVGen
214
97
0
22 May 2023
Align your Latents: High-Resolution Video Synthesis with Latent
  Diffusion Models
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2023
A. Blattmann
Robin Rombach
Huan Ling
Tim Dockhorn
Seung Wook Kim
Sanja Fidler
Karsten Kreis
3DGSVGen
610
1,410
0
18 Apr 2023
MS-LSTM: Exploring Spatiotemporal Multiscale Representations in Video
  Prediction Domain
MS-LSTM: Exploring Spatiotemporal Multiscale Representations in Video Prediction DomainApplied Soft Computing (Appl. Soft Comput.), 2023
Zhifeng Ma
Hao Zhang
Jie Liu
396
10
0
16 Apr 2023
Explicitly Minimizing the Blur Error of Variational Autoencoders
Explicitly Minimizing the Blur Error of Variational AutoencodersInternational Conference on Learning Representations (ICLR), 2023
G. Bredell
Kyriakos Flouris
K. Chaitanya
Ertunc Erdil
E. Konukoglu
153
35
0
12 Apr 2023
Model-Based Reinforcement Learning with Isolated Imaginations
Model-Based Reinforcement Learning with Isolated ImaginationsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Minting Pan
Geng Chen
Yitao Zheng
Yunbo Wang
Xiaokang Yang
321
2
0
27 Mar 2023
Towards End-to-End Generative Modeling of Long Videos with
  Memory-Efficient Bidirectional Transformers
Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional TransformersComputer Vision and Pattern Recognition (CVPR), 2023
Jaehoon Yoo
Semin Kim
Doyup Lee
Chiheon Kim
Seunghoon Hong
201
6
0
20 Mar 2023
A Dynamic Multi-Scale Voxel Flow Network for Video Prediction
A Dynamic Multi-Scale Voxel Flow Network for Video PredictionComputer Vision and Pattern Recognition (CVPR), 2023
Xiaotao Hu
Zhewei Huang
Ailin Huang
Jun Xu
Shuchang Zhou
VGen
214
86
0
17 Mar 2023
TKN: Transformer-based Keypoint Prediction Network For Real-time Video
  Prediction
TKN: Transformer-based Keypoint Prediction Network For Real-time Video Prediction
Haoran Li
Pengyuan Zhou
Yi-Wen Lin
Y. Hao
Haiyong Xie
Yong Liao
ViTAI4TS
263
1
0
17 Mar 2023
Implicit Stacked Autoregressive Model for Video Prediction
Implicit Stacked Autoregressive Model for Video Prediction
Min-seok Seo
Hakjin Lee
Do-Yeon Kim
Junghoon Seo
VGen
143
20
0
14 Mar 2023
Continual Visual Reinforcement Learning with A Life-Long World Model
Continual Visual Reinforcement Learning with A Life-Long World Model
Wendong Zhang
Wendong Zhang
Geng Chen
Siyu Gao
Yunbo Wang
Xiaokang Yang
Yunbo Wang
CLL
313
3
0
12 Mar 2023
Distributional Learning of Variational AutoEncoder: Application to
  Synthetic Data Generation
Distributional Learning of Variational AutoEncoder: Application to Synthetic Data GenerationNeural Information Processing Systems (NeurIPS), 2023
SeungHwan An
Jong-June Jeon
DRL
473
12
0
22 Feb 2023
Learning from Predictions: Fusing Training and Autoregressive Inference
  for Long-Term Spatiotemporal Forecasts
Learning from Predictions: Fusing Training and Autoregressive Inference for Long-Term Spatiotemporal ForecastsSocial Science Research Network (SSRN), 2023
Pantelis R. Vlachas
Petros Koumoutsakos
AI4TSAI4CE
264
11
0
22 Feb 2023
Anti-aliasing Predictive Coding Network for Future Video Frame
  Prediction
Anti-aliasing Predictive Coding Network for Future Video Frame Prediction
Chaofan Ling
Wei-Hong Li
Junpei Zhong
227
0
0
13 Jan 2023
Long-horizon video prediction using a dynamic latent hierarchy
Long-horizon video prediction using a dynamic latent hierarchy
Alexey Zakharov
Qinghai Guo
Zafeirios Fountas
161
5
0
29 Dec 2022
Predictive Coding Based Multiscale Network with Encoder-Decoder LSTM for
  Video Prediction
Predictive Coding Based Multiscale Network with Encoder-Decoder LSTM for Video Prediction
Chaofan Ling
Junpei Zhong
Wei-Hong Li
278
3
0
22 Dec 2022
Video Prediction by Efficient Transformers
Video Prediction by Efficient TransformersImage and Vision Computing (IVC), 2022
Xi Ye
Guillaume-Alexandre Bilodeau
ViT
250
45
0
12 Dec 2022
Multi-Rate VAE: Train Once, Get the Full Rate-Distortion Curve
Multi-Rate VAE: Train Once, Get the Full Rate-Distortion CurveInternational Conference on Learning Representations (ICLR), 2022
Juhan Bae
Michael Ruogu Zhang
Michael Ruan
Eric Wang
S. Hasegawa
Jimmy Ba
Roger C. Grosse
DRL
231
23
0
07 Dec 2022
Efficient Video Prediction via Sparsely Conditioned Flow Matching
Efficient Video Prediction via Sparsely Conditioned Flow MatchingIEEE International Conference on Computer Vision (ICCV), 2022
A. Davtyan
Sepehr Sameni
Paolo Favaro
VGenDiffM
250
41
0
26 Nov 2022
WALDO: Future Video Synthesis using Object Layer Decomposition and
  Parametric Flow Prediction
WALDO: Future Video Synthesis using Object Layer Decomposition and Parametric Flow PredictionIEEE International Conference on Computer Vision (ICCV), 2022
G. L. Moing
Jean Ponce
Cordelia Schmid
345
7
0
25 Nov 2022
123
Next