ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1412.6604
  4. Cited By
Video (language) modeling: a baseline for generative models of natural
  videos

Video (language) modeling: a baseline for generative models of natural videos

20 December 2014
MarcÁurelio Ranzato
Arthur Szlam
Joan Bruna
Michaël Mathieu
R. Collobert
S. Chopra
    VGen
ArXivPDFHTML

Papers citing "Video (language) modeling: a baseline for generative models of natural videos"

50 / 113 papers shown
Title
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Junbang Liang
Ruoshi Liu
Ege Ozguroglu
Sruthi Sudhakar
Achal Dave
P. Tokmakov
Shuran Song
Carl Vondrick
VGen
42
23
0
24 Jun 2024
SFTformer: A Spatial-Frequency-Temporal Correlation-Decoupling
  Transformer for Radar Echo Extrapolation
SFTformer: A Spatial-Frequency-Temporal Correlation-Decoupling Transformer for Radar Echo Extrapolation
Liangyu Xu
Wanxuan Lu
Hongfeng Yu
Fanglong Yao
Xian Sun
Kun Fu
47
5
0
28 Feb 2024
USTEP: Spatio-Temporal Predictive Learning under A Unified View
USTEP: Spatio-Temporal Predictive Learning under A Unified View
Cheng Tan
Jue Wang
Zhangyang Gao
Siyuan Li
Stan Z. Li
38
1
0
09 Oct 2023
DriveDreamer: Towards Real-world-driven World Models for Autonomous
  Driving
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
Xiaofeng Wang
Zheng Hua Zhu
Guan Huang
Xinze Chen
Jiagang Zhu
Jiwen Lu
VGen
24
149
0
18 Sep 2023
TEDi: Temporally-Entangled Diffusion for Long-Term Motion Synthesis
TEDi: Temporally-Entangled Diffusion for Long-Term Motion Synthesis
Zihan Zhang
Richard Liu
Kfir Aberman
Rana Hanocka
DiffM
37
26
0
27 Jul 2023
MagicVideo: Efficient Video Generation With Latent Diffusion Models
MagicVideo: Efficient Video Generation With Latent Diffusion Models
Daquan Zhou
Weimin Wang
Hanshu Yan
Weiwei Lv
Yizhe Zhu
Jiashi Feng
DiffM
VGen
39
373
0
20 Nov 2022
See, Plan, Predict: Language-guided Cognitive Planning with Video
  Prediction
See, Plan, Predict: Language-guided Cognitive Planning with Video Prediction
Maria Attarian
Advaya Gupta
Ziyi Zhou
Wei Yu
Igor Gilitschenski
Animesh Garg
LM&Ro
29
7
0
07 Oct 2022
Phenaki: Variable Length Video Generation From Open Domain Textual
  Description
Phenaki: Variable Length Video Generation From Open Domain Textual Description
Ruben Villegas
Mohammad Babaeizadeh
Pieter-Jan Kindermans
Hernan Moraldo
Han Zhang
M. Saffar
Santiago Castro
Julius Kunze
D. Erhan
DiffM
VGen
62
371
0
05 Oct 2022
Imagen Video: High Definition Video Generation with Diffusion Models
Imagen Video: High Definition Video Generation with Diffusion Models
Jonathan Ho
William Chan
Chitwan Saharia
Jay Whang
Ruiqi Gao
...
Diederik P. Kingma
Ben Poole
Mohammad Norouzi
David J. Fleet
Tim Salimans
VGen
55
1,479
0
05 Oct 2022
Image Classification using Sequence of Pixels
Image Classification using Sequence of Pixels
Gajraj Kuldeep
21
0
0
23 Sep 2022
Intelligent 3D Network Protocol for Multimedia Data Classification using
  Deep Learning
Intelligent 3D Network Protocol for Multimedia Data Classification using Deep Learning
A. Syed
Eman A. Aldhahri
M. Iqbal
Abid Ali
Ammar Muthanna
Harun Jamil
F. Jamil
3DH
16
2
0
23 Jul 2022
Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive
  Learning
Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning
Cheng Tan
Zhangyang Gao
Lirong Wu
Yongjie Xu
Jun Xia
Siyuan Li
Stan Z. Li
46
107
0
24 Jun 2022
MaskViT: Masked Visual Pre-Training for Video Prediction
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta
Stephen Tian
Yunzhi Zhang
Jiajun Wu
Roberto Martín-Martín
Li Fei-Fei
112
111
0
23 Jun 2022
SimVP: Simpler yet Better Video Prediction
SimVP: Simpler yet Better Video Prediction
Zhangyang Gao
Cheng Tan
Lirong Wu
Stan Z. Li
43
211
0
09 Jun 2022
Cascaded Video Generation for Videos In-the-Wild
Cascaded Video Generation for Videos In-the-Wild
Lluis Castrejon
Nicolas Ballas
Aaron Courville
VGen
34
0
0
01 Jun 2022
STAU: A SpatioTemporal-Aware Unit for Video Prediction and Beyond
STAU: A SpatioTemporal-Aware Unit for Video Prediction and Beyond
Zheng Chang
Xinfeng Zhang
Shanshe Wang
Siwei Ma
Wen Gao
30
1
0
20 Apr 2022
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive
  Transformer
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer
Songwei Ge
Thomas Hayes
Harry Yang
Xiaoyue Yin
Guan Pang
David Jacobs
Jia-Bin Huang
Devi Parikh
ViT
56
215
0
07 Apr 2022
STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution
  Video Prediction
STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction
Zheng Chang
Xinfeng Zhang
Shanshe Wang
Siwei Ma
Wen Gao
29
50
0
30 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
18
118
0
25 Mar 2022
Look for the Change: Learning Object States and State-Modifying Actions
  from Untrimmed Web Videos
Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos
Tomávs Souvcek
Jean-Baptiste Alayrac
Antoine Miech
Ivan Laptev
Josef Sivic
21
32
0
22 Mar 2022
StyleGAN-V: A Continuous Video Generator with the Price, Image Quality
  and Perks of StyleGAN2
StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2
Ivan Skorokhodov
Sergey Tulyakov
Mohamed Elhoseiny
VGen
35
279
0
29 Dec 2021
Wide and Narrow: Video Prediction from Context and Motion
Wide and Narrow: Video Prediction from Context and Motion
Jaehoon Cho
Jiyoung Lee
Changjae Oh
Wonil Song
Kwanghoon Sohn
22
1
0
22 Oct 2021
ModeRNN: Harnessing Spatiotemporal Mode Collapse in Unsupervised
  Predictive Learning
ModeRNN: Harnessing Spatiotemporal Mode Collapse in Unsupervised Predictive Learning
Zhiyu Yao
Yunbo Wang
Haixu Wu
Jianmin Wang
Mingsheng Long
AI4TS
29
8
0
08 Oct 2021
A Hierarchical Variational Neural Uncertainty Model for Stochastic Video
  Prediction
A Hierarchical Variational Neural Uncertainty Model for Stochastic Video Prediction
Moitreya Chatterjee
Narendra Ahuja
A. Cherian
UQCV
VGen
BDL
42
17
0
06 Oct 2021
Google Neural Network Models for Edge Devices: Analyzing and Mitigating
  Machine Learning Inference Bottlenecks
Google Neural Network Models for Edge Devices: Analyzing and Mitigating Machine Learning Inference Bottlenecks
Amirali Boroumand
Saugata Ghose
Berkin Akin
Ravi Narayanaswami
Geraldo F. Oliveira
Xiaoyu Ma
Eric Shiu
O. Mutlu
20
81
0
29 Sep 2021
A Framework for Multisensory Foresight for Embodied Agents
A Framework for Multisensory Foresight for Embodied Agents
Xiaohui Chen
Ramtin Hosseini
K. Panetta
Jivko Sinapov
24
3
0
15 Sep 2021
Hierarchical Video Generation for Complex Data
Hierarchical Video Generation for Complex Data
Lluis Castrejon
Nicolas Ballas
Aaron Courville
VGen
22
4
0
04 Jun 2021
FDNet: A Deep Learning Approach with Two Parallel Cross Encoding
  Pathways for Precipitation Nowcasting
FDNet: A Deep Learning Approach with Two Parallel Cross Encoding Pathways for Precipitation Nowcasting
Bi Yan
Chao Yang
F. Chen
Kohei Takeda
Changjun Wang
29
13
0
06 May 2021
DriveGAN: Towards a Controllable High-Quality Neural Simulation
DriveGAN: Towards a Controllable High-Quality Neural Simulation
S. Kim
Jonah Philion
Antonio Torralba
Sanja Fidler
29
109
0
30 Apr 2021
PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive
  Learning
PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning
Yunbo Wang
Haixu Wu
Jianjin Zhang
Zhifeng Gao
Jianmin Wang
Philip S. Yu
Mingsheng Long
22
378
0
17 Mar 2021
Self-Supervision by Prediction for Object Discovery in Videos
Self-Supervision by Prediction for Object Discovery in Videos
Beril Besbinar
P. Frossard
SSL
26
7
0
09 Mar 2021
Predicting Video with VQVAE
Predicting Video with VQVAE
Jacob Walker
Ali Razavi
Aaron van den Oord
DRL
24
66
0
02 Mar 2021
Learning Temporal Dynamics from Cycles in Narrated Video
Learning Temporal Dynamics from Cycles in Narrated Video
Dave Epstein
Jiajun Wu
Cordelia Schmid
Chen Sun
AI4TS
35
14
0
07 Jan 2021
Learning the Predictability of the Future
Learning the Predictability of the Future
Dídac Surís
Ruoshi Liu
Carl Vondrick
24
71
0
01 Jan 2021
Mutual Information Based Method for Unsupervised Disentanglement of
  Video Representation
Mutual Information Based Method for Unsupervised Disentanglement of Video Representation
Aditya Sreekar
Ujjwal Tiwari
A. Namboodiri
DRL
26
4
0
17 Nov 2020
Enriching Video Captions With Contextual Text
Enriching Video Captions With Contextual Text
Philipp Rimle
Pelin Dogan
Markus Gross
30
3
0
29 Jul 2020
Latent Video Transformer
Latent Video Transformer
Ruslan Rakhimov
Denis Volkhonskiy
Alexey Artemov
Denis Zorin
Evgeny Burnaev
VGen
33
118
0
18 Jun 2020
Going in circles is the way forward: the role of recurrence in visual
  inference
Going in circles is the way forward: the role of recurrence in visual inference
R. S. V. Bergen
N. Kriegeskorte
17
82
0
26 Mar 2020
Photo-Realistic Video Prediction on Natural Videos of Largely Changing
  Frames
Photo-Realistic Video Prediction on Natural Videos of Largely Changing Frames
O. Shouno
GAN
38
21
0
19 Mar 2020
Stochastic Latent Residual Video Prediction
Stochastic Latent Residual Video Prediction
Jean-Yves Franceschi
E. Delasalles
Mickaël Chen
Sylvain Lamprier
Patrick Gallinari
VGen
26
159
0
21 Feb 2020
Learning Predictive Models From Observation and Interaction
Learning Predictive Models From Observation and Interaction
Karl Schmeckpeper
Annie Xie
Oleh Rybkin
Stephen Tian
Kostas Daniilidis
Sergey Levine
Chelsea Finn
DRL
33
60
0
30 Dec 2019
Action Anticipation with RBF Kernelized Feature Mapping RNN
Action Anticipation with RBF Kernelized Feature Mapping RNN
Yuge Shi
Basura Fernando
Richard I. Hartley
31
82
0
18 Nov 2019
High Fidelity Video Prediction with Large Stochastic Recurrent Neural
  Networks
High Fidelity Video Prediction with Large Stochastic Recurrent Neural Networks
Ruben Villegas
Arkanath Pathak
Harini Kannan
D. Erhan
Quoc V. Le
Honglak Lee
VGen
22
136
0
05 Nov 2019
Markov Decision Process for Video Generation
Markov Decision Process for Video Generation
V. Yushchenko
Nikita Araslanov
Stefan Roth
VGen
23
20
0
26 Sep 2019
Adversarial Video Generation on Complex Datasets
Adversarial Video Generation on Complex Datasets
Aidan Clark
Jeff Donahue
Karen Simonyan
VGen
GAN
27
74
0
15 Jul 2019
Unsupervised Learning of Object Structure and Dynamics from Videos
Unsupervised Learning of Object Structure and Dynamics from Videos
Matthias Minderer
Chen Sun
Ruben Villegas
Forrester Cole
Kevin Patrick Murphy
Honglak Lee
27
150
0
19 Jun 2019
Improved Conditional VRNNs for Video Prediction
Improved Conditional VRNNs for Video Prediction
Lluis Castrejon
Nicolas Ballas
Aaron Courville
VGen
DRL
21
161
0
27 Apr 2019
Keyframing the Future: Keyframe Discovery for Visual Prediction and
  Planning
Keyframing the Future: Keyframe Discovery for Visual Prediction and Planning
Karl Pertsch
Oleh Rybkin
Jingyun Yang
Shenghao Zhou
Konstantinos G. Derpanis
Kostas Daniilidis
Joseph J. Lim
Andrew Jaegle
VGen
45
24
0
11 Apr 2019
Point-to-Point Video Generation
Point-to-Point Video Generation
Tsun-Hsuan Wang
Y. Cheng
Chieh Hubert Lin
Hwann-Tzong Chen
Min Sun
VGen
DiffM
16
21
0
05 Apr 2019
VideoFlow: A Conditional Flow-Based Model for Stochastic Video
  Generation
VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation
Manoj Kumar
Mohammad Babaeizadeh
D. Erhan
Chelsea Finn
Sergey Levine
Laurent Dinh
Durk Kingma
VGen
25
131
0
04 Mar 2019
123
Next