ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1507.08750
  4. Cited By
Action-Conditional Video Prediction using Deep Networks in Atari Games

Action-Conditional Video Prediction using Deep Networks in Atari Games

31 July 2015
Junhyuk Oh
Xiaoxiao Guo
Honglak Lee
Richard L. Lewis
Satinder Singh
ArXivPDFHTML

Papers citing "Action-Conditional Video Prediction using Deep Networks in Atari Games"

50 / 221 papers shown
Title
FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation
FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation
Jun Guo
Xiaojian Ma
Yikai Wang
Min Yang
Huaping Liu
Qing Li
VGen
34
0
0
15 May 2025
Strengthening Generative Robot Policies through Predictive World Modeling
Strengthening Generative Robot Policies through Predictive World Modeling
Han Qi
Haocheng Yin
Yilun Du
Yilun Du
Heng Yang
66
2
0
02 Feb 2025
EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning
EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning
Siddharth Aravindan
Dixant Mittal
Wee Sun Lee
BDL
79
0
0
17 Jan 2025
FACTS: A Factored State-Space Framework For World Modelling
FACTS: A Factored State-Space Framework For World Modelling
Li Nanbo
Firas Laakom
Yucheng Xu
Wenyi Wang
Jürgen Schmidhuber
AI4TS
226
0
0
28 Oct 2024
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation
Hyungjoo Chae
Namyoung Kim
Kai Tzu-iunn Ong
Minju Gwak
Gwanwoo Song
Jihoon Kim
S. Kim
Dongha Lee
Jinyoung Yeo
LLMAG
33
15
0
17 Oct 2024
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Jasmine Bayrooti
Carl Henrik Ek
Amanda Prorok
42
0
0
07 Oct 2024
Synthesizing Evolving Symbolic Representations for Autonomous Systems
Synthesizing Evolving Symbolic Representations for Autonomous Systems
Gabriele Sartor
A. Oddi
R. Rasconi
V. Santucci
Rosa Meo
26
0
0
18 Sep 2024
MuirBench: A Comprehensive Benchmark for Robust Multi-image
  Understanding
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Fei Wang
Xingyu Fu
James Y. Huang
Zekun Li
Qin Liu
...
Kai-Wei Chang
Dan Roth
Sheng Zhang
Hoifung Poon
Muhao Chen
VLM
50
47
0
13 Jun 2024
Vista: A Generalizable Driving World Model with High Fidelity and
  Versatile Controllability
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
71
75
0
27 May 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models
iVideoGPT: Interactive VideoGPTs are Scalable World Models
Jialong Wu
Shaofeng Yin
Ningya Feng
Xu He
Dong Li
Haifeng Zhang
Mingsheng Long
VGen
49
26
0
24 May 2024
Graph Reinforcement Learning for Combinatorial Optimization: A Survey
  and Unifying Perspective
Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective
Victor-Alexandru Darvariu
Stephen Hailes
Mirco Musolesi
AI4CE
50
6
0
09 Apr 2024
Action-conditioned video data improves predictability
Action-conditioned video data improves predictability
Meenakshi Sarkar
Debasish Ghose
VGen
51
0
0
08 Apr 2024
Automatic Music Playlist Generation via Simulation-based Reinforcement
  Learning
Automatic Music Playlist Generation via Simulation-based Reinforcement Learning
Federico Tomasi
Joseph Cauteruccio
Surya Kanoria
K. Ciosek
Matteo Rinaldi
Zhenwen Dai
OffRL
30
5
0
13 Oct 2023
Long-Term Prediction of Natural Video Sequences with Robust Video
  Predictors
Long-Term Prediction of Natural Video Sequences with Robust Video Predictors
Luke Ditria
Tom Drummond
51
0
0
21 Aug 2023
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
Hanqing Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
35
28
0
14 Aug 2023
Context-Conditional Navigation with a Learning-Based Terrain- and
  Robot-Aware Dynamics Model
Context-Conditional Navigation with a Learning-Based Terrain- and Robot-Aware Dynamics Model
Suresh Guttikonda
Jan Achterhold
Haolong Li
Joschka Boedecker
Joerg Stueckler
29
2
0
18 Jul 2023
Pre-training Contextualized World Models with In-the-wild Videos for
  Reinforcement Learning
Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning
Jialong Wu
Haoyu Ma
Chao Deng
Mingsheng Long
OffRL
36
25
0
29 May 2023
State Representation Learning Using an Unbalanced Atlas
State Representation Learning Using an Unbalanced Atlas
Li Meng
Morten Goodwin
Anis Yazidi
P. Engelstad
37
2
0
17 May 2023
Model-Based Reinforcement Learning with Isolated Imaginations
Model-Based Reinforcement Learning with Isolated Imaginations
Minting Pan
Xiangming Zhu
Yitao Zheng
Yunbo Wang
Xiaokang Yang
34
0
0
27 Mar 2023
Learning from Predictions: Fusing Training and Autoregressive Inference
  for Long-Term Spatiotemporal Forecasts
Learning from Predictions: Fusing Training and Autoregressive Inference for Long-Term Spatiotemporal Forecasts
Pantelis R. Vlachas
Petros Koumoutsakos
AI4TS
AI4CE
23
7
0
22 Feb 2023
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement
  Learning
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
29
8
0
26 Jan 2023
Long-horizon video prediction using a dynamic latent hierarchy
Long-horizon video prediction using a dynamic latent hierarchy
Alexey Zakharov
Qinghai Guo
Z. Fountas
36
4
0
29 Dec 2022
Representation Learning for Continuous Action Spaces is Beneficial for
  Efficient Policy Learning
Representation Learning for Continuous Action Spaces is Beneficial for Efficient Policy Learning
Tingting Zhao
Ying Wang
Weidong Sun
Yarui Chen
Gang Niu
Masashi Sugiyama
19
1
0
23 Nov 2022
Joint Embedding Predictive Architectures Focus on Slow Features
Joint Embedding Predictive Architectures Focus on Slow Features
Vlad Sobal
V. JyothirS
Siddhartha Jalagam
Nicolas Carion
Kyunghyun Cho
Yann LeCun
24
8
0
20 Nov 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Daniel Jarrett
Corentin Tallec
Florent Altché
Thomas Mesnard
Rémi Munos
Michal Valko
48
5
0
18 Nov 2022
Reward-Predictive Clustering
Reward-Predictive Clustering
Lucas Lehnert
M. Frank
Michael L. Littman
OffRL
25
0
0
07 Nov 2022
Learning General World Models in a Handful of Reward-Free Deployments
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
62
9
0
23 Oct 2022
See, Plan, Predict: Language-guided Cognitive Planning with Video
  Prediction
See, Plan, Predict: Language-guided Cognitive Planning with Video Prediction
Maria Attarian
Advaya Gupta
Ziyi Zhou
Wei Yu
Igor Gilitschenski
Animesh Garg
LM&Ro
29
7
0
07 Oct 2022
Rewarding Episodic Visitation Discrepancy for Exploration in
  Reinforcement Learning
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
36
12
0
19 Sep 2022
HARP: Autoregressive Latent Video Prediction with High-Fidelity Image
  Generator
HARP: Autoregressive Latent Video Prediction with High-Fidelity Image Generator
Younggyo Seo
Kimin Lee
Fangchen Liu
Stephen James
Pieter Abbeel
VGen
29
28
0
15 Sep 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
BYOL-Explore: Exploration by Bootstrapped Prediction
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
22
68
0
16 Jun 2022
Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in
  World Models
Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in World Models
Minting Pan
Xiangming Zhu
Yunbo Wang
Xiaokang Yang
29
39
0
27 May 2022
Brainish: Formalizing A Multimodal Language for Intelligence and
  Consciousness
Brainish: Formalizing A Multimodal Language for Intelligence and Consciousness
Paul Pu Liang
30
4
0
14 Apr 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
18
119
0
25 Mar 2022
Playable Environments: Video Manipulation in Space and Time
Playable Environments: Video Manipulation in Space and Time
Willi Menapace
Stéphane Lathuilière
Aliaksandr Siarohin
Christian Theobalt
Sergey Tulyakov
Vladislav Golyanik
Elisa Ricci
VGen
34
22
0
03 Mar 2022
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal
  Point Processes
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes
Chao Qu
Xiaoyu Tan
Siqiao Xue
Xiaoming Shi
James Y. Zhang
Hongyuan Mei
OffRL
30
17
0
29 Jan 2022
InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition
InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition
A. Glavan
Estefanía Talavera
21
10
0
23 Dec 2021
Learning to track environment state via predictive autoencoding
Learning to track environment state via predictive autoencoding
Marian Andrecki
N. K. Taylor
8
0
0
14 Dec 2021
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
44
7
0
08 Dec 2021
Noether Networks: Meta-Learning Useful Conserved Quantities
Noether Networks: Meta-Learning Useful Conserved Quantities
Ferran Alet
Dylan D. Doblar
Allan Zhou
J. Tenenbaum
Kenji Kawaguchi
Chelsea Finn
75
27
0
06 Dec 2021
Self-Consistent Models and Values
Self-Consistent Models and Values
Roy Miles
Kate Baumli
Zita Marinho
Angelos Filos
Matteo Hessel
Hado van Hasselt
David Silver
38
8
0
25 Oct 2021
OPEn: An Open-ended Physics Environment for Learning Without a Task
OPEn: An Open-ended Physics Environment for Learning Without a Task
Chuang Gan
Abhishek Bhandwaldar
Antonio Torralba
J. Tenenbaum
Phillip Isola
LRM
135
4
0
13 Oct 2021
Planning from Pixels in Environments with Combinatorially Hard Search
  Spaces
Planning from Pixels in Environments with Combinatorially Hard Search Spaces
Marco Bagatella
Miroslav Olsák
Michal Rolínek
Georg Martius
OffRL
21
6
0
12 Oct 2021
ModeRNN: Harnessing Spatiotemporal Mode Collapse in Unsupervised
  Predictive Learning
ModeRNN: Harnessing Spatiotemporal Mode Collapse in Unsupervised Predictive Learning
Zhiyu Yao
Yunbo Wang
Haixu Wu
Jianmin Wang
Mingsheng Long
AI4TS
29
8
0
08 Oct 2021
Goal-Directed Design Agents: Integrating Visual Imitation with One-Step
  Lookahead Optimization for Generative Design
Goal-Directed Design Agents: Integrating Visual Imitation with One-Step Lookahead Optimization for Generative Design
Ayush Raina
Lucas Puentes
Jonathan Cagan
Christopher McComb
AI4CE
26
6
0
07 Oct 2021
A Framework for Multisensory Foresight for Embodied Agents
A Framework for Multisensory Foresight for Embodied Agents
Xiaohui Chen
Ramtin Hosseini
K. Panetta
Jivko Sinapov
26
3
0
15 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
36
93
0
14 Sep 2021
High Performance Across Two Atari Paddle Games Using the Same Perceptual
  Control Architecture Without Training
High Performance Across Two Atari Paddle Games Using the Same Perceptual Control Architecture Without Training
T. Gulrez
W. Mansell
19
0
0
04 Aug 2021
Human-Level Reinforcement Learning through Theory-Based Modeling,
  Exploration, and Planning
Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning
Pedro Tsividis
J. Loula
Jake Burga
Nathan Foss
Andres Campero
Thomas Pouncy
S. Gershman
J. Tenenbaum
LM&Ro
24
43
0
27 Jul 2021
Improve Agents without Retraining: Parallel Tree Search with Off-Policy
  Correction
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Assaf Hallak
Gal Dalal
Steven Dalton
I. Frosio
Shie Mannor
Gal Chechik
OffRL
OnRL
35
9
0
04 Jul 2021
12345
Next