Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1507.08750
Cited By
Action-Conditional Video Prediction using Deep Networks in Atari Games
31 July 2015
Junhyuk Oh
Xiaoxiao Guo
Honglak Lee
Richard L. Lewis
Satinder Singh
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Action-Conditional Video Prediction using Deep Networks in Atari Games"
50 / 221 papers shown
Title
FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation
Jun Guo
Xiaojian Ma
Yikai Wang
Min Yang
Huaping Liu
Qing Li
VGen
34
0
0
15 May 2025
Strengthening Generative Robot Policies through Predictive World Modeling
Han Qi
Haocheng Yin
Yilun Du
Yilun Du
Heng Yang
66
2
0
02 Feb 2025
EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning
Siddharth Aravindan
Dixant Mittal
Wee Sun Lee
BDL
79
0
0
17 Jan 2025
FACTS: A Factored State-Space Framework For World Modelling
Li Nanbo
Firas Laakom
Yucheng Xu
Wenyi Wang
Jürgen Schmidhuber
AI4TS
226
0
0
28 Oct 2024
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation
Hyungjoo Chae
Namyoung Kim
Kai Tzu-iunn Ong
Minju Gwak
Gwanwoo Song
Jihoon Kim
S. Kim
Dongha Lee
Jinyoung Yeo
LLMAG
33
15
0
17 Oct 2024
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Jasmine Bayrooti
Carl Henrik Ek
Amanda Prorok
42
0
0
07 Oct 2024
Synthesizing Evolving Symbolic Representations for Autonomous Systems
Gabriele Sartor
A. Oddi
R. Rasconi
V. Santucci
Rosa Meo
26
0
0
18 Sep 2024
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Fei Wang
Xingyu Fu
James Y. Huang
Zekun Li
Qin Liu
...
Kai-Wei Chang
Dan Roth
Sheng Zhang
Hoifung Poon
Muhao Chen
VLM
50
47
0
13 Jun 2024
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
71
75
0
27 May 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models
Jialong Wu
Shaofeng Yin
Ningya Feng
Xu He
Dong Li
Haifeng Zhang
Mingsheng Long
VGen
49
26
0
24 May 2024
Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective
Victor-Alexandru Darvariu
Stephen Hailes
Mirco Musolesi
AI4CE
50
6
0
09 Apr 2024
Action-conditioned video data improves predictability
Meenakshi Sarkar
Debasish Ghose
VGen
51
0
0
08 Apr 2024
Automatic Music Playlist Generation via Simulation-based Reinforcement Learning
Federico Tomasi
Joseph Cauteruccio
Surya Kanoria
K. Ciosek
Matteo Rinaldi
Zhenwen Dai
OffRL
30
5
0
13 Oct 2023
Long-Term Prediction of Natural Video Sequences with Robust Video Predictors
Luke Ditria
Tom Drummond
51
0
0
21 Aug 2023
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
Hanqing Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
35
28
0
14 Aug 2023
Context-Conditional Navigation with a Learning-Based Terrain- and Robot-Aware Dynamics Model
Suresh Guttikonda
Jan Achterhold
Haolong Li
Joschka Boedecker
Joerg Stueckler
29
2
0
18 Jul 2023
Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning
Jialong Wu
Haoyu Ma
Chao Deng
Mingsheng Long
OffRL
36
25
0
29 May 2023
State Representation Learning Using an Unbalanced Atlas
Li Meng
Morten Goodwin
Anis Yazidi
P. Engelstad
37
2
0
17 May 2023
Model-Based Reinforcement Learning with Isolated Imaginations
Minting Pan
Xiangming Zhu
Yitao Zheng
Yunbo Wang
Xiaokang Yang
34
0
0
27 Mar 2023
Learning from Predictions: Fusing Training and Autoregressive Inference for Long-Term Spatiotemporal Forecasts
Pantelis R. Vlachas
Petros Koumoutsakos
AI4TS
AI4CE
23
7
0
22 Feb 2023
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
29
8
0
26 Jan 2023
Long-horizon video prediction using a dynamic latent hierarchy
Alexey Zakharov
Qinghai Guo
Z. Fountas
36
4
0
29 Dec 2022
Representation Learning for Continuous Action Spaces is Beneficial for Efficient Policy Learning
Tingting Zhao
Ying Wang
Weidong Sun
Yarui Chen
Gang Niu
Masashi Sugiyama
19
1
0
23 Nov 2022
Joint Embedding Predictive Architectures Focus on Slow Features
Vlad Sobal
V. JyothirS
Siddhartha Jalagam
Nicolas Carion
Kyunghyun Cho
Yann LeCun
24
8
0
20 Nov 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Daniel Jarrett
Corentin Tallec
Florent Altché
Thomas Mesnard
Rémi Munos
Michal Valko
48
5
0
18 Nov 2022
Reward-Predictive Clustering
Lucas Lehnert
M. Frank
Michael L. Littman
OffRL
25
0
0
07 Nov 2022
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
62
9
0
23 Oct 2022
See, Plan, Predict: Language-guided Cognitive Planning with Video Prediction
Maria Attarian
Advaya Gupta
Ziyi Zhou
Wei Yu
Igor Gilitschenski
Animesh Garg
LM&Ro
29
7
0
07 Oct 2022
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
36
12
0
19 Sep 2022
HARP: Autoregressive Latent Video Prediction with High-Fidelity Image Generator
Younggyo Seo
Kimin Lee
Fangchen Liu
Stephen James
Pieter Abbeel
VGen
29
28
0
15 Sep 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
22
68
0
16 Jun 2022
Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in World Models
Minting Pan
Xiangming Zhu
Yunbo Wang
Xiaokang Yang
29
39
0
27 May 2022
Brainish: Formalizing A Multimodal Language for Intelligence and Consciousness
Paul Pu Liang
30
4
0
14 Apr 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
18
119
0
25 Mar 2022
Playable Environments: Video Manipulation in Space and Time
Willi Menapace
Stéphane Lathuilière
Aliaksandr Siarohin
Christian Theobalt
Sergey Tulyakov
Vladislav Golyanik
Elisa Ricci
VGen
34
22
0
03 Mar 2022
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes
Chao Qu
Xiaoyu Tan
Siqiao Xue
Xiaoming Shi
James Y. Zhang
Hongyuan Mei
OffRL
30
17
0
29 Jan 2022
InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition
A. Glavan
Estefanía Talavera
21
10
0
23 Dec 2021
Learning to track environment state via predictive autoencoding
Marian Andrecki
N. K. Taylor
8
0
0
14 Dec 2021
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
44
7
0
08 Dec 2021
Noether Networks: Meta-Learning Useful Conserved Quantities
Ferran Alet
Dylan D. Doblar
Allan Zhou
J. Tenenbaum
Kenji Kawaguchi
Chelsea Finn
75
27
0
06 Dec 2021
Self-Consistent Models and Values
Roy Miles
Kate Baumli
Zita Marinho
Angelos Filos
Matteo Hessel
Hado van Hasselt
David Silver
38
8
0
25 Oct 2021
OPEn: An Open-ended Physics Environment for Learning Without a Task
Chuang Gan
Abhishek Bhandwaldar
Antonio Torralba
J. Tenenbaum
Phillip Isola
LRM
135
4
0
13 Oct 2021
Planning from Pixels in Environments with Combinatorially Hard Search Spaces
Marco Bagatella
Miroslav Olsák
Michal Rolínek
Georg Martius
OffRL
21
6
0
12 Oct 2021
ModeRNN: Harnessing Spatiotemporal Mode Collapse in Unsupervised Predictive Learning
Zhiyu Yao
Yunbo Wang
Haixu Wu
Jianmin Wang
Mingsheng Long
AI4TS
29
8
0
08 Oct 2021
Goal-Directed Design Agents: Integrating Visual Imitation with One-Step Lookahead Optimization for Generative Design
Ayush Raina
Lucas Puentes
Jonathan Cagan
Christopher McComb
AI4CE
26
6
0
07 Oct 2021
A Framework for Multisensory Foresight for Embodied Agents
Xiaohui Chen
Ramtin Hosseini
K. Panetta
Jivko Sinapov
26
3
0
15 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
36
93
0
14 Sep 2021
High Performance Across Two Atari Paddle Games Using the Same Perceptual Control Architecture Without Training
T. Gulrez
W. Mansell
19
0
0
04 Aug 2021
Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning
Pedro Tsividis
J. Loula
Jake Burga
Nathan Foss
Andres Campero
Thomas Pouncy
S. Gershman
J. Tenenbaum
LM&Ro
24
43
0
27 Jul 2021
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Assaf Hallak
Gal Dalal
Steven Dalton
I. Frosio
Shie Mannor
Gal Chechik
OffRL
OnRL
35
9
0
04 Jul 2021
1
2
3
4
5
Next