Action-Conditional Video Prediction using Deep Networks in Atari Games

31 July 2015

Papers citing "Action-Conditional Video Prediction using Deep Networks in Atari Games"

50 / 221 papers shown

Title
FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation Jun Guo Xiaojian Ma Yikai Wang Min Yang Huaping Liu Qing Li VGen 34 0 0 15 May 2025
Strengthening Generative Robot Policies through Predictive World Modeling Han Qi Haocheng Yin Yilun Du Heng Yang 66 2 0 02 Feb 2025
EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning Siddharth Aravindan Dixant Mittal Wee Sun Lee BDL 79 0 0 17 Jan 2025
FACTS: A Factored State-Space Framework For World Modelling Li Nanbo Firas Laakom Yucheng Xu Wenyi Wang Jürgen Schmidhuber AI4TS 226 0 0 28 Oct 2024
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation Hyungjoo Chae Namyoung Kim Kai Tzu-iunn Ong Minju Gwak Gwanwoo Song Jihoon Kim S. Kim Dongha Lee Jinyoung Yeo LLMAG 33 15 0 17 Oct 2024
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling Jasmine Bayrooti Carl Henrik Ek Amanda Prorok 42 0 0 07 Oct 2024
Synthesizing Evolving Symbolic Representations for Autonomous Systems Gabriele Sartor A. Oddi R. Rasconi V. Santucci Rosa Meo 26 0 0 18 Sep 2024
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding Fei Wang Xingyu Fu James Y. Huang Zekun Li Qin Liu ... Kai-Wei Chang Dan Roth Sheng Zhang Hoifung Poon Muhao Chen VLM 50 47 0 13 Jun 2024
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability Shenyuan Gao Jiazhi Yang Li Chen Kashyap Chitta Yihang Qiu Andreas Geiger Jun Zhang Hongyang Li 71 75 0 27 May 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models Jialong Wu Shaofeng Yin Ningya Feng Xu He Dong Li Haifeng Zhang Mingsheng Long VGen 49 26 0 24 May 2024
Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective Victor-Alexandru Darvariu Stephen Hailes Mirco Musolesi AI4CE 50 6 0 09 Apr 2024
Action-conditioned video data improves predictability Meenakshi Sarkar Debasish Ghose VGen 51 0 0 08 Apr 2024
Automatic Music Playlist Generation via Simulation-based Reinforcement Learning Federico Tomasi Joseph Cauteruccio Surya Kanoria K. Ciosek Matteo Rinaldi Zhenwen Dai OffRL 30 5 0 13 Oct 2023
Long-Term Prediction of Natural Video Sequences with Robust Video Predictors Luke Ditria Tom Drummond 51 0 0 21 Aug 2023
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation Hanqing Wang Wei Liang Luc Van Gool Wenguan Wang LM&Ro 35 28 0 14 Aug 2023
Context-Conditional Navigation with a Learning-Based Terrain- and Robot-Aware Dynamics Model Suresh Guttikonda Jan Achterhold Haolong Li Joschka Boedecker Joerg Stueckler 29 2 0 18 Jul 2023
Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning Jialong Wu Haoyu Ma Chao Deng Mingsheng Long OffRL 36 25 0 29 May 2023
State Representation Learning Using an Unbalanced Atlas Li Meng Morten Goodwin Anis Yazidi P. Engelstad 37 2 0 17 May 2023
Model-Based Reinforcement Learning with Isolated Imaginations Minting Pan Xiangming Zhu Yitao Zheng Yunbo Wang Xiaokang Yang 34 0 0 27 Mar 2023
Learning from Predictions: Fusing Training and Autoregressive Inference for Long-Term Spatiotemporal Forecasts Pantelis R. Vlachas Petros Koumoutsakos AI4TS AI4CE 23 7 0 22 Feb 2023
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning Mingqi Yuan Bo Li Xin Jin Wenjun Zeng OffRL 29 8 0 26 Jan 2023
Long-horizon video prediction using a dynamic latent hierarchy Alexey Zakharov Qinghai Guo Z. Fountas 36 4 0 29 Dec 2022
Representation Learning for Continuous Action Spaces is Beneficial for Efficient Policy Learning Tingting Zhao Ying Wang Weidong Sun Yarui Chen Gang Niu Masashi Sugiyama 19 1 0 23 Nov 2022
Joint Embedding Predictive Architectures Focus on Slow Features Vlad Sobal V. JyothirS Siddhartha Jalagam Nicolas Carion Kyunghyun Cho Yann LeCun 24 8 0 20 Nov 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments Daniel Jarrett Corentin Tallec Florent Altché Thomas Mesnard Rémi Munos Michal Valko 48 5 0 18 Nov 2022
Reward-Predictive Clustering Lucas Lehnert M. Frank Michael L. Littman OffRL 25 0 0 07 Nov 2022
Learning General World Models in a Handful of Reward-Free Deployments Yingchen Xu Jack Parker-Holder Aldo Pacchiano Philip J. Ball Oleh Rybkin Stephen J. Roberts Tim Rocktaschel Edward Grefenstette OffRL 62 9 0 23 Oct 2022
See, Plan, Predict: Language-guided Cognitive Planning with Video Prediction Maria Attarian Advaya Gupta Ziyi Zhou Wei Yu Igor Gilitschenski Animesh Garg LM&Ro 29 7 0 07 Oct 2022
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning Mingqi Yuan Bo Li Xin Jin Wenjun Zeng 36 12 0 19 Sep 2022
HARP: Autoregressive Latent Video Prediction with High-Fidelity Image Generator Younggyo Seo Kimin Lee Fangchen Liu Stephen James Pieter Abbeel VGen 29 28 0 15 Sep 2022
BYOL-Explore: Exploration by Bootstrapped Prediction Z. Guo S. Thakoor Miruna Pislar Bernardo Avila-Pires Florent Altché ... Yunhao Tang Michal Valko Rémi Munos M. G. Azar Bilal Piot 22 68 0 16 Jun 2022
Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in World Models Minting Pan Xiangming Zhu Yunbo Wang Xiaokang Yang 29 39 0 27 May 2022
Brainish: Formalizing A Multimodal Language for Intelligence and Consciousness Paul Pu Liang 30 4 0 14 Apr 2022
Reinforcement Learning with Action-Free Pre-Training from Videos Younggyo Seo Kimin Lee Stephen James Pieter Abbeel SSL OnRL 18 119 0 25 Mar 2022
Playable Environments: Video Manipulation in Space and Time Willi Menapace Stéphane Lathuilière Aliaksandr Siarohin Christian Theobalt Sergey Tulyakov Vladislav Golyanik Elisa Ricci VGen 34 22 0 03 Mar 2022
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes Chao Qu Xiaoyu Tan Siqiao Xue Xiaoming Shi James Y. Zhang Hongyuan Mei OffRL 30 17 0 29 Jan 2022
InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition A. Glavan Estefanía Talavera 21 10 0 23 Dec 2021
Learning to track environment state via predictive autoencoding Marian Andrecki N. K. Taylor 8 0 0 14 Dec 2021
Model-Value Inconsistency as a Signal for Epistemic Uncertainty Angelos Filos Eszter Vértes Zita Marinho Gregory Farquhar Diana Borsa A. Friesen Feryal M. P. Behbahani Tom Schaul André Barreto Simon Osindero 44 7 0 08 Dec 2021
Noether Networks: Meta-Learning Useful Conserved Quantities Ferran Alet Dylan D. Doblar Allan Zhou J. Tenenbaum Kenji Kawaguchi Chelsea Finn 75 27 0 06 Dec 2021
Self-Consistent Models and Values Gregory Farquhar Kate Baumli Zita Marinho Angelos Filos Matteo Hessel Hado van Hasselt David Silver 38 8 0 25 Oct 2021
OPEn: An Open-ended Physics Environment for Learning Without a Task Chuang Gan Abhishek Bhandwaldar Antonio Torralba J. Tenenbaum Phillip Isola LRM 135 4 0 13 Oct 2021
Planning from Pixels in Environments with Combinatorially Hard Search Spaces Marco Bagatella Miroslav Olsák Michal Rolínek Georg Martius OffRL 21 6 0 12 Oct 2021
ModeRNN: Harnessing Spatiotemporal Mode Collapse in Unsupervised Predictive Learning Zhiyu Yao Yunbo Wang Haixu Wu Jianmin Wang Mingsheng Long AI4TS 29 8 0 08 Oct 2021
Goal-Directed Design Agents: Integrating Visual Imitation with One-Step Lookahead Optimization for Generative Design Ayush Raina Lucas Puentes Jonathan Cagan Christopher McComb AI4CE 26 6 0 07 Oct 2021
A Framework for Multisensory Foresight for Embodied Agents Xiaohui Chen Ramtin Hosseini K. Panetta Jivko Sinapov 26 3 0 15 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain Jianye Hao Tianpei Yang Hongyao Tang Chenjia Bai Jinyi Liu Zhaopeng Meng Peng Liu Zhen Wang OffRL 36 93 0 14 Sep 2021
High Performance Across Two Atari Paddle Games Using the Same Perceptual Control Architecture Without Training T. Gulrez W. Mansell 19 0 0 04 Aug 2021
Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning Pedro Tsividis J. Loula Jake Burga Nathan Foss Andres Campero Thomas Pouncy S. Gershman J. Tenenbaum LM&Ro 24 43 0 27 Jul 2021
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction Assaf Hallak Gal Dalal Steven Dalton I. Frosio Shie Mannor Gal Chechik OffRL OnRL 35 9 0 04 Jul 2021