World Models

27 March 2018

Papers citing "World Models"

50 / 254 papers shown

Title
PlaNet of the Bayesians: Reconsidering and Improving Deep Planning Network by Incorporating Bayesian Inference Masashi Okada Norio Kosaka T. Taniguchi 8 43 0 01 Mar 2020
Plannable Approximations to MDP Homomorphisms: Equivariance under Actions Elise van der Pol Thomas Kipf F. Oliehoek Max Welling 25 77 0 27 Feb 2020
From Chess and Atari to StarCraft and Beyond: How Game AI is Driving the World of AI S. Risi Mike Preuss 35 56 0 24 Feb 2020
The Next Decade in AI: Four Steps Towards Robust Artificial Intelligence G. Marcus VLM 32 355 0 14 Feb 2020
Causally Correct Partial Models for Reinforcement Learning Danilo Jimenez Rezende Ivo Danihelka George Papamakarios Nan Rosemary Ke Ray Jiang ... Jane X. Wang Jovana Mitrović F. Besse Ioannis Antonoglou Lars Buesing AI4TS 24 32 0 07 Feb 2020
DDSP: Differentiable Digital Signal Processing Jesse Engel Lamtharn Hantrakul Chenjie Gu Adam Roberts DiffM 96 373 0 14 Jan 2020
Interestingness Elements for Explainable Reinforcement Learning: Understanding Agents' Capabilities and Limitations Pedro Sequeira Melinda Gervasio 19 104 0 19 Dec 2019
Adversarial recovery of agent rewards from latent spaces of the limit order book Jacobo Roa-Vicens Yuanbo Wang Virgile Mison Y. Gal Ricardo M. A. Silva 23 3 0 09 Dec 2019
Policy Optimization Reinforcement Learning with Entropy Regularization Jingbin Liu Xinyang Gu Shuai Liu 25 4 0 02 Dec 2019
Experience-Embedded Visual Foresight Yen-Chen Lin Maria Bauzá Phillip Isola 22 35 0 12 Nov 2019
Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning Patrik Reizinger Marton Szemenyei 31 16 0 23 Oct 2019
Backpropagation Algorithms and Reservoir Computing in Recurrent Neural Networks for the Forecasting of Complex Spatiotemporal Dynamics Pantelis R. Vlachas Jaideep Pathak Brian R. Hunt T. Sapsis M. Girvan Edward Ott Petros Koumoutsakos AI4TS 24 384 0 09 Oct 2019
Model-based Reinforcement Learning for Predictions and Control for Limit Order Books Haoran Wei Yuanbo Wang L. Mangu Keith S. Decker 13 24 0 09 Oct 2019
Mature GAIL: Imitation Learning for Low-level and High-dimensional Input using Global Encoder and Cost Transformation Wonsup Shin Hyolim Kang Sunghoon Hong 13 0 0 07 Sep 2019
Towards Model-based Reinforcement Learning for Industry-near Environments Per-Arne Andersen M. G. Olsen Ole-Christoffer Granmo OffRL DRL 22 4 0 27 Jul 2019
Learning to design from humans: Imitating human designers through deep learning Ayush Raina Christopher McComb Jonathan Cagan 3DV AI4CE 32 65 0 26 Jul 2019
Learning the Arrow of Time Nasim Rahaman Steffen Wolf Anirudh Goyal Roman Remme Yoshua Bengio 14 5 0 02 Jul 2019
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model Alex X. Lee Anusha Nagabandi Pieter Abbeel Sergey Levine OffRL BDL 36 372 0 01 Jul 2019
Supervise Thyself: Examining Self-Supervised Representations in Interactive Environments Evan Racah C. Pal SSL 27 2 0 27 Jun 2019
Emergence of Exploratory Look-Around Behaviors through Active Observation Completion Santhosh Kumar Ramakrishnan Dinesh Jayaraman Kristen Grauman 20 40 0 27 Jun 2019
Learning Causal State Representations of Partially Observable Environments Amy Zhang Zachary Chase Lipton Luis Villaseñor-Pineda Kamyar Azizzadenesheli Anima Anandkumar Laurent Itti Joelle Pineau Tommaso Furlanello CML 40 49 0 25 Jun 2019
DynoPlan: Combining Motion Planning and Deep Neural Network based Controllers for Safe HRL Daniel Angelov Yordan V. Hristov S. Ramamoorthy 19 1 0 24 Jun 2019
Shaping Belief States with Generative Environment Models for RL Karol Gregor Danilo Jimenez Rezende F. Besse Yan Wu Hamza Merzic Aaron van den Oord OffRL AI4CE 25 118 0 21 Jun 2019
Generative Adversarial Networks are Special Cases of Artificial Curiosity (1990) and also Closely Related to Predictability Minimization (1991) J. Schmidhuber GAN DRL 30 57 0 11 Jun 2019
On the Transfer of Inductive Bias from Simulation to the Real World: a New Disentanglement Dataset Muhammad Waleed Gondal Manuel Wüthrich Ðorðe Miladinovic Francesco Locatello M. Breidt V. Volchkov J. Akpo Olivier Bachem Bernhard Schölkopf Stefan Bauer OOD DRL 33 134 0 07 Jun 2019
Proximal Reliability Optimization for Reinforcement Learning Narendra Patwardhan Zequn Wang 18 0 0 03 Jun 2019
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment Jivitesh Sharma Per-Arne Andersen Ole-Christoffer Granmo M. G. Olsen AI4CE 38 68 0 23 May 2019
COBRA: Data-Efficient Model-Based RL through Unsupervised Object Discovery and Curiosity-Driven Exploration Nicholas Watters Loic Matthey Matko Bosnjak Christopher P. Burgess Alexander Lerchner OffRL 11 117 0 22 May 2019
Bayesian policy selection using active inference Ozan Çatal J. Nauta Tim Verbelen Pieter Simoens Bart Dhoedt BDL 46 32 0 17 Apr 2019
Likelihood-free MCMC with Amortized Approximate Ratio Estimators Joeri Hermans Volodimir Begy Gilles Louppe 32 20 0 10 Mar 2019
World Discovery Models M. G. Azar Bilal Piot Bernardo Avila-Pires Jean-Bastien Grill Florent Altché Rémi Munos 21 26 0 20 Feb 2019
Engineered Self-Organization for Resilient Robot Self-Assembly with Minimal Surprise Tanja Katharina Kaiser Heiko Hamann 27 9 0 14 Feb 2019
Successor Features Combine Elements of Model-Free and Model-based Reinforcement Learning Lucas Lehnert Michael L. Littman 18 10 0 31 Jan 2019
TensorFlow.js: Machine Learning for the Web and Beyond D. Smilkov Nikhil Thorat Yannick Assogba Ann Yuan Nick Kreeger ... D. Sculley R. Monga G. Corrado F. Viégas Martin Wattenberg 30 173 0 16 Jan 2019
Imminent Collision Mitigation with Reinforcement Learning and Vision Horia Porav Paul Newman 22 18 0 03 Jan 2019
Mid-Level Visual Representations Improve Generalization and Sample Efficiency for Learning Visuomotor Policies Alexander Sax Bradley Emi Amir Zamir Leonidas J. Guibas Silvio Savarese Jitendra Malik SSL 39 16 0 31 Dec 2018
Deconfounding Reinforcement Learning in Observational Settings Chaochao Lu Bernhard Schölkopf José Miguel Hernández-Lobato CML OOD 39 73 0 26 Dec 2018
VMAV-C: A Deep Attention-based Reinforcement Learning Algorithm for Model-based Control Xingxing Liang Qi Wang Yanghe Feng Zhong Liu Jincai Huang 29 5 0 24 Dec 2018
Visual Foresight: Model-Based Deep Reinforcement Learning for Vision-Based Robotic Control F. Ebert Chelsea Finn Sudeep Dasari Annie Xie Alex X. Lee Sergey Levine SSL 35 379 0 03 Dec 2018
Unsupervised Control Through Non-Parametric Discriminative Rewards David Warde-Farley T. Wiele Tejas D. Kulkarni Catalin Ionescu Steven Hansen Volodymyr Mnih DRL OffRL SSL 41 173 0 28 Nov 2018
Towards Governing Agent's Efficacy: Action-Conditional $β$ -VAE for Deep Transparent Reinforcement Learning John Yang Gyujeong Lee Minsung Hyun Simyung Chang Nojun Kwak 29 3 0 11 Nov 2018
Unsupervised Emergence of Spatial Structure from Sensorimotor Prediction Alban Laflaquière Michael Garcia Ortiz 20 3 0 02 Oct 2018
The Dreaming Variational Autoencoder for Reinforcement Learning Environments Per-Arne Andersen M. G. Olsen Ole-Christoffer Granmo DRL 22 17 0 02 Oct 2018
S-RL Toolbox: Environments, Datasets and Evaluation Metrics for State Representation Learning Antonin Raffin Ashley Hill Kalifou René Traoré Timothée Lesort Natalia Díaz Rodríguez David Filliat OffRL 19 35 0 25 Sep 2018
Combined Reinforcement Learning via Abstract Representations Vincent François-Lavet Yoshua Bengio Doina Precup Joelle Pineau OffRL 30 89 0 12 Sep 2018
Visual Reinforcement Learning with Imagined Goals Ashvin Nair Vitchyr H. Pong Murtaza Dalal Shikhar Bahl Steven Lin Sergey Levine SSL 40 535 0 12 Jul 2018
Learning Deployable Navigation Policies at Kilometer Scale from a Single Traversal Jake Bruce Niko Sünderhauf Piotr Wojciech Mirowski R. Hadsell Michael Milford 22 35 0 11 Jul 2018
A survey on policy search algorithms for learning robot controllers in a handful of trials Konstantinos Chatzilygeroudis Vassilis Vassiliades F. Stulp Sylvain Calinon Jean-Baptiste Mouret 17 155 0 06 Jul 2018
Learning to Drive in a Day Alex Kendall Jeffrey Hawke David Janz Przemyslaw Mazur Daniele Reda John M. Allen Vinh-Dieu Lam Alex Bewley Amar Shah 42 643 0 01 Jul 2018
RUDDER: Return Decomposition for Delayed Rewards Jose A. Arjona-Medina Michael Gillhofer Michael Widrich Thomas Unterthiner Johannes Brandstetter Sepp Hochreiter 30 213 0 20 Jun 2018