1803.10122
Cited By
World Models
27 March 2018
David R Ha
Jürgen Schmidhuber
SyDa
Papers citing "World Models"
50 / 254 papers shown
Title
PlaNet of the Bayesians: Reconsidering and Improving Deep Planning Network by Incorporating Bayesian Inference
Masashi Okada
Norio Kosaka
T. Taniguchi
8
43
0
01 Mar 2020
Plannable Approximations to MDP Homomorphisms: Equivariance under Actions
Elise van der Pol
Thomas Kipf
F. Oliehoek
Max Welling
25
77
0
27 Feb 2020
From Chess and Atari to StarCraft and Beyond: How Game AI is Driving the World of AI
S. Risi
Mike Preuss
35
56
0
24 Feb 2020
The Next Decade in AI: Four Steps Towards Robust Artificial Intelligence
G. Marcus
VLM
32
355
0
14 Feb 2020
Causally Correct Partial Models for Reinforcement Learning
Danilo Jimenez Rezende
Ivo Danihelka
George Papamakarios
Nan Rosemary Ke
Ray Jiang
...
Jane X. Wang
Jovana Mitrović
F. Besse
Ioannis Antonoglou
Lars Buesing
AI4TS
24
32
0
07 Feb 2020
DDSP: Differentiable Digital Signal Processing
Jesse Engel
Lamtharn Hantrakul
Chenjie Gu
Adam Roberts
DiffM
96
373
0
14 Jan 2020
Interestingness Elements for Explainable Reinforcement Learning: Understanding Agents' Capabilities and Limitations
Pedro Sequeira
Melinda Gervasio
19
104
0
19 Dec 2019
Adversarial recovery of agent rewards from latent spaces of the limit order book
Jacobo Roa-Vicens
Yuanbo Wang
Virgile Mison
Y. Gal
Ricardo M. A. Silva
23
3
0
09 Dec 2019
Policy Optimization Reinforcement Learning with Entropy Regularization
Jingbin Liu
Xinyang Gu
Shuai Liu
25
4
0
02 Dec 2019
Experience-Embedded Visual Foresight
Yen-Chen Lin
Maria Bauzá
Phillip Isola
22
35
0
12 Nov 2019
Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning
Patrik Reizinger
Marton Szemenyei
31
16
0
23 Oct 2019
Backpropagation Algorithms and Reservoir Computing in Recurrent Neural Networks for the Forecasting of Complex Spatiotemporal Dynamics
Pantelis R. Vlachas
Jaideep Pathak
Brian R. Hunt
T. Sapsis
M. Girvan
Edward Ott
Petros Koumoutsakos
AI4TS
24
384
0
09 Oct 2019
Model-based Reinforcement Learning for Predictions and Control for Limit Order Books
Haoran Wei
Yuanbo Wang
L. Mangu
Keith S. Decker
13
24
0
09 Oct 2019
Mature GAIL: Imitation Learning for Low-level and High-dimensional Input using Global Encoder and Cost Transformation
Wonsup Shin
Hyolim Kang
Sunghoon Hong
13
0
0
07 Sep 2019
Towards Model-based Reinforcement Learning for Industry-near Environments
Per-Arne Andersen
M. G. Olsen
Ole-Christoffer Granmo
OffRL
DRL
22
4
0
27 Jul 2019
Learning to design from humans: Imitating human designers through deep learning
Ayush Raina
Christopher McComb
Jonathan Cagan
3DV
AI4CE
32
65
0
26 Jul 2019
Learning the Arrow of Time
Nasim Rahaman
Steffen Wolf
Anirudh Goyal
Roman Remme
Yoshua Bengio
14
5
0
02 Jul 2019
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
Alex X. Lee
Anusha Nagabandi
Pieter Abbeel
Sergey Levine
OffRL
BDL
36
372
0
01 Jul 2019
Supervise Thyself: Examining Self-Supervised Representations in Interactive Environments
Evan Racah
C. Pal
SSL
27
2
0
27 Jun 2019
Emergence of Exploratory Look-Around Behaviors through Active Observation Completion
Santhosh Kumar Ramakrishnan
Dinesh Jayaraman
Kristen Grauman
20
40
0
27 Jun 2019
Learning Causal State Representations of Partially Observable Environments
Amy Zhang
Zachary Chase Lipton
Luis Villaseñor-Pineda
Kamyar Azizzadenesheli
Anima Anandkumar
Laurent Itti
Joelle Pineau
Tommaso Furlanello
CML
40
49
0
25 Jun 2019
DynoPlan: Combining Motion Planning and Deep Neural Network based Controllers for Safe HRL
Daniel Angelov
Yordan V. Hristov
S. Ramamoorthy
19
1
0
24 Jun 2019
Shaping Belief States with Generative Environment Models for RL
Karol Gregor
Danilo Jimenez Rezende
F. Besse
Yan Wu
Hamza Merzic
Aaron van den Oord
OffRL
AI4CE
25
118
0
21 Jun 2019
Generative Adversarial Networks are Special Cases of Artificial Curiosity (1990) and also Closely Related to Predictability Minimization (1991)
J. Schmidhuber
GAN
DRL
30
57
0
11 Jun 2019
On the Transfer of Inductive Bias from Simulation to the Real World: a New Disentanglement Dataset
Muhammad Waleed Gondal
Manuel Wüthrich
Đorđe Miladinovic
Francesco Locatello
M. Breidt
V. Volchkov
J. Akpo
Olivier Bachem
Bernhard Schölkopf
Stefan Bauer
OOD
DRL
33
134
0
07 Jun 2019
Proximal Reliability Optimization for Reinforcement Learning
Narendra Patwardhan
Zequn Wang
18
0
0
03 Jun 2019
Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment
Jivitesh Sharma
Per-Arne Andersen
Ole-Christoffer Granmo
M. G. Olsen
AI4CE
38
68
0
23 May 2019
COBRA: Data-Efficient Model-Based RL through Unsupervised Object Discovery and Curiosity-Driven Exploration
Nicholas Watters
Loic Matthey
Matko Bosnjak
Christopher P. Burgess
Alexander Lerchner
OffRL
11
117
0
22 May 2019
Bayesian policy selection using active inference
Ozan Çatal
J. Nauta
Tim Verbelen
Pieter Simoens
Bart Dhoedt
BDL
46
32
0
17 Apr 2019
Likelihood-free MCMC with Amortized Approximate Ratio Estimators
Joeri Hermans
Volodimir Begy
Gilles Louppe
32
20
0
10 Mar 2019
World Discovery Models
M. G. Azar
Bilal Piot
Bernardo Avila-Pires
Jean-Bastien Grill
Florent Altché
Rémi Munos
21
26
0
20 Feb 2019
Engineered Self-Organization for Resilient Robot Self-Assembly with Minimal Surprise
Tanja Katharina Kaiser
Heiko Hamann
27
9
0
14 Feb 2019
Successor Features Combine Elements of Model-Free and Model-based Reinforcement Learning
Lucas Lehnert
Michael L. Littman
18
10
0
31 Jan 2019
TensorFlow.js: Machine Learning for the Web and Beyond
D. Smilkov
Nikhil Thorat
Yannick Assogba
Ann Yuan
Nick Kreeger
...
D. Sculley
R. Monga
G. Corrado
F. Viégas
Martin Wattenberg
30
173
0
16 Jan 2019
Imminent Collision Mitigation with Reinforcement Learning and Vision
Horia Porav
Paul Newman
22
18
0
03 Jan 2019
Mid-Level Visual Representations Improve Generalization and Sample Efficiency for Learning Visuomotor Policies
Alexander Sax
Bradley Emi
Amir Zamir
Leonidas J. Guibas
Silvio Savarese
Jitendra Malik
SSL
39
16
0
31 Dec 2018
Deconfounding Reinforcement Learning in Observational Settings
Chaochao Lu
Bernhard Schölkopf
José Miguel Hernández-Lobato
CML
OOD
39
73
0
26 Dec 2018
VMAV-C: A Deep Attention-based Reinforcement Learning Algorithm for Model-based Control
Xingxing Liang
Qi Wang
Yanghe Feng
Zhong Liu
Jincai Huang
29
5
0
24 Dec 2018
Visual Foresight: Model-Based Deep Reinforcement Learning for Vision-Based Robotic Control
F. Ebert
Chelsea Finn
Sudeep Dasari
Annie Xie
Alex X. Lee
Sergey Levine
SSL
35
379
0
03 Dec 2018
Unsupervised Control Through Non-Parametric Discriminative Rewards
David Warde-Farley
T. Wiele
Tejas D. Kulkarni
Catalin Ionescu
Steven Hansen
Volodymyr Mnih
DRL
OffRL
SSL
41
173
0
28 Nov 2018
Towards Governing Agent's Efficacy: Action-Conditional β-VAE for Deep Transparent Reinforcement Learning
John Yang
Gyujeong Lee
Minsung Hyun
Simyung Chang
Nojun Kwak
29
3
0
11 Nov 2018
Unsupervised Emergence of Spatial Structure from Sensorimotor Prediction
Alban Laflaquière
Michael Garcia Ortiz
20
3
0
02 Oct 2018
The Dreaming Variational Autoencoder for Reinforcement Learning Environments
Per-Arne Andersen
M. G. Olsen
Ole-Christoffer Granmo
DRL
22
17
0
02 Oct 2018
S-RL Toolbox: Environments, Datasets and Evaluation Metrics for State Representation Learning
Antonin Raffin
Ashley Hill
Kalifou René Traoré
Timothée Lesort
Natalia Díaz Rodríguez
David Filliat
OffRL
19
35
0
25 Sep 2018
Combined Reinforcement Learning via Abstract Representations
Vincent François-Lavet
Yoshua Bengio
Doina Precup
Joelle Pineau
OffRL
30
89
0
12 Sep 2018
Visual Reinforcement Learning with Imagined Goals
Ashvin Nair
Vitchyr H. Pong
Murtaza Dalal
Shikhar Bahl
Steven Lin
Sergey Levine
SSL
40
535
0
12 Jul 2018
Learning Deployable Navigation Policies at Kilometer Scale from a Single Traversal
Jake Bruce
Niko Sünderhauf
Piotr Wojciech Mirowski
R. Hadsell
Michael Milford
22
35
0
11 Jul 2018
A survey on policy search algorithms for learning robot controllers in a handful of trials
Konstantinos Chatzilygeroudis
Vassilis Vassiliades
F. Stulp
Sylvain Calinon
Jean-Baptiste Mouret
17
155
0
06 Jul 2018
Learning to Drive in a Day
Alex Kendall
Jeffrey Hawke
David Janz
Przemyslaw Mazur
Daniele Reda
John M. Allen
Vinh-Dieu Lam
Alex Bewley
Amar Shah
42
643
0
01 Jul 2018
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
30
213
0
20 Jun 2018