Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 1,677 papers shown
Title
Truncated Horizon Policy Search: Combining Reinforcement Learning & Imitation Learning
Wen Sun
J. Andrew Bagnell
Byron Boots
28
93
0
29 May 2018
Fingerprint Policy Optimisation for Robust Reinforcement Learning
Supratik Paul
Michael A. Osborne
Shimon Whiteson
32
18
0
27 May 2018
Fast Policy Learning through Imitation and Reinforcement
Ching-An Cheng
Xinyan Yan
Nolan Wagener
Byron Boots
26
83
0
26 May 2018
A0C: Alpha Zero in Continuous Action Space
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
19
48
0
24 May 2018
Intelligent Trainer for Model-Based Reinforcement Learning
Yuanlong Li
Linsen Dong
Xin Zhou
Yonggang Wen
K. Guan
OffRL
24
0
0
24 May 2018
Representation Balancing MDPs for Off-Policy Policy Evaluation
Yao Liu
Omer Gottesman
Aniruddh Raghu
Matthieu Komorowski
A. Faisal
Finale Doshi-Velez
Emma Brunskill
OffRL
27
75
0
23 May 2018
A General Family of Robust Stochastic Operators for Reinforcement Learning
Yingdong Lu
M. Squillante
C. Wu
22
3
0
21 May 2018
Leveraging human knowledge in tabular reinforcement learning: A study of human subjects
Ariel Rosenfeld
Moshe Cohen
Matthew E. Taylor
Sarit Kraus
OffRL
14
31
0
15 May 2018
Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes
T. P. Le
Ngo Anh Vien
Abu Layek
TaeChoong Chung
25
51
0
11 May 2018
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Joshua Romoff
Peter Henderson
Alexandre Piché
Vincent François-Lavet
Joelle Pineau
19
42
0
09 May 2018
Behavioral Cloning from Observation
F. Torabi
Garrett A. Warnell
Peter Stone
OffRL
58
710
0
04 May 2018
Exploration by Distributional Reinforcement Learning
Yunhao Tang
Shipra Agrawal
OOD
41
30
0
04 May 2018
Measuring the Intrinsic Dimension of Objective Landscapes
Chunyuan Li
Heerad Farkhoor
Rosanne Liu
J. Yosinski
38
401
0
24 Apr 2018
Benchmarking projective simulation in navigation problems
A. Melnikov
A. Makmal
Hans J. Briegel
19
19
0
23 Apr 2018
Gotta Learn Fast: A New Benchmark for Generalization in RL
Alex Nichol
Vicki Pfau
Christopher Hesse
Oleg Klimov
John Schulman
VLM
OffRL
15
177
0
10 Apr 2018
Learning to Run challenge: Synthesizing physiologically accurate motion using deep reinforcement learning
L. Kidzinski
Sharada Mohanty
Carmichael F. Ong
Jennifer Hicks
Sean F. Carroll
Sergey Levine
M. Salathé
Scott L. Delp
40
60
0
31 Mar 2018
Safe end-to-end imitation learning for model predictive control
Keuntaek Lee
Kamil Saigol
Evangelos A. Theodorou
BDL
24
24
0
27 Mar 2018
World Models
David R Ha
Jürgen Schmidhuber
SyDa
53
1,036
0
27 Mar 2018
Setting up a Reinforcement Learning Task with a Real-World Robot
A. R. Mahmood
D. Korenkevych
Brent Komer
James Bergstra
26
76
0
19 Mar 2018
Learning to Sequence Robot Behaviors for Visual Navigation
Hadi Salman
Puneet Singhal
Tanmay Shankar
Peng Yin
A. Salman
William Paivine
Guillaume Sartoretti
Matthew Travers
Howie Choset
20
8
0
05 Mar 2018
Accelerating Natural Gradient with Higher-Order Invariance
Yang Song
Jiaming Song
Stefano Ermon
26
21
0
04 Mar 2018
General Video Game AI: a Multi-Track Framework for Evaluating Agents, Games and Content Generation Algorithms
Diego Perez-Liebana
Jialin Liu
Ahmed Khalifa
Raluca D. Gaina
Julian Togelius
Simon Lucas
30
174
0
28 Feb 2018
The Mirage of Action-Dependent Baselines in Reinforcement Learning
George Tucker
Surya Bhupatiraju
S. Gu
Richard Turner
Zoubin Ghahramani
Sergey Levine
OffRL
30
126
0
27 Feb 2018
Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research
Matthias Plappert
Marcin Andrychowicz
Alex Ray
Bob McGrew
Bowen Baker
...
Joshua Tobin
Maciek Chociej
Peter Welinder
Vikash Kumar
Wojciech Zaremba
33
558
0
26 Feb 2018
Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari
P. Chrabaszcz
I. Loshchilov
Frank Hutter
32
99
0
24 Feb 2018
Verifying Controllers Against Adversarial Examples with Bayesian Optimization
Shromona Ghosh
Felix Berkenkamp
G. Ranade
S. Qadeer
Ashish Kapoor
AAML
33
45
0
23 Feb 2018
Structured Control Nets for Deep Reinforcement Learning
Mario Srouji
Jian Zhang
Ruslan Salakhutdinov
33
43
0
22 Feb 2018
Clipped Action Policy Gradient
Yasuhiro Fujita
S. Maeda
OffRL
34
37
0
21 Feb 2018
Continual Reinforcement Learning with Complex Synapses
Christos Kaplanis
Murray Shanahan
Claudia Clopath
KELM
32
87
0
20 Feb 2018
State Representation Learning for Control: An Overview
Timothée Lesort
Natalia Díaz Rodríguez
Jean-François Goudou
David Filliat
OffRL
50
319
0
12 Feb 2018
Learning to Evade Static PE Machine Learning Malware Models via Reinforcement Learning
Hyrum S. Anderson
Anant Kharkar
Bobby Filar
David Evans
P. Roth
AAML
38
207
0
26 Jan 2018
Reinforcement Learning based Recommender System using Biclustering Technique
Sungwoon Choi
Heonseok Ha
Uiwon Hwang
Chanju Kim
Jung-Woo Ha
Sungroh Yoon
OffRL
27
70
0
17 Jan 2018
Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes
Igor Adamski
R. Adamski
T. Grel
Adam Jedrych
Kamil Kaczmarek
Henryk Michalewski
OffRL
41
37
0
09 Jan 2018
Sample-Efficient Reinforcement Learning through Transfer and Architectural Priors
Benjamin Spector
Serge J. Belongie
OffRL
22
15
0
07 Jan 2018
DeepMind Control Suite
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
...
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELM
LM&Ro
BDL
74
1,107
0
02 Jan 2018
ES Is More Than Just a Traditional Finite-Difference Approximator
Joel Lehman
Jay Chen
Jeff Clune
Kenneth O. Stanley
33
89
0
18 Dec 2017
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning
F. Such
Vashisht Madhavan
Edoardo Conti
Joel Lehman
Kenneth O. Stanley
Jeff Clune
47
686
0
18 Dec 2017
Deep Primal-Dual Reinforcement Learning: Accelerating Actor-Critic using Bellman Duality
W. Cho
Mengdi Wang
OffRL
33
14
0
07 Dec 2017
MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence
Lianmin Zheng
Jiacheng Yang
Han Cai
Weinan Zhang
Jun Wang
Yong Yu
21
206
0
02 Dec 2017
Time Limits in Reinforcement Learning
Fabio Pardo
Arash Tavakoli
Vitaly Levdik
Petar Kormushev
CLL
44
158
0
01 Dec 2017
Variational Deep Q Network
Yunhao Tang
A. Kucukelbir
BDL
43
10
0
30 Nov 2017
Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control
Shangtong Zhang
Osmar R. Zaiane
45
10
0
30 Nov 2017
Divide-and-Conquer Reinforcement Learning
Dibya Ghosh
Avi Singh
Aravind Rajeswaran
Vikash Kumar
Sergey Levine
OffRL
51
125
0
27 Nov 2017
Action Branching Architectures for Deep Reinforcement Learning
Arash Tavakoli
Fabio Pardo
Petar Kormushev
22
260
0
24 Nov 2017
Backpropagation through the Void: Optimizing control variates for black-box gradient estimation
Will Grathwohl
Dami Choi
Yuhuai Wu
Geoffrey Roeder
David Duvenaud
56
300
0
31 Oct 2017
Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning
Sergio Valcarcel Macua
Aleksi Tukiainen
D. Hernández
David Baldazo
Enrique Munoz de Cote
S. Zazo
32
29
0
28 Oct 2017
Fast Model Identification via Physics Engines for Data-Efficient Policy Search
Shaojun Zhu
A. Kimmel
Kostas E. Bekris
Abdeslam Boularias
31
14
0
24 Oct 2017
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces
Garrett A. Warnell
Nicholas R. Waytowich
Vernon J. Lawhern
Peter Stone
18
267
0
28 Sep 2017
DropoutDAgger: A Bayesian Approach to Safe Imitation Learning
Kunal Menda
Katherine Driggs-Campbell
Mykel J. Kochenderfer
32
28
0
18 Sep 2017
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Marlos C. Machado
Marc G. Bellemare
Erik Talvitie
J. Veness
Matthew J. Hausknecht
Michael Bowling
42
546
0
18 Sep 2017
Previous
1
2
3
...
32
33
34
Next