ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRL
    ODL
ArXivPDFHTML

Papers citing "OpenAI Gym"

50 / 1,677 papers shown
Title
Truncated Horizon Policy Search: Combining Reinforcement Learning &
  Imitation Learning
Truncated Horizon Policy Search: Combining Reinforcement Learning & Imitation Learning
Wen Sun
J. Andrew Bagnell
Byron Boots
28
93
0
29 May 2018
Fingerprint Policy Optimisation for Robust Reinforcement Learning
Fingerprint Policy Optimisation for Robust Reinforcement Learning
Supratik Paul
Michael A. Osborne
Shimon Whiteson
32
18
0
27 May 2018
Fast Policy Learning through Imitation and Reinforcement
Fast Policy Learning through Imitation and Reinforcement
Ching-An Cheng
Xinyan Yan
Nolan Wagener
Byron Boots
26
83
0
26 May 2018
A0C: Alpha Zero in Continuous Action Space
A0C: Alpha Zero in Continuous Action Space
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
19
48
0
24 May 2018
Intelligent Trainer for Model-Based Reinforcement Learning
Intelligent Trainer for Model-Based Reinforcement Learning
Yuanlong Li
Linsen Dong
Xin Zhou
Yonggang Wen
K. Guan
OffRL
24
0
0
24 May 2018
Representation Balancing MDPs for Off-Policy Policy Evaluation
Representation Balancing MDPs for Off-Policy Policy Evaluation
Yao Liu
Omer Gottesman
Aniruddh Raghu
Matthieu Komorowski
A. Faisal
Finale Doshi-Velez
Emma Brunskill
OffRL
27
75
0
23 May 2018
A General Family of Robust Stochastic Operators for Reinforcement
  Learning
A General Family of Robust Stochastic Operators for Reinforcement Learning
Yingdong Lu
M. Squillante
C. Wu
22
3
0
21 May 2018
Leveraging human knowledge in tabular reinforcement learning: A study of
  human subjects
Leveraging human knowledge in tabular reinforcement learning: A study of human subjects
Ariel Rosenfeld
Moshe Cohen
Matthew E. Taylor
Sarit Kraus
OffRL
14
31
0
15 May 2018
Deep Hierarchical Reinforcement Learning Algorithm in Partially
  Observable Markov Decision Processes
Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes
T. P. Le
Ngo Anh Vien
Abu Layek
TaeChoong Chung
25
51
0
11 May 2018
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
Joshua Romoff
Peter Henderson
Alexandre Piché
Vincent François-Lavet
Joelle Pineau
19
42
0
09 May 2018
Behavioral Cloning from Observation
Behavioral Cloning from Observation
F. Torabi
Garrett A. Warnell
Peter Stone
OffRL
58
710
0
04 May 2018
Exploration by Distributional Reinforcement Learning
Exploration by Distributional Reinforcement Learning
Yunhao Tang
Shipra Agrawal
OOD
41
30
0
04 May 2018
Measuring the Intrinsic Dimension of Objective Landscapes
Measuring the Intrinsic Dimension of Objective Landscapes
Chunyuan Li
Heerad Farkhoor
Rosanne Liu
J. Yosinski
38
401
0
24 Apr 2018
Benchmarking projective simulation in navigation problems
Benchmarking projective simulation in navigation problems
A. Melnikov
A. Makmal
Hans J. Briegel
19
19
0
23 Apr 2018
Gotta Learn Fast: A New Benchmark for Generalization in RL
Gotta Learn Fast: A New Benchmark for Generalization in RL
Alex Nichol
Vicki Pfau
Christopher Hesse
Oleg Klimov
John Schulman
VLM
OffRL
15
177
0
10 Apr 2018
Learning to Run challenge: Synthesizing physiologically accurate motion
  using deep reinforcement learning
Learning to Run challenge: Synthesizing physiologically accurate motion using deep reinforcement learning
L. Kidzinski
Sharada Mohanty
Carmichael F. Ong
Jennifer Hicks
Sean F. Carroll
Sergey Levine
M. Salathé
Scott L. Delp
40
60
0
31 Mar 2018
Safe end-to-end imitation learning for model predictive control
Safe end-to-end imitation learning for model predictive control
Keuntaek Lee
Kamil Saigol
Evangelos A. Theodorou
BDL
24
24
0
27 Mar 2018
World Models
World Models
David R Ha
Jürgen Schmidhuber
SyDa
53
1,036
0
27 Mar 2018
Setting up a Reinforcement Learning Task with a Real-World Robot
Setting up a Reinforcement Learning Task with a Real-World Robot
A. R. Mahmood
D. Korenkevych
Brent Komer
James Bergstra
26
76
0
19 Mar 2018
Learning to Sequence Robot Behaviors for Visual Navigation
Learning to Sequence Robot Behaviors for Visual Navigation
Hadi Salman
Puneet Singhal
Tanmay Shankar
Peng Yin
A. Salman
William Paivine
Guillaume Sartoretti
Matthew Travers
Howie Choset
20
8
0
05 Mar 2018
Accelerating Natural Gradient with Higher-Order Invariance
Accelerating Natural Gradient with Higher-Order Invariance
Yang Song
Jiaming Song
Stefano Ermon
26
21
0
04 Mar 2018
General Video Game AI: a Multi-Track Framework for Evaluating Agents,
  Games and Content Generation Algorithms
General Video Game AI: a Multi-Track Framework for Evaluating Agents, Games and Content Generation Algorithms
Diego Perez-Liebana
Jialin Liu
Ahmed Khalifa
Raluca D. Gaina
Julian Togelius
Simon Lucas
30
174
0
28 Feb 2018
The Mirage of Action-Dependent Baselines in Reinforcement Learning
The Mirage of Action-Dependent Baselines in Reinforcement Learning
George Tucker
Surya Bhupatiraju
S. Gu
Richard Turner
Zoubin Ghahramani
Sergey Levine
OffRL
30
126
0
27 Feb 2018
Multi-Goal Reinforcement Learning: Challenging Robotics Environments and
  Request for Research
Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research
Matthias Plappert
Marcin Andrychowicz
Alex Ray
Bob McGrew
Bowen Baker
...
Joshua Tobin
Maciek Chociej
Peter Welinder
Vikash Kumar
Wojciech Zaremba
33
558
0
26 Feb 2018
Back to Basics: Benchmarking Canonical Evolution Strategies for Playing
  Atari
Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari
P. Chrabaszcz
I. Loshchilov
Frank Hutter
32
99
0
24 Feb 2018
Verifying Controllers Against Adversarial Examples with Bayesian
  Optimization
Verifying Controllers Against Adversarial Examples with Bayesian Optimization
Shromona Ghosh
Felix Berkenkamp
G. Ranade
S. Qadeer
Ashish Kapoor
AAML
33
45
0
23 Feb 2018
Structured Control Nets for Deep Reinforcement Learning
Structured Control Nets for Deep Reinforcement Learning
Mario Srouji
Jian Zhang
Ruslan Salakhutdinov
33
43
0
22 Feb 2018
Clipped Action Policy Gradient
Clipped Action Policy Gradient
Yasuhiro Fujita
S. Maeda
OffRL
34
37
0
21 Feb 2018
Continual Reinforcement Learning with Complex Synapses
Continual Reinforcement Learning with Complex Synapses
Christos Kaplanis
Murray Shanahan
Claudia Clopath
KELM
32
87
0
20 Feb 2018
State Representation Learning for Control: An Overview
State Representation Learning for Control: An Overview
Timothée Lesort
Natalia Díaz Rodríguez
Jean-François Goudou
David Filliat
OffRL
50
319
0
12 Feb 2018
Learning to Evade Static PE Machine Learning Malware Models via
  Reinforcement Learning
Learning to Evade Static PE Machine Learning Malware Models via Reinforcement Learning
Hyrum S. Anderson
Anant Kharkar
Bobby Filar
David Evans
P. Roth
AAML
38
207
0
26 Jan 2018
Reinforcement Learning based Recommender System using Biclustering
  Technique
Reinforcement Learning based Recommender System using Biclustering Technique
Sungwoon Choi
Heonseok Ha
Uiwon Hwang
Chanju Kim
Jung-Woo Ha
Sungroh Yoon
OffRL
27
70
0
17 Jan 2018
Distributed Deep Reinforcement Learning: Learn how to play Atari games
  in 21 minutes
Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes
Igor Adamski
R. Adamski
T. Grel
Adam Jedrych
Kamil Kaczmarek
Henryk Michalewski
OffRL
41
37
0
09 Jan 2018
Sample-Efficient Reinforcement Learning through Transfer and
  Architectural Priors
Sample-Efficient Reinforcement Learning through Transfer and Architectural Priors
Benjamin Spector
Serge J. Belongie
OffRL
22
15
0
07 Jan 2018
DeepMind Control Suite
DeepMind Control Suite
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
...
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELM
LM&Ro
BDL
74
1,107
0
02 Jan 2018
ES Is More Than Just a Traditional Finite-Difference Approximator
ES Is More Than Just a Traditional Finite-Difference Approximator
Joel Lehman
Jay Chen
Jeff Clune
Kenneth O. Stanley
33
89
0
18 Dec 2017
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative
  for Training Deep Neural Networks for Reinforcement Learning
Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning
F. Such
Vashisht Madhavan
Edoardo Conti
Joel Lehman
Kenneth O. Stanley
Jeff Clune
47
686
0
18 Dec 2017
Deep Primal-Dual Reinforcement Learning: Accelerating Actor-Critic using
  Bellman Duality
Deep Primal-Dual Reinforcement Learning: Accelerating Actor-Critic using Bellman Duality
W. Cho
Mengdi Wang
OffRL
33
14
0
07 Dec 2017
MAgent: A Many-Agent Reinforcement Learning Platform for Artificial
  Collective Intelligence
MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence
Lianmin Zheng
Jiacheng Yang
Han Cai
Weinan Zhang
Jun Wang
Yong Yu
21
206
0
02 Dec 2017
Time Limits in Reinforcement Learning
Time Limits in Reinforcement Learning
Fabio Pardo
Arash Tavakoli
Vitaly Levdik
Petar Kormushev
CLL
44
158
0
01 Dec 2017
Variational Deep Q Network
Variational Deep Q Network
Yunhao Tang
A. Kucukelbir
BDL
43
10
0
30 Nov 2017
Comparing Deep Reinforcement Learning and Evolutionary Methods in
  Continuous Control
Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control
Shangtong Zhang
Osmar R. Zaiane
45
10
0
30 Nov 2017
Divide-and-Conquer Reinforcement Learning
Divide-and-Conquer Reinforcement Learning
Dibya Ghosh
Avi Singh
Aravind Rajeswaran
Vikash Kumar
Sergey Levine
OffRL
51
125
0
27 Nov 2017
Action Branching Architectures for Deep Reinforcement Learning
Action Branching Architectures for Deep Reinforcement Learning
Arash Tavakoli
Fabio Pardo
Petar Kormushev
22
260
0
24 Nov 2017
Backpropagation through the Void: Optimizing control variates for
  black-box gradient estimation
Backpropagation through the Void: Optimizing control variates for black-box gradient estimation
Will Grathwohl
Dami Choi
Yuhuai Wu
Geoffrey Roeder
David Duvenaud
56
300
0
31 Oct 2017
Diff-DAC: Distributed Actor-Critic for Average Multitask Deep
  Reinforcement Learning
Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning
Sergio Valcarcel Macua
Aleksi Tukiainen
D. Hernández
David Baldazo
Enrique Munoz de Cote
S. Zazo
32
29
0
28 Oct 2017
Fast Model Identification via Physics Engines for Data-Efficient Policy
  Search
Fast Model Identification via Physics Engines for Data-Efficient Policy Search
Shaojun Zhu
A. Kimmel
Kostas E. Bekris
Abdeslam Boularias
31
14
0
24 Oct 2017
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces
Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces
Garrett A. Warnell
Nicholas R. Waytowich
Vernon J. Lawhern
Peter Stone
18
267
0
28 Sep 2017
DropoutDAgger: A Bayesian Approach to Safe Imitation Learning
DropoutDAgger: A Bayesian Approach to Safe Imitation Learning
Kunal Menda
Katherine Driggs-Campbell
Mykel J. Kochenderfer
32
28
0
18 Sep 2017
Revisiting the Arcade Learning Environment: Evaluation Protocols and
  Open Problems for General Agents
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
Marlos C. Machado
Marc G. Bellemare
Erik Talvitie
J. Veness
Matthew J. Hausknecht
Michael Bowling
42
546
0
18 Sep 2017
Previous
123...323334
Next