ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.01540
  4. Cited By
OpenAI Gym

OpenAI Gym

5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
    OffRL
    ODL
ArXivPDFHTML

Papers citing "OpenAI Gym"

50 / 1,677 papers shown
Title
Double Actor-Critic with TD Error-Driven Regularization in Reinforcement
  Learning
Double Actor-Critic with TD Error-Driven Regularization in Reinforcement Learning
Haohui Chen
Zhiyong Chen
Aoxiang Liu
Wentuo Fang
OffRL
38
0
0
28 Sep 2024
Optimized Monte Carlo Tree Search for Enhanced Decision Making in the
  FrozenLake Environment
Optimized Monte Carlo Tree Search for Enhanced Decision Making in the FrozenLake Environment
Esteban Aldana Guerra
21
0
0
25 Sep 2024
Adversarial and Reactive Traffic Entities for Behavior-Realistic Driving Simulation: A Review
Adversarial and Reactive Traffic Entities for Behavior-Realistic Driving Simulation: A Review
Joshua Ransiek
Philipp Reis
Tobias Schürmann
Eric Sax
75
0
0
21 Sep 2024
An Efficient Multi-Robot Arm Coordination Strategy for Pick-and-Place
  Tasks using Reinforcement Learning
An Efficient Multi-Robot Arm Coordination Strategy for Pick-and-Place Tasks using Reinforcement Learning
Tizian Jermann
H. Kolvenbach
Fidel Esquivel Estay
Koen Krämer
Marco Hutter
18
0
0
20 Sep 2024
Selective Exploration and Information Gathering in Search and Rescue
  Using Hierarchical Learning Guided by Natural Language Input
Selective Exploration and Information Gathering in Search and Rescue Using Hierarchical Learning Guided by Natural Language Input
Dimitrios Panagopoulos
Adoldo Perrusquia
Weisi Guo
40
2
0
20 Sep 2024
Optimizing Falsification for Learning-Based Control Systems: A
  Multi-Fidelity Bayesian Approach
Optimizing Falsification for Learning-Based Control Systems: A Multi-Fidelity Bayesian Approach
Zahra Shahrooei
Mykel J. Kochenderfer
Ali Baheri
36
1
0
12 Sep 2024
Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning
Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning
Shreyas S R
OffRL
OnRL
36
0
0
10 Sep 2024
Compatible Gradient Approximations for Actor-Critic Algorithms
Compatible Gradient Approximations for Actor-Critic Algorithms
Baturay Saglam
Dionysis Kalogerias
42
0
0
02 Sep 2024
Efficient Multi-agent Navigation with Lightweight DRL Policy
Efficient Multi-agent Navigation with Lightweight DRL Policy
Xingrong Diao
Jiankun Wang
56
0
0
29 Aug 2024
Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey
Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey
Ruiqi Zhang
Jing Hou
Florian Walter
Shangding Gu
Jiayi Guan
Florian Röhrbein
Yali Du
Panpan Cai
G. Chen
Alois Knoll
57
13
0
19 Aug 2024
PREMAP: A Unifying PREiMage APproximation Framework for Neural Networks
PREMAP: A Unifying PREiMage APproximation Framework for Neural Networks
Xiyue Zhang
Benjie Wang
Marta Kwiatkowska
Huan Zhang
AAML
43
2
0
17 Aug 2024
Markov Balance Satisfaction Improves Performance in Strictly Batch
  Offline Imitation Learning
Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning
Rishabh Agrawal
Nathan Dahlin
Rahul Jain
Ashutosh Nayyar
OffRL
41
0
0
17 Aug 2024
Diffusion Model for Planning: A Systematic Literature Review
Diffusion Model for Planning: A Systematic Literature Review
Toshihide Ubukata
Jialong Li
Kenji Tei
DiffM
MedIm
64
6
0
16 Aug 2024
Experimental evaluation of offline reinforcement learning for HVAC
  control in buildings
Experimental evaluation of offline reinforcement learning for HVAC control in buildings
Jun Wang
Linyan Li
Qi Liu
Yu Yang
OffRL
AI4CE
38
1
0
15 Aug 2024
How Well Can Vision Language Models See Image Details?
How Well Can Vision Language Models See Image Details?
Chenhui Gou
Abdulwahab Felemban
Faizan Farooq Khan
Deyao Zhu
Jianfei Cai
Hamid Rezatofighi
Mohamed Elhoseiny
VLM
MLLM
47
4
0
07 Aug 2024
Navigating the Human Maze: Real-Time Robot Pathfinding with Generative
  Imitation Learning
Navigating the Human Maze: Real-Time Robot Pathfinding with Generative Imitation Learning
Martin Moder
Stephen Adhisaputra
Josef Pauli
18
0
0
07 Aug 2024
Discretizing Continuous Action Space with Unimodal Probability
  Distributions for On-Policy Reinforcement Learning
Discretizing Continuous Action Space with Unimodal Probability Distributions for On-Policy Reinforcement Learning
Yuanyang Zhu
Zhi Wang
Yuanheng Zhu
Chunlin Chen
Dongbin Zhao
39
0
0
01 Aug 2024
On the Perturbed States for Transformed Input-robust Reinforcement
  Learning
On the Perturbed States for Transformed Input-robust Reinforcement Learning
Tung M. Luu
Haeyong Kang
Matthew Groh
Thanh Nguyen
Chang D. Yoo
OOD
AAML
OffRL
36
0
0
31 Jul 2024
Reinforcement Learning for Sustainable Energy: A Survey
Reinforcement Learning for Sustainable Energy: A Survey
Koen Ponse
Felix Kleuker
Márton Fejér
Álvaro Serra-Gómez
Aske Plaat
Thomas M. Moerland
OffRL
AI4CE
47
1
0
26 Jul 2024
Affectively Framework: Towards Human-like Affect-Based Agents
Affectively Framework: Towards Human-like Affect-Based Agents
M. Barthet
Roberto Gallotta
Ahmed Khalifa
Antonios Liapis
Georgios N. Yannakakis
24
1
0
25 Jul 2024
Path Following and Stabilisation of a Bicycle Model using a
  Reinforcement Learning Approach
Path Following and Stabilisation of a Bicycle Model using a Reinforcement Learning Approach
Sebastian Weyrer
Peter Manzl
A. L. Schwab
Johannes Gerstmayr
21
0
0
24 Jul 2024
Sustainable broadcasting in Blockchain Networks with Reinforcement Learning
Sustainable broadcasting in Blockchain Networks with Reinforcement Learning
Danila Valko
Daniel Kudenko
49
0
0
22 Jul 2024
Temporal Abstraction in Reinforcement Learning with Offline Data
Temporal Abstraction in Reinforcement Learning with Offline Data
Ranga Shaarad Ayyagari
Anurita Ghosh
Ambedkar Dukkipati
OffRL
39
0
0
21 Jul 2024
VisFly: An Efficient and Versatile Simulator for Training Vision-based
  Flight
VisFly: An Efficient and Versatile Simulator for Training Vision-based Flight
Fanxing Li
Fangyu Sun
Tianbao Zhang
Danping Zou
58
3
0
20 Jul 2024
Model-based Policy Optimization using Symbolic World Model
Model-based Policy Optimization using Symbolic World Model
Andrey Gorodetskiy
Konstantin Mironov
Aleksandr I. Panov
57
0
0
18 Jul 2024
LLM-Empowered State Representation for Reinforcement Learning
LLM-Empowered State Representation for Reinforcement Learning
Boyuan Wang
Yun Qu
Yuhang Jiang
Jianzhun Shao
Chang-rui Liu
Wenming Yang
Xiangyang Ji
45
7
0
18 Jul 2024
A Review of Nine Physics Engines for Reinforcement Learning Research
A Review of Nine Physics Engines for Reinforcement Learning Research
Michael Kaup
Cornelius Wolff
Hyerim Hwang
Julius Mayer
Elia Bruni
AI4CE
50
5
0
11 Jul 2024
Structural Design Through Reinforcement Learning
Structural Design Through Reinforcement Learning
Thomas Rochefort-Beaudoin
Aurelian Vadean
Niels Aage
S. Achiche
AI4CE
33
0
0
10 Jul 2024
Preference-Guided Reinforcement Learning for Efficient Exploration
Preference-Guided Reinforcement Learning for Efficient Exploration
Guojian Wang
Faguo Wu
Xiao Zhang
Tianyuan Chen
Xuyang Chen
Lin Zhao
45
0
0
09 Jul 2024
A Review of Differentiable Simulators
A Review of Differentiable Simulators
Rhys Newbury
Jack Collins
Kerry He
Jiahe Pan
Ingmar Posner
David Howard
Akansel Cosgun
AI4CE
54
9
0
08 Jul 2024
EAGERx: Graph-Based Framework for Sim2real Robot Learning
EAGERx: Graph-Based Framework for Sim2real Robot Learning
B. V. D. Heijden
Jelle Luijkx
Laura Ferranti
Jens Kober
Robert Babuška
44
0
0
05 Jul 2024
VSP: Assessing the dual challenges of perception and reasoning in
  spatial planning tasks for VLMs
VSP: Assessing the dual challenges of perception and reasoning in spatial planning tasks for VLMs
Qiucheng Wu
Handong Zhao
Michael Stephen Saxon
T. Bui
William Yang Wang
Yang Zhang
Shiyu Chang
CoGe
57
5
0
02 Jul 2024
Let Hybrid A* Path Planner Obey Traffic Rules: A Deep Reinforcement
  Learning-Based Planning Framework
Let Hybrid A* Path Planner Obey Traffic Rules: A Deep Reinforcement Learning-Based Planning Framework
Xibo Li
Shruti Patel
Christof Büskens
48
2
0
01 Jul 2024
PUZZLES: A Benchmark for Neural Algorithmic Reasoning
PUZZLES: A Benchmark for Neural Algorithmic Reasoning
Benjamin Estermann
Luca A. Lanzendörfer
Yannick Niedermayr
Roger Wattenhofer
63
3
0
29 Jun 2024
Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning
Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning
Gautham Vasan
Yan Wang
Fahim Shahriar
James Bergstra
Martin Jägersand
A. R. Mahmood
40
1
0
29 Jun 2024
Tradeoffs When Considering Deep Reinforcement Learning for Contingency
  Management in Advanced Air Mobility
Tradeoffs When Considering Deep Reinforcement Learning for Contingency Management in Advanced Air Mobility
Luis E. Alvarez
Marc W. Brittain
Steven D. Young
37
0
0
28 Jun 2024
Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL
  Agents
Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents
Chung-En Sun
Sicun Gao
Tsui-Wei Weng
AAML
31
3
0
26 Jun 2024
Tolerance of Reinforcement Learning Controllers against Deviations in
  Cyber Physical Systems
Tolerance of Reinforcement Learning Controllers against Deviations in Cyber Physical Systems
Changjian Zhang
Parv Kapoor
Eunsuk Kang
Romulo Meira-Goes
David Garlan
Akila Ganlath
Shatadal Mishra
N. Ammar
49
0
0
24 Jun 2024
Reproducibility in Machine Learning-based Research: Overview, Barriers and Drivers
Reproducibility in Machine Learning-based Research: Overview, Barriers and Drivers
Harald Semmelrock
Tony Ross-Hellauer
Simone Kopeinik
Dieter Theiler
Armin Haberl
Stefan Thalmann
Dominik Kowald
65
7
0
20 Jun 2024
Learning the Approach During the Short-loading Cycle Using Reinforcement
  Learning
Learning the Approach During the Short-loading Cycle Using Reinforcement Learning
Carl Borngrund
Ulf Bodin
Henrik Andreasson
Fredrik Sandin
15
0
0
19 Jun 2024
A Systematization of the Wagner Framework: Graph Theory Conjectures and
  Reinforcement Learning
A Systematization of the Wagner Framework: Graph Theory Conjectures and Reinforcement Learning
Flora Angileri
Giulia Lombardi
Andrea Fois
Renato Faraone
C. Metta
...
M. Fantozzi
S. Galfrè
Daniele Pavesi
Maurizio Parton
F. Morandin
26
2
0
18 Jun 2024
When Vision Meets Touch: A Contemporary Review for Visuotactile Sensors
  from the Signal Processing Perspective
When Vision Meets Touch: A Contemporary Review for Visuotactile Sensors from the Signal Processing Perspective
Shoujie Li
Zihan Wang
Changsheng Wu
Xiang Li
Shan Luo
Bin Fang
Fuchun Sun
Xiao-Ping Zhang
Wenbo Ding
AI4TS
47
11
0
18 Jun 2024
CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep
  Reinforcement Learning Algorithms
CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep Reinforcement Learning Algorithms
Arda Sarp Yenicesu
Furkan B. Mutlu
Suleyman Serdar Kozat
Ozgur S. Oguz
25
1
0
13 Jun 2024
Deep Reinforcement Learning-based Quadcopter Controller: A Practical
  Approach and Experiments
Deep Reinforcement Learning-based Quadcopter Controller: A Practical Approach and Experiments
Truong-Dong Do
Nguyen Xuan Mung
Sung Kyung Hong
33
0
0
13 Jun 2024
RRLS : Robust Reinforcement Learning Suite
RRLS : Robust Reinforcement Learning Suite
Adil Zouitine
David Bertoin
Pierre Clavier
Matthieu Geist
Emmanuel Rachelson
OffRL
37
2
0
12 Jun 2024
RILe: Reinforced Imitation Learning
RILe: Reinforced Imitation Learning
Mert Albaba
Sammy Christen
Christoph Gebhardt
Thomas Langarek
Otmar Hilliges
Otmar Hilliges
55
1
0
12 Jun 2024
PufferLib: Making Reinforcement Learning Libraries and Environments Play
  Nice
PufferLib: Making Reinforcement Learning Libraries and Environments Play Nice
Joseph Suarez
AI4CE
56
2
0
11 Jun 2024
Integrating Domain Knowledge for handling Limited Data in Offline RL
Integrating Domain Knowledge for handling Limited Data in Offline RL
Briti Gangopadhyay
Zhao Wang
Jia-Fong Yeh
Shingo Takamatsu
OffRL
37
0
0
11 Jun 2024
Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach
  For Adaptive Brain Stimulation
Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach For Adaptive Brain Stimulation
Michelle Pan
Mariah L. Schrum
Vivek Myers
Erdem Bıyık
Anca Dragan
28
0
0
10 Jun 2024
Adaptive Opponent Policy Detection in Multi-Agent MDPs: Real-Time
  Strategy Switch Identification Using Running Error Estimation
Adaptive Opponent Policy Detection in Multi-Agent MDPs: Real-Time Strategy Switch Identification Using Running Error Estimation
Mohidul Haque Mridul
Mohammad Foysal Khan
Redwan Ahmed Rizvee
Md. Mosaddek Khan
AAML
21
0
0
10 Jun 2024
Previous
123456...323334
Next