Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 1,677 papers shown
Title
MassSpecGym: A benchmark for the discovery and identification of molecules
Roman Bushuiev
Anton Bushuiev
Niek F. de Jonge
A. Young
Fleming Kretschmer
...
Justin J. J. van der Hooft
Michael A. Stravs
Sebastian Böcker
Josef Sivic
Tomáš Pluskal
54
4
0
17 Feb 2025
SpikingSoft: A Spiking Neuron Controller for Bio-inspired Locomotion with Soft Snake Robots
Chuhan Zhang
Cong Wang
Wei Pan
Cosimo Della Santina
78
0
0
31 Jan 2025
DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems
Se-Wook Yoo
Seung-Woo Seo
63
0
0
30 Jan 2025
Divergence-Augmented Policy Optimization
Qing Wang
Yingru Li
Jiechao Xiong
Tong Zhang
OffRL
55
16
0
28 Jan 2025
Upside Down Reinforcement Learning with Policy Generators
Jacopo Di Ventura
Dylan R. Ashley
Vincent Herrmann
Francesco Faccio
Jürgen Schmidhuber
44
0
0
27 Jan 2025
A Tensor Low-Rank Approximation for Value Functions in Multi-Task Reinforcement Learning
Sergio Rozada
Santiago Paternain
J. Bazerque
Antonio G. Marques
69
0
0
17 Jan 2025
An Empirical Study of Deep Reinforcement Learning in Continuing Tasks
Yi Wan
D. Korenkevych
Zheqing Zhu
OffRL
CLL
55
0
0
12 Jan 2025
Investigating the Impact of Communication-Induced Action Space on Exploration of Unknown Environments with Decentralized Multi-Agent Reinforcement Learning
Gabriele Calzolari
Vidya Sumathy
Christoforos Kanellakis
G. Nikolakopoulos
53
0
0
31 Dec 2024
On Reward Transferability in Adversarial Inverse Reinforcement Learning: Insights from Random Matrix Theory
Yangchun Zhang
Wang Zhou
Yirui Zhou
60
0
0
31 Dec 2024
Mimicking-Bench: A Benchmark for Generalizable Humanoid-Scene Interaction Learning via Human Mimicking
Yun-Hai Liu
Bowen Yang
Licheng Zhong
He Wang
Li Yi
58
5
0
23 Dec 2024
Reinforcement Learning with a Focus on Adjusting Policies to Reach Targets
Akane Tsuboya
Yu Kono
Tatsuji Takahashi
41
0
0
23 Dec 2024
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
Kun Wu
Yinuo Zhao
Zhihao Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feiferi Feng
OffRL
109
1
0
22 Dec 2024
Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference
Matthew D Riemer
G. Subbaraj
Glen Berseth
Irina Rish
OffRL
90
1
0
18 Dec 2024
Design of Restricted Normalizing Flow towards Arbitrary Stochastic Policy with Computational Efficiency
Taisuke Kobayashi
Takumi Aotani
145
5
0
17 Dec 2024
Adaptive Reward Design for Reinforcement Learning
Minjae Kwon
Ingy Elsayed-Aly
Lu Feng
80
2
0
14 Dec 2024
RL2: Reinforce Large Language Model to Assist Safe Reinforcement Learning for Energy Management of Active Distribution Networks
Xu Yang
Chenhui Lin
Haotian Liu
Wenchuan Wu
90
1
0
02 Dec 2024
SANGO: Socially Aware Navigation through Grouped Obstacles
Rahath Malladi
Amol Harsh
Arshia Sangwan
Sunita Chauhan
Sandeep Manjanna
67
0
0
29 Nov 2024
Joint Combinatorial Node Selection and Resource Allocations in the Lightning Network using Attention-based Reinforcement Learning
Mahdi Salahshour
Amirahmad Shafiee
Mojtaba Tefagh
76
0
0
26 Nov 2024
Broad Critic Deep Actor Reinforcement Learning for Continuous Control
Shiron Thalagala
Pak Kin Wong
Xiaozheng Wang
Tianang Sun
OffRL
81
0
0
24 Nov 2024
Creating Hierarchical Dispositions of Needs in an Agent
Tofara Moyo
91
0
0
23 Nov 2024
Multi-Agent Environments for Vehicle Routing Problems
Ricardo Gama
Daniel Fuertes
Carlos R. del-Blanco
Hugo L. Fernandes
AI4CE
89
0
0
21 Nov 2024
SuPLE: Robot Learning with Lyapunov Rewards
Phu Nguyen
Daniel Polani
Stas Tiomkin
69
0
0
20 Nov 2024
Multi-agent Path Finding for Timed Tasks using Evolutionary Games
Sheryl Paul
Anand Balakrishnan
Xin Qin
Jyotirmoy V. Deshmukh
33
0
0
15 Nov 2024
Precision-Focused Reinforcement Learning Model for Robotic Object Pushing
Lara Bergmann
David P. Leins
R. Haschke
Klaus Neumann
42
3
0
13 Nov 2024
CROPS: A Deployable Crop Management System Over All Possible State Availabilities
Jing Wu
Zhixin Lai
Shengjie Liu
Suiyao Chen
Ran Tao
Pan Zhao
Chuyuan Tao
Yikun Cheng
N. Hovakimyan
OffRL
58
0
0
09 Nov 2024
Learn to Solve Vehicle Routing Problems ASAP: A Neural Optimization Approach for Time-Constrained Vehicle Routing Problems with Finite Vehicle Fleet
Elija Deineko
Carina Kehrt
29
1
0
07 Nov 2024
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF
Heyang Zhao
Chenlu Ye
Quanquan Gu
Tong Zhang
OffRL
71
3
0
07 Nov 2024
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
Kai Yan
Alex Schwing
Yu-xiong Wang
OffRL
OnRL
41
0
0
31 Oct 2024
Local Linearity: the Key for No-regret Reinforcement Learning in Continuous MDPs
Davide Maran
Alberto Maria Metelli
Matteo Papini
Marcello Restelli
44
0
0
31 Oct 2024
SoftCTRL: Soft conservative KL-control of Transformer Reinforcement Learning for Autonomous Driving
Minh Tri Huynh
Duc Dung Nguyen
OffRL
31
0
0
30 Oct 2024
Environment as Policy: Learning to Race in Unseen Tracks
Hongze Wang
Jiaxu Xing
Nico Messikommer
Davide Scaramuzza
34
1
0
29 Oct 2024
A Multi-Agent Reinforcement Learning Testbed for Cognitive Radio Applications
Sriniketh Vangaru
Daniel Rosen
Dylan Green
Raphael Rodriguez
Maxwell Wiecek
Amos Johnson
Alyse M. Jones
William C. Headley
42
1
0
28 Oct 2024
Robustness and Generalization in Quantum Reinforcement Learning via Lipschitz Regularization
Nico Meyer
Julian Berberich
Christopher Mutschler
Daniel D. Scherer
37
0
0
28 Oct 2024
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning
Kaiyan Zhao
Yiming Wang
Yuyang Chen
Yan Li
Leong Hou U
Xiaoguang Niu
46
1
0
27 Oct 2024
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
119
2
0
23 Oct 2024
Guiding Reinforcement Learning with Incomplete System Dynamics
Shuyuan Wang
Jingliang Duan
Nathan P. Lawrence
Philip D. Loewen
M. Forbes
R. Bhushan Gopaluni
Lixian Zhang
29
0
0
22 Oct 2024
Reinfier and Reintrainer: Verification and Interpretation-Driven Safe Deep Reinforcement Learning Frameworks
Zixuan Yang
Jiaqi Zheng
Guihai Chen
OffRL
41
0
0
19 Oct 2024
Transformer Guided Coevolution: Improved Team Selection in Multiagent Adversarial Team Games
Pranav Rajbhandari
Prithviraj Dasgupta
D. Sofge
21
0
0
17 Oct 2024
Continual Deep Reinforcement Learning to Prevent Catastrophic Forgetting in Jamming Mitigation
Kemal Davaslioglu
Sastry Kompella
T. Erpek
Y. Sagduyu
26
1
0
14 Oct 2024
iFANnpp: Nuclear Power Plant Digital Twin for Robots and Autonomous Intelligence
Youndo Do
Marc Zebrowitz
Jackson Stahl
Fan Zhang
AI4CE
24
0
0
11 Oct 2024
Neuroplastic Expansion in Deep Reinforcement Learning
Jiashun Liu
J. Obando-Ceron
Rameswar Panda
L. Pan
47
3
0
10 Oct 2024
Solving Multi-Goal Robotic Tasks with Decision Transformer
Paul Gajewski
Dominik Zurek
Marcin Pietroñ
Kamil Faber
OffRL
37
1
0
08 Oct 2024
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
Claire Chen
Shuze Liu
Shangtong Zhang
OffRL
210
1
0
08 Oct 2024
Control-oriented Clustering of Visual Latent Representation
Han Qi
Haocheng Yin
Heng Yang
SSL
69
2
0
07 Oct 2024
DABI: Evaluation of Data Augmentation Methods Using Downsampling in Bilateral Control-Based Imitation Learning with Images
Masato Kobayashi
Thanpimon Buamanee
Yuki Uranishi
26
2
0
06 Oct 2024
Urban Computing for Climate and Environmental Justice: Early Perspectives From Two Research Initiatives
Carolina Veiga
Ashish Sharma
Daniel de Oliveira
Marcos Lage
Fabio Miranda
AI4CE
44
0
0
06 Oct 2024
Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space
Yangming Li
Chieh-Hsin Lai
Carola-Bibiane Schönlieb
Yuki Mitsufuji
Stefano Ermon
DiffM
48
0
0
02 Oct 2024
AARK: An Open Toolkit for Autonomous Racing Research
J. Bockman
Matthew Howe
Adrian Orenstein
Feras Dayoub
39
0
0
01 Oct 2024
Learning to Swim: Reinforcement Learning for 6-DOF Control of Thruster-driven Autonomous Underwater Vehicles
Levi Cai
Kevin Chang
Yogesh A. Girdhar
32
1
0
30 Sep 2024
Constrained Reinforcement Learning for Safe Heat Pump Control
Baohe Zhang
Lilli Frison
Thomas Brox
Joschka Bödecker
AI4CE
26
0
0
29 Sep 2024
Previous
1
2
3
4
5
...
32
33
34
Next