Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1606.01540
Cited By
OpenAI Gym
5 June 2016
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"OpenAI Gym"
50 / 1,672 papers shown
Title
lmgame-Bench: How Good are LLMs at Playing Games?
Lanxiang Hu
Mingjia Huo
Yu Zhang
Haoyang Yu
Eric P. Xing
Ion Stoica
Tajana Rosing
Haojian Jin
Hao Zhang
22
0
0
21 May 2025
Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation
Runze Zhao
Yue Yu
Adams Yiyue Zhu
Chen Yang
Dongruo Zhou
22
0
0
20 May 2025
Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning
Dongsu Lee
Minhae Kwon
OffRL
17
0
0
19 May 2025
Learning Probabilistic Temporal Logic Specifications for Stochastic Systems
Rajarshi Roy
Yash Pote
David Parker
Marta Kwiatkowska
21
0
0
17 May 2025
Can Global XAI Methods Reveal Injected Bias in LLMs? SHAP vs Rule Extraction vs RuleSHAP
Francesco Sovrano
34
0
0
16 May 2025
Visual Planning: Let's Think Only with Images
Yi Xu
Chengzu Li
Han Zhou
Xingchen Wan
Caiqi Zhang
Anna Korhonen
Ivan Vulić
LM&Ro
LRM
24
0
0
16 May 2025
ReaCritic: Large Reasoning Transformer-based DRL Critic-model Scaling For Heterogeneous Networks
Feiran You
Hongyang Du
OffRL
LRM
32
0
0
16 May 2025
Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps
Ningyuan Yang
Jiaxuan Gao
Feng Gao
Yi Wu
Chao Yu
55
0
0
15 May 2025
Diffusion-SAFE: Shared Autonomy Framework with Diffusion for Safe Human-to-Robot Driving Handover
Yunxin Fan
Monroe Kennedy III
30
0
0
15 May 2025
ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts
Jing-Cheng Pang
Kaiyuan Li
Yansen Wang
Si-Hang Yang
Shengyi Jiang
Yang Yu
OffRL
LLMAG
LM&Ro
LRM
21
0
0
15 May 2025
High-order Regularization for Machine Learning and Learning-based Control
Xinghua Liu
Ming Cao
30
0
0
13 May 2025
Monte Carlo Beam Search for Actor-Critic Reinforcement Learning in Continuous Control
Hazim Alzorgan
Abolfazl Razi
39
0
0
13 May 2025
Decentralized Distributed Proximal Policy Optimization (DD-PPO) for High Performance Computing Scheduling on Multi-User Systems
Matthew Sgambati
Aleksandar Vakanski
Matthew Anderson
37
0
0
06 May 2025
TutorGym: A Testbed for Evaluating AI Agents as Tutors and Students
Daniel Weitekamp
M. N. Siddiqui
Christopher MacLellan
LLMAG
ELM
42
0
0
02 May 2025
Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning
Lang Feng
Weihao Tan
Zhiyi Lyu
Longtao Zheng
Haiyang Xu
Ming Yan
Fei Huang
Jingyi Wang
34
0
0
01 May 2025
Return Capping: Sample-Efficient CVaR Policy Gradient Optimisation
Harry Mead
Clarissa Costen
Bruno Lacerda
Nick Hawes
29
0
0
29 Apr 2025
DeeP-Mod: Deep Dynamic Programming based Environment Modelling using Feature Extraction
Chris Child
Lam Ngo
49
0
0
29 Apr 2025
Fitness Landscape of Large Language Model-Assisted Automated Algorithm Search
Fei Liu
Qingfu Zhang
Xialiang Tong
M. Yuan
K. Mao
77
0
0
28 Apr 2025
HyperController: A Hyperparameter Controller for Fast and Stable Training of Reinforcement Learning Neural Networks
J. Gornet
Yiannis Kantaros
Bruno Sinopoli
233
0
0
27 Apr 2025
Recursive Deep Inverse Reinforcement Learning
Paul Ghanem
Michael Potter
Owen Howell
Pau Closas
A. Ramezani
Deniz Erdogmus
Tales Imbiriba
34
0
0
17 Apr 2025
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
Xuyang Chen
Guojian Wang
Keyu Yan
Lin Zhao
OffRL
42
1
0
16 Apr 2025
pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild
Jonas Myhre Schiøtt
Viktor Sebastian Petersen
Dimitrios P. Papadopoulos
VLM
40
0
0
16 Apr 2025
Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss
Ukjo Hwang
Songnam Hong
OffRL
41
0
0
14 Apr 2025
TRATSS: Transformer-Based Task Scheduling System for Autonomous Vehicles
Yazan Youssef
Paulo Ricardo Marques de Araujo
Aboelmagd Noureldin
Sidney Givigi
31
0
0
07 Apr 2025
Sim4EndoR: A Reinforcement Learning Centered Simulation Platform for Task Automation of Endovascular Robotics
Tianliang Yao
Madaoji Ban
Bo Lu
Zhiqiang Pei
Peng Qi
44
2
0
04 Apr 2025
A Constrained Multi-Agent Reinforcement Learning Approach to Autonomous Traffic Signal Control
Anirudh Satheesh
Keenan Powell
50
0
0
30 Mar 2025
On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations
Rajdeep Singh Hundal
Yan Xiao
Xiaochun Cao
Jin Dong
Manuel Rigger
56
0
0
28 Mar 2025
Bridging Evolutionary Multiobjective Optimization and GPU Acceleration via Tensorization
Zhenyu Liang
Hao Li
Naiwei Yu
Kebin Sun
Ran Cheng
73
1
0
26 Mar 2025
FF-SRL: High Performance GPU-Based Surgical Simulation For Robot Learning
Diego DallÁlba
Michał Nasket
Sabina Kaminska
Przemysław Korzeniowski
OffRL
AI4CE
67
1
0
24 Mar 2025
Mining-Gym: A Configurable RL Benchmarking Environment for Truck Dispatch Scheduling
C. Banerjee
Kien Nguyen
Clinton Fookes
OffRL
67
0
0
24 Mar 2025
Computationally and Sample Efficient Safe Reinforcement Learning Using Adaptive Conformal Prediction
Hao Zhou
Yanze Zhang
Wenhao Luo
44
0
0
22 Mar 2025
Survey on Evaluation of LLM-based Agents
Asaf Yehudai
Lilach Eden
Alan Li
Guy Uziel
Yilun Zhao
Roy Bar-Haim
Arman Cohan
Michal Shmueli-Scheuer
LLMAG
ELM
Presented at
ResearchTrend Connect | LLMAG
on
07 May 2025
107
8
0
20 Mar 2025
APF+: Boosting adaptive-potential function reinforcement learning methods with a W-shaped network for high-dimensional games
Yifei Chen
Lambert Schomaker
46
0
0
17 Mar 2025
MarineGym: A High-Performance Reinforcement Learning Platform for Underwater Robotics
Shuguang Chu
Zebin Huang
Yutong Li
Mingwei Lin
Ignacio Carlucho
Y. Pétillot
Canjun Yang
OffRL
AI4CE
48
0
0
13 Mar 2025
A nonlinear real time capable motion cueing algorithm based on deep reinforcement learning
Hendrik Scheidel
Camilo Gonzalez
Houshyar Asadi
Tobias Bellmann
A. Seefried
Shady M. K. Mohamed
Saeid Nahavandi
55
0
0
13 Mar 2025
Safe exploration in reproducing kernel Hilbert spaces
Abdullah Tokmak
Kiran G. Krishnan
Thomas B. Schon
Dominik Baumann
47
0
0
13 Mar 2025
RESTRAIN: Reinforcement Learning-Based Secure Framework for Trigger-Action IoT Environment
Md Morshed Alam
Lokesh Chandra Das
Sandip Roy
Sachin Shetty
Weichao Wang
AAML
OffRL
61
0
0
12 Mar 2025
Soft Actor-Critic-based Control Barrier Adaptation for Robust Autonomous Navigation in Unknown Environments
Nicholas Mohammad
Nicola Bezzo
62
1
0
11 Mar 2025
Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation
Mohit Prashant
Arvind Easwaran
Suman Das
Michael Yuhas
OffRL
80
1
0
07 Mar 2025
Review of Machine Learning for Micro-Electronic Design Verification
Christopher Bennett
Kerstin Eder
38
0
0
05 Mar 2025
Actor-Critic Cooperative Compensation to Model Predictive Control for Off-Road Autonomous Vehicles Under Unknown Dynamics
Prakhar Gupta
J. Smereka
Yunyi Jia
47
0
0
01 Mar 2025
Equivariant Reinforcement Learning Frameworks for Quadrotor Low-Level Control
Beomyeol Yu
Taeyoung Lee
44
0
0
27 Feb 2025
Physics-Driven Data Generation for Contact-Rich Manipulation via Trajectory Optimization
Lujie Yang
H.J. Terry Suh
Tong Zhao
B. P. Graesdal
Tarik Kelestemur
Jiuguang Wang
Tao Pang
Russ Tedrake
91
3
0
27 Feb 2025
SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning
Xuyang Li
Romit Maulik
53
0
0
24 Feb 2025
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
75
1
0
24 Feb 2025
TAG: A Decentralized Framework for Multi-Agent Hierarchical Reinforcement Learning
Giuseppe Paolo
Abdelhakim Benechehab
Hamza Cherkaoui
Albert Thomas
Balázs Kégl
57
0
0
21 Feb 2025
Warm Starting of CMA-ES for Contextual Optimization Problems
Yuta Sekino
Kento Uchida
Shinichi Shirakawa
86
0
0
18 Feb 2025
RobotIQ: Empowering Mobile Robots with Human-Level Planning for Real-World Execution
Emmanuel K. Raptis
Athanasios Ch. Kapoutsis
Elias B. Kosmatopoulos
LM&Ro
89
0
0
18 Feb 2025
Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options
Lakshmi Nair
Ian Trase
Mark Kim
AIFin
LRM
AI4CE
55
1
0
18 Feb 2025
MassSpecGym: A benchmark for the discovery and identification of molecules
Roman Bushuiev
Anton Bushuiev
Niek F. de Jonge
A. Young
Fleming Kretschmer
...
Justin J. J. van der Hooft
Michael A. Stravs
Sebastian Böcker
Josef Sivic
Tomáš Pluskal
54
4
0
17 Feb 2025
1
2
3
4
...
32
33
34
Next