Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1802.09464
Cited By
v1
v2 (latest)
Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research
26 February 2018
Matthias Plappert
Marcin Andrychowicz
Alex Ray
Bob McGrew
Bowen Baker
Glenn Powell
Jonas Schneider
Joshua Tobin
Maciek Chociej
Peter Welinder
Vikash Kumar
Wojciech Zaremba
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research"
50 / 370 papers shown
Partially Equivariant Reinforcement Learning in Symmetry-Breaking Environments
Junwoo Chang
Minwoo Park
Joohwan Seo
R. Horowitz
Jongmin Lee
Jongeun Choi
41
1
0
30 Nov 2025
MagBotSim: Physics-Based Simulation and Reinforcement Learning Environments for Magnetic Robotics
Lara Bergmann
Cedric Grothues
Klaus Neumann
105
0
0
20 Nov 2025
Physically-Grounded Goal Imagination: Physics-Informed Variational Autoencoder for Self-Supervised Reinforcement Learning
Lan Thi Ha Nguyen
Kien Ton Manh
Anh Do Duc
Nam Pham Hai
DRL
SSL
AI4CE
513
0
0
10 Nov 2025
Reinforcement Learning for Robotic Safe Control with Force Sensing
Nan Lin
Linrui Zhang
Yuxuan Chen
Z. Chen
Yujun Zhu
Ruoxi Chen
Peichen Wu
Xiaoping Chen
44
9
0
30 Oct 2025
Survey and Tutorial of Reinforcement Learning Methods in Process Systems Engineering
Maximilian Bloor
M. Mowbray
Ehecatl Antonio del Rio Chanona
Calvin Tsay
OffRL
128
0
0
28 Oct 2025
Safe But Not Sorry: Reducing Over-Conservatism in Safety Critics via Uncertainty-Aware Modulation
Daniel Bethell
Simos Gerasimou
R. Calinescu
Calum Imrie
88
0
0
21 Oct 2025
D2C-HRHR: Discrete Actions with Double Distributional Critics for High-Risk-High-Return Tasks
Jundong Zhang
Yuhui Situ
Fanji Zhang
Rongji Deng
Tianqi Wei
OffRL
91
0
0
20 Oct 2025
Asymptotically Stable Quaternion-valued Hopfield-structured Neural Network with Periodic Projection-based Supervised Learning Rules
Tianwei Wang
Xinhui Ma
Wei Pang
88
0
0
18 Oct 2025
Restoring Noisy Demonstration for Imitation Learning With Diffusion Models
IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2025
Shang-Fu Chen
Co Yong
Shao-Hua Sun
DiffM
124
0
0
16 Oct 2025
When a Robot is More Capable than a Human: Learning from Constrained Demonstrators
Xinhu Li
Ayush Jain
Zhaojing Yang
Yigit Korkmaz
Erdem Bıyık
76
0
0
10 Oct 2025
DexMan: Learning Bimanual Dexterous Manipulation from Human and Generated Videos
Jhen Hsieh
Kuan-Hsun Tu
Kuo-Han Hung
Tsung-Wei Ke
148
1
0
09 Oct 2025
BuilderBench -- A benchmark for generalist agents
Raj Ghugare
Catherine Ji
Kathryn Wantlin
Jin Schofield
Benjamin Eysenbach
132
1
0
07 Oct 2025
General and Efficient Visual Goal-Conditioned Reinforcement Learning using Object-Agnostic Masks
Fahim Shahriar
Cheryl Wang
Alireza Azimi
Gautham Vasan
Hany Hamed Elanwar
A. Rupam Mahmood
Colin Bellinger
108
0
0
06 Oct 2025
D2 Actor Critic: Diffusion Actor Meets Distributional Critic
Lunjun Zhang
Shuo Han
Hanrui Lyu
Bradly C. Stadie
OffRL
243
1
0
03 Oct 2025
Uncertainty-Based Smooth Policy Regularisation for Reinforcement Learning with Few Demonstrations
Yujie Zhu
Charles A. Hepburn
Matthew Thorpe
Giovanni Montana
180
0
0
19 Sep 2025
The Role of Touch: Towards Optimal Tactile Sensing Distribution in Anthropomorphic Hands for Dexterous In-Hand Manipulation
Joao Damiao Almeida
Egidio Falotico
Cecilia Laschi
J. Santos-Victor
92
0
0
18 Sep 2025
Autonomous Learning From Success and Failure: Goal-Conditioned Supervised Learning with Negative Feedback
Zeqiang Zhang
Fabian Wurzberger
Gerrit Schmid
Sebastian Gottwald
Daniel A. Braun
SSL
220
0
0
03 Sep 2025
Goal-Conditioned Reinforcement Learning for Data-Driven Maritime Navigation
Vaishnav Vaidheeswaran
Dilith Jayakody
Samruddhi Mulay
Anand Lo
Md Mahbub Alam
Gabriel Spadon
110
1
0
01 Sep 2025
Towards Safe Imitation Learning via Potential Field-Guided Flow Matching
Haoran Ding
Anqing Duan
Zezhou Sun
Leonel Rozo
Noémie Jaquier
Dezhen Song
Yoshihiko Nakamura
140
0
0
12 Aug 2025
Sensor-Space Based Robust Kinematic Control of Redundant Soft Manipulator by Learning
Yinan Meng
Kun Qian
Jiong Yang
Renbo Su
Zhenhong Li
Charlie C. L. Wang
136
0
0
19 Jul 2025
Assessing Adaptive World Models in Machines with Novel Games
Lance Ying
Katherine M. Collins
Prafull Sharma
Cédric Colas
Kaiya Ivy Zhao
...
Jacob Andreas
Thomas Griffiths
François Chollet
Kelsey R. Allen
J. Tenenbaum
227
10
0
17 Jul 2025
Dual-Objective Reinforcement Learning with Novel Hamilton-Jacobi-Bellman Formulations
William Sharpless
Dylan Hirsch
S. Tonkens
Nikhil Shinde
Sylvia Herbert
178
2
0
19 Jun 2025
Goal-based Self-Adaptive Generative Adversarial Imitation Learning (Goal-SAGAIL) for Multi-goal Robotic Manipulation Tasks
Yingyi Kuang
Luis J. Manso
George Vogiatzis
111
0
0
15 Jun 2025
Fast Bayesian Optimization of Function Networks with Partial Evaluations
Poompol Buathong
P. Frazier
171
0
0
13 Jun 2025
Risk-Sensitive Agent Compositions
Guruprerana Shabadi
Rajeev Alur
245
0
0
05 Jun 2025
Reachability Weighted Offline Goal-conditioned Resampling
Wenyan Yang
Joni Pajarinen
OffRL
199
0
0
03 Jun 2025
Safely Learning Controlled Stochastic Dynamics
Luc Brogat-Motte
Alessandro Rudi
Riccardo Bonalli
208
0
0
03 Jun 2025
Prior Reinforce: Mastering Agile Tasks with Limited Trials
Yihang Hu
Pingyue Sheng
Shengjie Wang
Yang Gao
Yang Gao
268
0
0
28 May 2025
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
Leander Diaz-Bone
Marco Bagatella
Jonas Hübotter
Andreas Krause
OffRL
301
4
0
26 May 2025
CiRL: Open-Source Environments for Reinforcement Learning in Circular Economy and Net Zero
Federico Zocco
Andrea Corti
Monica Malvezzi
AI4CE
331
1
0
24 May 2025
Imagine Beyond! Distributionally Robust Auto-Encoding for State Space Coverage in Online Reinforcement Learning
Nicolas Castanet
Olivier Sigaud
Sylvain Lamprier
OffRL
424
0
0
23 May 2025
TeleOpBench: A Simulator-Centric Benchmark for Dual-Arm Dexterous Teleoperation
Hangyu Li
Qin Zhao
Haoran Xu
Xinyu Jiang
Qingwei Ben
...
Jia Zeng
Hanqing Wang
Bo Dai
Junting Dong
Jiangmiao Pang
448
4
0
19 May 2025
Exploration by Random Distribution Distillation
Zhirui Fang
Kai Yang
Jian Tao
Jiafei Lyu
Lusong Li
Li Shen
Xiu Li
306
1
0
16 May 2025
Constructing an Optimal Behavior Basis for the Option Keyboard
L. N. Alegre
A. Bazzan
André Barreto
Bruno C. da Silva
233
1
0
01 May 2025
TW-CRL: Time-Weighted Contrastive Reward Learning for Efficient Inverse Reinforcement Learning
Yuxuan Li
Yicheng Gao
Ning Yang
Stephen Xia
OffRL
374
0
0
08 Apr 2025
Neural Lyapunov Function Approximation with Self-Supervised Reinforcement Learning
IEEE International Conference on Robotics and Automation (ICRA), 2025
Luc McCutcheon
Bahman Gharesifard
Saber Fallah
214
1
0
19 Mar 2025
Target Return Optimizer for Multi-Game Decision Transformer
Kensuke Tatematsu
Akifumi Wachi
OffRL
243
0
0
04 Mar 2025
Reducing Reward Dependence in RL Through Adaptive Confidence Discounting
Muhammed Yusuf Satici
David L. Roberts
OffRL
222
0
0
28 Feb 2025
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
International Conference on Learning Representations (ICLR), 2025
Shangding Gu
Laixi Shi
Muning Wen
Ming Jin
Eric Mazumdar
Yuejie Chi
Adam Wierman
C. Spanos
OOD
OffRL
345
10
0
27 Feb 2025
Warm Starting of CMA-ES for Contextual Optimization Problems
Parallel Problem Solving from Nature (PPSN), 2025
Yuta Sekino
Kento Uchida
Shinichi Shirakawa
324
1
0
18 Feb 2025
Exploring the Generalizability of Geomagnetic Navigation: A Deep Reinforcement Learning approach with Policy Distillation
IEEE Transactions on Instrumentation and Measurement (IEEE Trans. Instrum. Meas.), 2025
Wenqi Bai
Shiliang Zhang
Xiaohui Zhang
Xuehui Ma
Songnan Yang
Yushuai Li
Tingwen Huang
106
3
0
07 Feb 2025
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
Hao Sun
M. Schaar
407
23
0
28 Jan 2025
Reinforcement Teaching
Alex Lewandowski
Calarina Muslimani
Dale Schuurmans
Matthew E. Taylor
Jun Luo
419
2
0
28 Jan 2025
Mimicking-Bench: A Benchmark for Generalizable Humanoid-Scene Interaction Learning via Human Mimicking
Yun-Hai Liu
Bowen Yang
Licheng Zhong
He Wang
Li Yi
325
15
0
23 Dec 2024
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering
Xinyan Guan
Yanjiang Liu
Xinyu Lu
Boxi Cao
Xianpei Han
...
Le Sun
Jie Lou
Bowen Yu
Yaojie Lu
Hongyu Lin
ALM
574
8
0
18 Nov 2024
Precision-Focused Reinforcement Learning Model for Robotic Object Pushing
Lara Bergmann
David P. Leins
R. Haschke
Klaus Neumann
226
7
0
13 Nov 2024
Learning World Models for Unconstrained Goal Navigation
Neural Information Processing Systems (NeurIPS), 2024
Yuanlin Duan
Wensen Mao
He Zhu
227
5
0
03 Nov 2024
Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2024
Yuanlin Duan
Guofeng Cui
He Zhu
OffRL
373
0
0
03 Nov 2024
Maximum Entropy Hindsight Experience Replay
Douglas C. Crowder
Matthew L. Trappett
Darrien M. McKenzie
Frances S. Chance
101
0
0
31 Oct 2024
OGBench: Benchmarking Offline Goal-Conditioned RL
International Conference on Learning Representations (ICLR), 2024
Seohong Park
Kevin Frans
Benjamin Eysenbach
Sergey Levine
OffRL
521
71
0
26 Oct 2024
1
2
3
4
5
6
7
8
Next