Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research

26 February 2018
Matthias Plappert
Marcin Andrychowicz
Alex Ray
Bob McGrew
Bowen Baker
Glenn Powell
Jonas Schneider
Joshua Tobin
Maciek Chociej
Peter Welinder
Vikash Kumar
Wojciech Zaremba
arXiv: 1802.09464

Papers citing "Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research"

50 / 370 papers shown
Learning Object-Centered Autotelic Behaviors with Graph Neural Networks
Ahmed Akakzia
Olivier Sigaud
187
0
0
11 Apr 2022
Learning Purely Tactile In-Hand Manipulation with a Torque-Controlled Hand
IEEE International Conference on Robotics and Automation (ICRA), 2022
Leon Sievers
Johannes Pitz
Berthold Bäuml
194
50
0
07 Apr 2022
Automatic Parameter Optimization Using Genetic Algorithm in Deep Reinforcement Learning for Robotic Manipulation Tasks
Adarsh Sehgal
Nicholas Ward
Hung M. La
S. Louis
153
1
0
07 Apr 2022
On the Pitfalls of Heteroscedastic Uncertainty Estimation with Probabilistic Neural Networks
International Conference on Learning Representations (ICLR), 2022
Maximilian Seitzer
Arash Tavakoli
Dimitrije Antic
Georg Martius
BDL, UQCV
310
106
0
17 Mar 2022
It Takes Four to Tango: Multiagent Selfplay for Automatic Curriculum Generation
International Conference on Learning Representations (ICLR), 2022
Yuqing Du
Pieter Abbeel
Aditya Grover
229
20
0
22 Feb 2022
Help Me Explore: Minimal Social Interventions for Graph-Based Autotelic Agents
Ahmed Akakzia
Olivier Serris
Olivier Sigaud
Cédric Colas
168
6
0
10 Feb 2022
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL
International Conference on Learning Representations (ICLR), 2022
Rui Yang
Yiming Lu
Wenzhe Li
Hao Sun
Meng Fang
Yali Du
Xiu Li
Lei Han
Chongjie Zhang
OffRL
337
88
0
09 Feb 2022
Lipschitz-constrained Unsupervised Skill Discovery
International Conference on Learning Representations (ICLR), 2022
Seohong Park
Jongwook Choi
Jaekyeom Kim
Honglak Lee
Gunhee Kim
291
63
0
02 Feb 2022
Do You Need the Entropy Reward (in Practice)?
Haonan Yu
Haichao Zhang
Wei Xu
191
12
0
28 Jan 2022
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning
International Conference on Learning Representations (ICLR), 2022
Haichao Zhang
Wei Xu
Haonan Yu
246
11
0
24 Jan 2022
The Paradox of Choice: Using Attention in Hierarchical Reinforcement Learning
A. Nica
Khimya Khetarpal
Doina Precup
138
5
0
24 Jan 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Minghuan Liu
Menghui Zhu
Weinan Zhang
350
182
0
20 Jan 2022
Priors, Hierarchy, and Information Asymmetry for Skill Transfer in Reinforcement Learning
International Conference on Learning Representations (ICLR), 2022
Sasha Salter
Kristian Hartikainen
Walter Goodwin
Ingmar Posner
OffRL
207
5
0
20 Jan 2022
Benchmarking Deep Reinforcement Learning Algorithms for Vision-based Robotics
Swagat Kumar
Hayden Sampson
Ardhendu Behera
125
0
0
11 Jan 2022
Bayesian Optimization of Function Networks
Neural Information Processing Systems (NeurIPS), 2021
Raul Astudillo
P. Frazier
214
42
0
31 Dec 2021
Adaptive Multi-Goal Exploration
Jean Tarbouriech
O. D. Domingues
Pierre Ménard
Matteo Pirotta
Michal Valko
A. Lazaric
294
4
0
23 Nov 2021
Learning Provably Robust Motion Planners Using Funnel Libraries
Alim Gurgen
Anirudha Majumdar
Sushant Veer
OOD
139
3
0
16 Nov 2021
Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning
Wenlong Huang
Igor Mordatch
Pieter Abbeel
Deepak Pathak
281
71
0
04 Nov 2021
Adjacency constraint for efficient hierarchical reinforcement learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Tianren Zhang
Shangqi Guo
Tian Tan
Xiao M Hu
Feng Chen
443
22
0
30 Oct 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
IEEE Access, 2021
Tung M. Luu
Chang D. Yoo
165
12
0
28 Oct 2021
Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2021
Kibeom Kim
Min Whoo Lee
Yoonsung Kim
Je-hwan Ryu
Minsu Lee
Byoung-Tak Zhang
183
9
0
25 Oct 2021
CORA: Benchmarks, Baselines, and Metrics as a Platform for Continual Reinforcement Learning Agents
Sam Powers
Eliot Xing
Eric Kolve
Roozbeh Mottaghi
Abhinav Gupta
OffRL
333
44
0
19 Oct 2021
Imaginary Hindsight Experience Replay: Curious Model-based Learning for Sparse Reward Tasks
Robert McCarthy
Qiang Wang
S. Redmond
OffRL
251
16
0
05 Oct 2021
Sim and Real: Better Together
Shirli Di-Castro Shashua
Dotan DiCastro
Shie Mannor
221
12
0
01 Oct 2021
Solving the Real Robot Challenge using Deep Reinforcement Learning
Robert McCarthy
Francisco Roldan Sanchez
Qiang Wang
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
S. Redmond
355
11
0
30 Sep 2021
Density-based Curriculum for Multi-goal Reinforcement Learning with Sparse Rewards
Deyu Yang
Hanbo Zhang
Xuguang Lan
Jishiyu Ding
OffRL
209
2
0
18 Sep 2021
Learning Visual Feedback Control for Dynamic Cloth Folding
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021
Julius Hietala
David Blanco Mulero
G. Alcan
Ville Kyrki
277
38
0
10 Sep 2021
Explainable Reinforcement Learning for Broad-XAI: A Conceptual Framework and Survey
Richard Dazeley
Peter Vamplew
Francisco Cruz
198
74
0
20 Aug 2021
Diversity-based Trajectory and Goal Selection with Hindsight Experience Replay
Tianhong Dai
Hengyan Liu
Kai Arulkumaran
Guangyu Ren
Anil Anthony Bharath
179
11
0
17 Aug 2021
Reward-Weighted Regression Converges to a Global Optimum
AAAI Conference on Artificial Intelligence (AAAI), 2021
M. Strupl
Francesco Faccio
Dylan R. Ashley
R. Srivastava
Jürgen Schmidhuber
139
5
0
19 Jul 2021
PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided Exploration
International Conference on Machine Learning (ICML), 2021
Yuda Song
Wen Sun
253
23
0
15 Jul 2021
Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
Sungryull Sohn
Sungtae Lee
Jongwook Choi
H. V. Seijen
Mehdi Fatemi
Honglak Lee
595
8
0
13 Jul 2021
Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding
Grace Zhang
Li Zhong
Youngwoon Lee
Joseph J. Lim
184
16
0
01 Jul 2021
panda-gym: Open-source goal-conditioned environments for robotic learning
Quentin Gallouedec
Nicolas Cazin
Emmanuel Dellandrea
Liming Chen
OffRL
148
99
0
25 Jun 2021
Goal-Directed Planning by Reinforcement Learning and Active Inference
Dongqi Han
Kenji Doya
Jun Tani
138
2
0
18 Jun 2021
Unbiased Methods for Multi-Goal Reinforcement Learning
Léonard Blier
Yann Ollivier
OffRL
126
6
0
16 Jun 2021
Variational Policy Search using Sparse Gaussian Process Priors for Learning Multimodal Optimal Actions
Neural Networks (NN), 2021
Hikaru Sasaki
Takamitsu Matsubara
132
8
0
14 Jun 2021
Quickest change detection with unknown parameters: Constant complexity and near optimality
Firas Jarboui
Vianney Perchet
100
0
0
09 Jun 2021
Causal Influence Detection for Improving Efficiency in Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2021
Maximilian Seitzer
Bernhard Schölkopf
Georg Martius
CML
283
95
0
07 Jun 2021
Variational Empowerment as Representation Learning for Goal-Based Reinforcement Learning
Jongwook Choi
Archit Sharma
Honglak Lee
Sergey Levine
S. Gu
DRL
211
23
0
02 Jun 2021
A Generalised Inverse Reinforcement Learning Framework
Firas Jarboui
Vianney Perchet
105
4
0
25 May 2021
An Open-Source Multi-Goal Reinforcement Learning Environment for Robotic Manipulation with Pybullet
Towards Autonomous Robotic Systems (TAROS), 2021
Xintong Yang
Ze Ji
Jing Wu
Yu-kun Lai
116
21
0
12 May 2021
Generative Actor-Critic: An Off-policy Algorithm Using the Push-forward Model
Lingwei Peng
Hui Qian
Zhebang Shen
Chao Zhang
Fei Li
135
2
0
08 May 2021
DisCo RL: Distribution-Conditioned Reinforcement Learning for General-Purpose Policies
IEEE International Conference on Robotics and Automation (ICRA), 2021
Soroush Nasiriany
Vitchyr H. Pong
Ashvin Nair
Alexander Khazatsky
Glen Berseth
Sergey Levine
OffRL
290
15
0
23 Apr 2021
Outcome-Driven Reinforcement Learning via Variational Inference
Neural Information Processing Systems (NeurIPS), 2021
Tim G. J. Rudner
Vitchyr H. Pong
R. McAllister
Y. Gal
Sergey Levine
291
22
0
20 Apr 2021
TAAC: Temporally Abstract Actor-Critic for Continuous Control
Neural Information Processing Systems (NeurIPS), 2021
Haonan Yu
Wei Xu
Haichao Zhang
OffRL
220
26
0
13 Apr 2021
Subgoal-based Reward Shaping to Improve Efficiency in Reinforcement Learning
IEEE Access, 2021
Takato Okudo
Seiji Yamada
OffRL
127
23
0
13 Apr 2021
Reward Shaping with Dynamic Trajectory Aggregation
IEEE International Joint Conference on Neural Networks (IJCNN), 2021
Takato Okudo
Seiji Yamada
81
2
0
13 Apr 2021
Mutual Information State Intrinsic Control
International Conference on Learning Representations (ICLR), 2021
Rui Zhao
Yang Gao
Pieter Abbeel
Volker Tresp
Wenyuan Xu
SSL
142
25
0
15 Mar 2021
Learning One Representation to Optimize All Rewards
Neural Information Processing Systems (NeurIPS), 2021
Ahmed Touati
Yann Ollivier
OffRL
340
85
0
14 Mar 2021