ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.09083
  4. Cited By
Learning Goal Embeddings via Self-Play for Hierarchical Reinforcement
  Learning

Learning Goal Embeddings via Self-Play for Hierarchical Reinforcement Learning

22 November 2018
Sainbayar Sukhbaatar
Emily L. Denton
Arthur Szlam
Rob Fergus
    SSL
ArXiv (abs)PDFHTML

Papers citing "Learning Goal Embeddings via Self-Play for Hierarchical Reinforcement Learning"

19 / 19 papers shown
Title
CORD: Generalizable Cooperation via Role Diversity
CORD: Generalizable Cooperation via Role Diversity
Kanefumi Matsuyama
Kefan Su
Jiangxing Wang
Deheng Ye
Zongqing Lu
106
0
0
04 Jan 2025
Learning Goal-Conditioned Policies Offline with Self-Supervised Reward
  Shaping
Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
A. Lazaric
Alahari Karteek
OffRL
132
19
0
05 Jan 2023
Understanding the Complexity Gains of Single-Task RL with a Curriculum
Understanding the Complexity Gains of Single-Task RL with a Curriculum
Qiyang Li
Yuexiang Zhai
Yi-An Ma
Sergey Levine
116
16
0
24 Dec 2022
Discrete Factorial Representations as an Abstraction for Goal
  Conditioned Reinforcement Learning
Discrete Factorial Representations as an Abstraction for Goal Conditioned Reinforcement Learning
Riashat Islam
Hongyu Zang
Anirudh Goyal
Alex Lamb
Kenji Kawaguchi
Xin-hui Li
Romain Laroche
Yoshua Bengio
Rémi Tachet des Combes
OffRLAI4CE
99
11
0
01 Nov 2022
Walk the Random Walk: Learning to Discover and Reach Goals Without
  Supervision
Walk the Random Walk: Learning to Discover and Reach Goals Without Supervision
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
Alahari Karteek
61
4
0
23 Jun 2022
It Takes Four to Tango: Multiagent Selfplay for Automatic Curriculum
  Generation
It Takes Four to Tango: Multiagent Selfplay for Automatic Curriculum Generation
Yuqing Du
Pieter Abbeel
Aditya Grover
109
18
0
22 Feb 2022
Landmark-Guided Subgoal Generation in Hierarchical Reinforcement
  Learning
Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning
Junsup Kim
Younggyo Seo
Jinwoo Shin
121
62
0
26 Oct 2021
Eden: A Unified Environment Framework for Booming Reinforcement Learning
  Algorithms
Eden: A Unified Environment Framework for Booming Reinforcement Learning Algorithms
Ruizhi Chen
Xiaoyu Wu
Yansong Pan
Kaizhao Yuan
Ling Li
...
Shaohui Peng
Xishan Zhang
Zidong Du
Qi Guo
Yunji Chen
OffRL
61
3
0
04 Sep 2021
Multi-task curriculum learning in a complex, visual, hard-exploration
  domain: Minecraft
Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft
I. Kanitscheider
Joost Huizinga
David Farhi
William H. Guss
Brandon Houghton
...
Bowen Baker
Adrien Ecoffet
Jie Tang
Oleg Klimov
Jeff Clune
75
22
0
28 Jun 2021
Learn Goal-Conditioned Policy with Intrinsic Motivation for Deep
  Reinforcement Learning
Learn Goal-Conditioned Policy with Intrinsic Motivation for Deep Reinforcement Learning
Jinxin Liu
Donglin Wang
Qiangxing Tian
Zhengyu Chen
92
23
0
11 Apr 2021
Asymmetric self-play for automatic goal discovery in robotic
  manipulation
Asymmetric self-play for automatic goal discovery in robotic manipulation
OpenAI OpenAI
Matthias Plappert
Raul Sampedro
Tao Xu
Ilge Akkaya
...
Hyeonwoo Noh
Lilian Weng
Qiming Yuan
Casey Chu
Wojciech Zaremba
SSL
145
79
0
13 Jan 2021
RODE: Learning Roles to Decompose Multi-Agent Tasks
RODE: Learning Roles to Decompose Multi-Agent Tasks
Tonghan Wang
Tarun Gupta
Anuj Mahajan
Bei Peng
Shimon Whiteson
Chongjie Zhang
OffRL
115
211
0
04 Oct 2020
Adaptive Procedural Task Generation for Hard-Exploration Problems
Adaptive Procedural Task Generation for Hard-Exploration Problems
Kuan Fang
Yuke Zhu
Silvio Savarese
Li Fei-Fei
78
26
0
01 Jul 2020
Reinforcement Learning for Combinatorial Optimization: A Survey
Reinforcement Learning for Combinatorial Optimization: A Survey
Nina Mazyavkina
S. Sviridov
Sergei Ivanov
Evgeny Burnaev
142
627
0
07 Mar 2020
Learning to Prove Theorems by Learning to Generate Theorems
Learning to Prove Theorems by Learning to Generate Theorems
Mingzhe Wang
Jia Deng
NAI
132
50
0
17 Feb 2020
Augmenting GAIL with BC for sample efficient imitation learning
Augmenting GAIL with BC for sample efficient imitation learning
Rohit Jena
Changliu Liu
Katia Sycara
90
5
0
21 Jan 2020
Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing
  Shaped Rewards
Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards
Alexander R. Trott
Stephan Zheng
Caiming Xiong
R. Socher
125
112
0
04 Nov 2019
A survey on intrinsic motivation in reinforcement learning
A survey on intrinsic motivation in reinforcement learning
A. Aubret
L. Matignon
S. Hassas
AI4CE
112
144
0
19 Aug 2019
Self-supervised Learning of Distance Functions for Goal-Conditioned
  Reinforcement Learning
Self-supervised Learning of Distance Functions for Goal-Conditioned Reinforcement Learning
Srinivas Venkattaramanujam
Eric Crawford
T. Doan
Doina Precup
OffRLSSL
65
24
0
05 Jul 2019
1