Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.08225
Cited By
Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill Discovery
18 July 2019
Kristian Hartikainen
Xinyang Geng
Tuomas Haarnoja
Sergey Levine
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill Discovery"
50 / 56 papers shown
Title
Subtask-Aware Visual Reward Learning from Segmented Demonstrations
Changyeon Kim
Minho Heo
Doohyun Lee
Jinwoo Shin
Honglak Lee
Joseph J. Lim
Kimin Lee
37
0
0
28 Feb 2025
Episodic Novelty Through Temporal Distance
Y. Jiang
Qihan Liu
Yiqin Yang
Xiaoteng Ma
Dianyu Zhong
...
Jun Yang
Bin Liang
Bo Xu
Chongjie Zhang
Qianchuan Zhao
OffRL
30
0
0
28 Jan 2025
Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement Learning
Yuanlin Duan
Guofeng Cui
He Zhu
OffRL
29
0
0
03 Nov 2024
A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals
Grace Liu
Michael Tang
Benjamin Eysenbach
OffRL
40
0
0
11 Aug 2024
Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making
Vivek Myers
Chongyi Zheng
Anca Dragan
Sergey Levine
Benjamin Eysenbach
OffRL
38
7
0
24 Jun 2024
Rank2Reward: Learning Shaped Reward Functions from Passive Video
Daniel Yang
Davin Tjia
Jacob Berg
Dima Damen
Pulkit Agrawal
Abhishek Gupta
OffRL
25
4
0
23 Apr 2024
MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint
Xinglin Zhou
Yifu Yuan
Shaofu Yang
Jianye Hao
32
1
0
22 Feb 2024
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems
Changshuo Zhang
Sirui Chen
Xiao Zhang
Sunhao Dai
Weijie Yu
Jun Xu
OffRL
33
1
0
17 Jan 2024
Explicit-Implicit Subgoal Planning for Long-Horizon Tasks with Sparse Reward
Fangyuan Wang
Anqing Duan
Peng Zhou
Shengzeng Huo
Guodong Guo
Chenguang Yang
D. Navarro-Alarcon
OffRL
VLM
33
0
0
25 Dec 2023
SEGO: Sequential Subgoal Optimization for Mathematical Problem-Solving
Xueliang Zhao
Xinting Huang
Wei Bi
Lingpeng Kong
LRM
46
0
0
19 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
33
34
0
13 Oct 2023
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
Hongyu Ding
Yuan-Yan Tang
Qing Wu
Bo Wang
Chunlin Chen
Zhi Wang
32
4
0
16 Jul 2023
ViNT: A Foundation Model for Visual Navigation
Dhruv Shah
A. Sridhar
Nitish Dashora
Kyle Stachowicz
Kevin Black
Noriaki Hirose
Sergey Levine
LM&Ro
16
133
0
26 Jun 2023
Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving
Xueliang Zhao
Wenda Li
Lingpeng Kong
30
28
0
25 May 2023
Learning Achievement Structure for Structured Exploration in Domains with Sparse Reward
Zihan Zhou
Animesh Garg
OffRL
14
3
0
30 Apr 2023
Distance Weighted Supervised Learning for Offline Interaction Data
Joey Hejna
Jensen Gao
Dorsa Sadigh
OffRL
36
12
0
26 Apr 2023
Planning Goals for Exploration
E. Hu
Richard Chang
Oleh Rybkin
Dinesh Jayaraman
35
24
0
23 Mar 2023
Replay Buffer with Local Forgetting for Adapting to Local Environment Changes in Deep Model-Based Reinforcement Learning
Ali Rahimi-Kalahroudi
Janarthanan Rajendran
Ida Momennejad
H. V. Seijen
Sarath Chandar
CLL
KELM
26
2
0
15 Mar 2023
Policy Dispersion in Non-Markovian Environment
B. Qu
Xiaofeng Cao
Jielong Yang
Hechang Chen
Chang Yi
Ivor W.Tsang
Yew-Soon Ong
9
0
0
28 Feb 2023
Diverse Policy Optimization for Structured Action Space
Wenhao Li
Baoxiang Wang
Shanchao Yang
H. Zha
OffRL
19
1
0
23 Feb 2023
ALAN: Autonomously Exploring Robotic Agents in the Real World
Russell Mendonca
Shikhar Bahl
Deepak Pathak
LM&Ro
13
20
0
13 Feb 2023
Outcome-directed Reinforcement Learning by Uncertainty & Temporal Distance-Aware Curriculum Goal Generation
Daesol Cho
Seungjae Lee
H. J. Kim
23
14
0
27 Jan 2023
Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
A. Lazaric
Alahari Karteek
OffRL
34
18
0
05 Jan 2023
Understanding the Complexity Gains of Single-Task RL with a Curriculum
Qiyang Li
Yuexiang Zhai
Yi-An Ma
Sergey Levine
32
14
0
24 Dec 2022
PALMER: Perception-Action Loop with Memory for Long-Horizon Planning
Onur Beker
Mohammad Mohammadi
Amir Zamir
21
3
0
08 Dec 2022
Learning Reward Functions for Robotic Manipulation by Observing Humans
Minttu Alakuijala
Gabriel Dulac-Arnold
Julien Mairal
Jean Ponce
Cordelia Schmid
OffRL
24
26
0
16 Nov 2022
Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward Long-Horizon Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
26
3
0
28 Oct 2022
Reachability-Aware Laplacian Representation in Reinforcement Learning
Kaixin Wang
Kuangqi Zhou
Jiashi Feng
Bryan Hooi
Xinchao Wang
26
2
0
24 Oct 2022
Winner Takes It All: Training Performant RL Populations for Combinatorial Optimization
Nathan Grinsztajn
Daniel Furelos-Blanco
Shikha Surana
Clément Bonnet
Thomas D. Barrett
52
28
0
07 Oct 2022
Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation
Kang Xu
Yan Ma
Bingsheng Wei
Wei Li
21
3
0
24 Sep 2022
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action
Dhruv Shah
B. Osinski
Brian Ichter
Sergey Levine
LM&Ro
139
436
0
10 Jul 2022
Walk the Random Walk: Learning to Discover and Reach Goals Without Supervision
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
Alahari Karteek
29
4
0
23 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
22
67
0
16 Jun 2022
Deep Hierarchical Planning from Pixels
Danijar Hafner
Kuang-Huei Lee
Ian S. Fischer
Pieter Abbeel
28
91
0
08 Jun 2022
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization
Zihan Zhou
Wei Fu
Bingliang Zhang
Yi Wu
15
28
0
04 Apr 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions
Minghuan Liu
Menghui Zhu
Weinan Zhang
22
131
0
20 Jan 2022
Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation
Todor Davchev
Oleg O. Sushkov
Jean-Baptiste Regli
S. Schaal
Y. Aytar
Markus Wulfmeier
Jonathan Scholz
16
18
0
01 Dec 2021
Adaptive Multi-Goal Exploration
Jean Tarbouriech
O. D. Domingues
Pierre Ménard
Matteo Pirotta
Michal Valko
A. Lazaric
18
2
0
23 Nov 2021
Automatic Goal Generation using Dynamical Distance Learning
Bharat Prakash
Nicholas R. Waytowich
T. Mohsenin
Tim Oates
17
2
0
07 Nov 2021
Adjacency constraint for efficient hierarchical reinforcement learning
Tianren Zhang
Shangqi Guo
Tian Tan
Xiao M Hu
Feng Chen
10
17
0
30 Oct 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching
Pierre-Alexandre Kamienny
Jean Tarbouriech
Sylvain Lamprier
A. Lazaric
Ludovic Denoyer
SSL
36
18
0
27 Oct 2021
Unsupervised Domain Adaptation with Dynamics-Aware Rewards in Reinforcement Learning
Jinxin Liu
Hao Shen
Donglin Wang
Yachen Kang
Qiangxing Tian
22
19
0
25 Oct 2021
Discovering and Achieving Goals via World Models
Russell Mendonca
Oleh Rybkin
Kostas Daniilidis
Danijar Hafner
Deepak Pathak
25
117
0
18 Oct 2021
Adversarial Intrinsic Motivation for Reinforcement Learning
Ishan Durugkar
Mauricio Tec
S. Niekum
Peter Stone
OOD
26
36
0
27 May 2021
MapGo: Model-Assisted Policy Optimization for Goal-Oriented Tasks
Menghui Zhu
Minghuan Liu
Jian Shen
Zhicheng Zhang
Sheng Chen
Weinan Zhang
Deheng Ye
Yong Yu
Qiang Fu
Wei Yang
31
22
0
13 May 2021
Learn Goal-Conditioned Policy with Intrinsic Motivation for Deep Reinforcement Learning
Jinxin Liu
Donglin Wang
Qiangxing Tian
Zhengyu Chen
19
23
0
11 Apr 2021
Learning Continuous Cost-to-Go Functions for Non-holonomic Systems
Jinwook Huh
Daniel D. Lee
Volkan Isler
26
3
0
20 Mar 2021
Model-Based Visual Planning with Self-Supervised Functional Distances
Stephen Tian
Suraj Nair
F. Ebert
Sudeep Dasari
Benjamin Eysenbach
Chelsea Finn
Sergey Levine
SSL
OffRL
160
58
0
30 Dec 2020
Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey
Cédric Colas
Tristan Karch
Olivier Sigaud
Pierre-Yves Oudeyer
29
84
0
17 Dec 2020
ViNG: Learning Open-World Navigation with Visual Goals
Dhruv Shah
Benjamin Eysenbach
G. Kahn
Nicholas Rhinehart
Sergey Levine
14
89
0
17 Dec 2020
1
2
Next