ResearchTrend.AI
Dynamics-Aware Unsupervised Discovery of Skills

2 July 2019
Archit Sharma
S. Gu
Sergey Levine
Vikash Kumar
Karol Hausman

Papers citing "Dynamics-Aware Unsupervised Discovery of Skills"

50 / 100 papers shown
Decentralized Traffic Flow Optimization Through Intrinsic Motivation
Himaja Papala
Daniel Polani
Stas Tiomkin
2
0
0
08 May 2025
Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story
Vincenzo De Paola
Riccardo Zamboni
Mirco Mutti
Marcello Restelli
19
0
0
02 May 2025
DHP: Discrete Hierarchical Planning for Hierarchical Reinforcement Learning Agents
Shashank Sharma
Janina Hoffmann
Vinay P. Namboodiri
91
0
0
04 Feb 2025
Dual-Force: Enhanced Offline Diversity Maximization under Imitation Constraints
Pavel Kolev
Marin Vlastelica
Georg Martius
OffRL
35
0
0
08 Jan 2025
CORD: Generalizable Cooperation via Role Diversity
Kanefumi Matsuyama
Kefan Su
Jiangxing Wang
Deheng Ye
Zongqing Lu
40
0
0
04 Jan 2025
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSL
OffRL
OnRL
57
0
0
23 Oct 2024
Exploration by Learning Diverse Skills through Successor State Measures
Paul-Antoine Le Tolguenec
Yann Besse
Florent Teichteil-Königsbuch
Dennis G. Wilson
Emmanuel Rachelson
40
0
0
14 Jun 2024
Language Guided Skill Discovery
Seungeun Rho
Laura Smith
Tianyu Li
Sergey Levine
Xue Bin Peng
Sehoon Ha
LM&Ro
42
4
0
07 Jun 2024
Do's and Don'ts: Learning Desirable Skills with Instruction Videos
Hyunseung Kim
ByungKun Lee
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Jaegul Choo
39
1
0
01 Jun 2024
Effective Reinforcement Learning Based on Structural Information Principles
Xianghua Zeng
Hao Peng
Dingli Su
Angsheng Li
40
0
0
15 Apr 2024
Align Your Intents: Offline Imitation Learning via Optimal Transport
Maksim Bobrin
N. Buzun
Dmitrii Krylov
Dmitry V. Dylov
OffRL
51
3
0
20 Feb 2024
SDSRA: A Skill-Driven Skill-Recombination Algorithm for Efficient Policy Learning
Eric Hanchen Jiang
Andrew Lizarraga
26
0
0
06 Dec 2023
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
Seongun Kim
Kyowoon Lee
Jaesik Choi
SSL
DRL
41
7
0
30 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
33
34
0
13 Oct 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRL
OnRL
31
1
0
12 Oct 2023
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning
David Yunis
Justin Jung
Falcon Z. Dai
Matthew R. Walter
OffRL
44
0
0
08 Sep 2023
Language Reward Modulation for Pretraining Reinforcement Learning
Ademi Adeniji
Amber Xie
Carmelo Sferrazza
Younggyo Seo
Stephen James
Pieter Abbeel
39
26
0
23 Aug 2023
Skill Transformer: A Monolithic Policy for Mobile Manipulation
Xiaoyu Huang
Dhruv Batra
Akshara Rai
Andrew Szot
LM&Ro
38
21
0
19 Aug 2023
QDax: A Library for Quality-Diversity and Population-based Algorithms with Hardware Acceleration
Félix Chalumeau
Bryan Lim
Raphael Boige
Maxime Allard
Luca Grillotti
Manon Flageat
Valentin Macé
Arthur Flajolet
Thomas Pierrot
Antoine Cully
29
21
0
07 Aug 2023
Multi-Agent Cooperation via Unsupervised Learning of Joint Intentions
Shanqi Liu
Weiwei Liu
Wenzhou Chen
Guanzhong Tian
Y. Liu
33
0
0
05 Jul 2023
SPRINT: Scalable Policy Pre-Training via Language Instruction Relabeling
Jesse Zhang
Karl Pertsch
Jiahui Zhang
Joseph J. Lim
LM&Ro
36
17
0
20 Jun 2023
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
32
1
0
28 May 2023
Efficient Quality-Diversity Optimization through Diverse Quality Species
Ryan Wickman
Bibek Poudel
Taylor Michael Villarreal
Xiaofei Zhang
Weizi Li
36
6
0
14 Apr 2023
Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning
T. Kanazawa
Chetan Gupta
29
0
0
15 Mar 2023
Neural Laplace Control for Continuous-time Delayed Systems
Samuel Holt
Alihan Huyuk
Zhaozhi Qian
Hao Sun
M. Schaar
OffRL
29
10
0
24 Feb 2023
ALAN: Autonomously Exploring Robotic Agents in the Real World
Russell Mendonca
Shikhar Bahl
Deepak Pathak
LM&Ro
36
20
0
13 Feb 2023
Developing Driving Strategies Efficiently: A Skill-Based Hierarchical Reinforcement Learning Approach
Yigit Gurses
Kaan Buyukdemirci
Y. Yildiz
31
5
0
04 Feb 2023
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition
P. Sunehag
A. Vezhnevets
Edgar A. Duéñez-Guzmán
Igor Mordatch
Joel Z Leibo
26
2
0
02 Feb 2023
Intrinsic Motivation in Dynamical Control Systems
Stas Tiomkin
I. Nemenman
Daniel Polani
Naftali Tishby
18
4
0
29 Dec 2022
CIM: Constrained Intrinsic Motivation for Sparse-Reward Continuous Control
Xiang Zheng
Xingjun Ma
Cong Wang
28
1
0
28 Nov 2022
Assistive Teaching of Motor Control Tasks to Humans
Megha Srivastava
Erdem Biyik
Suvir Mirchandani
Noah D. Goodman
Dorsa Sadigh
20
6
0
25 Nov 2022
Choreographer: Learning and Adapting Skills in Imagination
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Alexandre Lacoste
Sai Rajeswar
29
21
0
23 Nov 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Daniel Jarrett
Corentin Tallec
Florent Altché
Thomas Mesnard
Rémi Munos
Michal Valko
48
5
0
18 Nov 2022
Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling
Daniel Lawson
A. H. Qureshi
24
7
0
11 Nov 2022
Emergency action termination for immediate reaction in hierarchical reinforcement learning
Michal Bortkiewicz
Jakub Lyskawa
Pawel Wawrzyński
M. Ostaszewski
Artur Grudkowski
Tomasz Trzciński
21
0
0
11 Nov 2022
Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward Long-Horizon Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
31
3
0
28 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
55
9
0
23 Oct 2022
Random Actions vs Random Policies: Bootstrapping Model-Based Direct Policy Search
Elias Hanna
Alexandre Coninx
Stéphane Doncieux
OffRL
28
0
0
21 Oct 2022
Augmentative Topology Agents For Open-Ended Learning
Muhammad Umair Nasir
Michael Beukman
Steven D. James
C. Cleghorn
34
3
0
20 Oct 2022
A Mixture of Surprises for Unsupervised Reinforcement Learning
Andrew Zhao
Matthieu Lin
Yangguang Li
Yong-Jin Liu
Gao Huang
28
13
0
13 Oct 2022
Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery
Félix Chalumeau
Raphael Boige
Bryan Lim
Valentin Macé
Maxime Allard
Arthur Flajolet
Antoine Cully
Thomas Pierrot
26
21
0
06 Oct 2022
Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation
Kang Xu
Yan Ma
Bingsheng Wei
Wei Li
27
3
0
24 Sep 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
34
35
0
19 Sep 2022
Spectral Decomposition Representation for Reinforcement Learning
Tongzheng Ren
Tianjun Zhang
Lisa Lee
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
OffRL
40
27
0
19 Aug 2022
Learning Dynamic Manipulation Skills from Haptic-Play
Taeyoon Lee
D. Sung
Kyoung-Whan Choi
Choong-Keun Lee
Changwoo Park
Keunjun Choi
40
3
0
28 Jul 2022
Fast Population-Based Reinforcement Learning on a Single Machine
Arthur Flajolet
Claire Bizon Monroc
Karim Beguir
Thomas Pierrot
OffRL
27
10
0
17 Jun 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
28
139
0
15 Jun 2022
Meta-Learning Parameterized Skills
Haotian Fu
Shangqun Yu
Saket Tiwari
Michael Littman
George Konidaris
38
6
0
07 Jun 2022
Uniqueness and Complexity of Inverse MDP Models
Marcus Hutter
Steven Hansen
22
4
0
02 Jun 2022
Cliff Diving: Exploring Reward Surfaces in Reinforcement Learning Environments
Ryan Sullivan
J. K. Terry
Benjamin Black
John P. Dickerson
22
8
0
14 May 2022