ResearchTrend.AI
Dynamics-Aware Unsupervised Discovery of Skills

2 July 2019
Archit Sharma
S. Gu
Sergey Levine
Vikash Kumar
Karol Hausman

Papers citing "Dynamics-Aware Unsupervised Discovery of Skills"

50 / 110 papers shown
Decentralized Traffic Flow Optimization Through Intrinsic Motivation
Himaja Papala
Daniel Polani
Stas Tiomkin
4
0
0
08 May 2025
Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story
Vincenzo De Paola
Riccardo Zamboni
Mirco Mutti
Marcello Restelli
19
0
0
02 May 2025
Training a Generally Curious Agent
Fahim Tajwar
Yiding Jiang
Abitha Thankaraj
Sumaita Sadia Rahman
J. Zico Kolter
Jeff Schneider
Ruslan Salakhutdinov
123
1
0
24 Feb 2025
DHP: Discrete Hierarchical Planning for Hierarchical Reinforcement Learning Agents
Shashank Sharma
Janina Hoffmann
Vinay P. Namboodiri
93
0
0
04 Feb 2025
Dual-Force: Enhanced Offline Diversity Maximization under Imitation Constraints
Pavel Kolev
Marin Vlastelica
Georg Martius
OffRL
53
0
0
08 Jan 2025
CORD: Generalizable Cooperation via Role Diversity
Kanefumi Matsuyama
Kefan Su
Jiangxing Wang
Deheng Ye
Zongqing Lu
42
0
0
04 Jan 2025
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSL
OffRL
OnRL
59
0
0
23 Oct 2024
Exploration by Learning Diverse Skills through Successor State Measures
Paul-Antoine Le Tolguenec
Yann Besse
Florent Teichteil-Königsbuch
Dennis G. Wilson
Emmanuel Rachelson
40
0
0
14 Jun 2024
Language Guided Skill Discovery
Seungeun Rho
Laura Smith
Tianyu Li
Sergey Levine
Xue Bin Peng
Sehoon Ha
LM&Ro
42
4
0
07 Jun 2024
Text-to-Drive: Diverse Driving Behavior Synthesis via Large Language Models
Phat Nguyen
Tsun-Hsuan Wang
Zhang-Wei Hong
S. Karaman
Daniela Rus
LM&Ro
42
3
0
06 Jun 2024
Do's and Don'ts: Learning Desirable Skills with Instruction Videos
Hyunseung Kim
ByungKun Lee
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Jaegul Choo
39
1
0
01 Jun 2024
Effective Reinforcement Learning Based on Structural Information Principles
Xianghua Zeng
Hao Peng
Dingli Su
Angsheng Li
40
0
0
15 Apr 2024
Align Your Intents: Offline Imitation Learning via Optimal Transport
Maksim Bobrin
N. Buzun
Dmitrii Krylov
Dmitry Dylov
OffRL
51
3
0
20 Feb 2024
SDSRA: A Skill-Driven Skill-Recombination Algorithm for Efficient Policy Learning
Eric Hanchen Jiang
Andrew Lizarraga
34
0
0
06 Dec 2023
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
Seongun Kim
Kyowoon Lee
Jaesik Choi
SSL
DRL
41
7
0
30 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
33
34
0
13 Oct 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRL
OnRL
34
1
0
12 Oct 2023
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning
David Yunis
Justin Jung
Falcon Z. Dai
Matthew R. Walter
OffRL
47
0
0
08 Sep 2023
Language Reward Modulation for Pretraining Reinforcement Learning
Ademi Adeniji
Amber Xie
Carmelo Sferrazza
Younggyo Seo
Stephen James
Pieter Abbeel
39
26
0
23 Aug 2023
Skill Transformer: A Monolithic Policy for Mobile Manipulation
Xiaoyu Huang
Dhruv Batra
Akshara Rai
Andrew Szot
LM&Ro
38
21
0
19 Aug 2023
QDax: A Library for Quality-Diversity and Population-based Algorithms with Hardware Acceleration
Félix Chalumeau
Bryan Lim
Raphael Boige
Maxime Allard
Luca Grillotti
Manon Flageat
Valentin Macé
Arthur Flajolet
Thomas Pierrot
Antoine Cully
29
21
0
07 Aug 2023
Multi-Agent Cooperation via Unsupervised Learning of Joint Intentions
Shanqi Liu
Weiwei Liu
Wenzhou Chen
Guanzhong Tian
Y. Liu
35
0
0
05 Jul 2023
SPRINT: Scalable Policy Pre-Training via Language Instruction Relabeling
Jesse Zhang
Karl Pertsch
Jiahui Zhang
Joseph J. Lim
LM&Ro
45
17
0
20 Jun 2023
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
36
1
0
28 May 2023
Unsupervised Discovery of Continuous Skills on a Sphere
Takahisa Imagawa
Takuya Hiraoka
Yoshimasa Tsuruoka
35
0
0
21 May 2023
Efficient Quality-Diversity Optimization through Diverse Quality Species
Ryan Wickman
Bibek Poudel
Taylor Michael Villarreal
Xiaofei Zhang
Weizi Li
36
6
0
14 Apr 2023
Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning
T. Kanazawa
Chetan Gupta
29
0
0
15 Mar 2023
Neural Laplace Control for Continuous-time Delayed Systems
Samuel Holt
Alihan Huyuk
Zhaozhi Qian
Hao Sun
M. Schaar
OffRL
29
10
0
24 Feb 2023
ALAN: Autonomously Exploring Robotic Agents in the Real World
Russell Mendonca
Shikhar Bahl
Deepak Pathak
LM&Ro
36
20
0
13 Feb 2023
Predictable MDP Abstraction for Unsupervised Model-Based RL
Seohong Park
Sergey Levine
24
9
0
08 Feb 2023
Developing Driving Strategies Efficiently: A Skill-Based Hierarchical Reinforcement Learning Approach
Yigit Gurses
Kaan Buyukdemirci
Y. Yildiz
31
5
0
04 Feb 2023
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition
P. Sunehag
A. Vezhnevets
Edgar A. Duéñez-Guzmán
Igor Mordatch
Joel Z Leibo
26
2
0
02 Feb 2023
Skill Decision Transformer
Shyam Sudhakaran
S. Risi
OffRL
29
5
0
31 Jan 2023
Intrinsic Motivation in Dynamical Control Systems
Stas Tiomkin
I. Nemenman
Daniel Polani
Naftali Tishby
26
5
0
29 Dec 2022
Learning to Optimize in Model Predictive Control
Jacob Sacks
Byron Boots
27
22
0
05 Dec 2022
CIM: Constrained Intrinsic Motivation for Sparse-Reward Continuous Control
Xiang Zheng
Xingjun Ma
Cong Wang
31
1
0
28 Nov 2022
Assistive Teaching of Motor Control Tasks to Humans
Megha Srivastava
Erdem Biyik
Suvir Mirchandani
Noah D. Goodman
Dorsa Sadigh
20
6
0
25 Nov 2022
Choreographer: Learning and Adapting Skills in Imagination
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Alexandre Lacoste
Sai Rajeswar
29
22
0
23 Nov 2022
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
Daniel Jarrett
Corentin Tallec
Florent Altché
Thomas Mesnard
Rémi Munos
Michal Valko
48
5
0
18 Nov 2022
Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling
Daniel Lawson
A. H. Qureshi
24
8
0
11 Nov 2022
Emergency action termination for immediate reaction in hierarchical reinforcement learning
Michal Bortkiewicz
Jakub Lyskawa
Paweł Wawrzyński
M. Ostaszewski
Artur Grudkowski
Tomasz Trzciński
24
0
0
11 Nov 2022
Goal Exploration Augmentation via Pre-trained Skills for Sparse-Reward Long-Horizon Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
34
3
0
28 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktäschel
Edward Grefenstette
OffRL
62
9
0
23 Oct 2022
Random Actions vs Random Policies: Bootstrapping Model-Based Direct Policy Search
Elias Hanna
Alexandre Coninx
Stéphane Doncieux
OffRL
34
0
0
21 Oct 2022
Augmentative Topology Agents For Open-Ended Learning
Muhammad Umair Nasir
Michael Beukman
Steven D. James
C. Cleghorn
37
3
0
20 Oct 2022
A Mixture of Surprises for Unsupervised Reinforcement Learning
Andrew Zhao
Matthieu Lin
Yangguang Li
Yong-Jin Liu
Gao Huang
28
13
0
13 Oct 2022
Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery
Félix Chalumeau
Raphael Boige
Bryan Lim
Valentin Macé
Maxime Allard
Arthur Flajolet
Antoine Cully
Thomas Pierrot
26
21
0
06 Oct 2022
Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation
Kang Xu
Yan Ma
Bingsheng Wei
Wei Li
37
3
0
24 Sep 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
37
35
0
19 Sep 2022
Spectral Decomposition Representation for Reinforcement Learning
Tongzheng Ren
Tianjun Zhang
Lisa Lee
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
OffRL
40
27
0
19 Aug 2022