arXiv:1811.11359
Unsupervised Control Through Non-Parametric Discriminative Rewards
28 November 2018
David Warde-Farley
T. Wiele
Tejas D. Kulkarni
Catalin Ionescu
S. Hansen
Volodymyr Mnih
DRL
OffRL
SSL
Papers citing "Unsupervised Control Through Non-Parametric Discriminative Rewards"
40 / 40 papers shown
Title
Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation
Shaopeng Zhai
Jie Wang
Tianyi Zhang
Fuxian Huang
Qi Zhang
Ming Zhou
Jing Hou
Yu Qiao
Yu Liu
LLMAG
LM&Ro
31
1
0
12 Dec 2023
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
Seongun Kim
Kyowoon Lee
Jaesik Choi
SSL
DRL
41
7
0
30 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
33
34
0
13 Oct 2023
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning
David Yunis
Justin Jung
Falcon Z. Dai
Matthew R. Walter
OffRL
35
0
0
08 Sep 2023
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
Hongyu Ding
Yuan-Yan Tang
Qing Wu
Bo Wang
Chunlin Chen
Zhi Wang
32
4
0
16 Jul 2023
Augmenting Autotelic Agents with Large Language Models
Cédric Colas
Laetitia Teodorescu
Pierre-Yves Oudeyer
Xingdi Yuan
Marc-Alexandre Côté
LLMAG
LM&Ro
28
22
0
21 May 2023
Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
A. Lazaric
Alahari Karteek
OffRL
36
18
0
05 Jan 2023
Learning Robotic Navigation from Experience: Principles, Methods, and Recent Results
Sergey Levine
Dhruv Shah
SSL
24
21
0
13 Dec 2022
Learning on the Job: Self-Rewarding Offline-to-Online Finetuning for Industrial Insertion of Novel Connectors from Vision
Ashvin Nair
Brian Zhu
Gokul Narayanan
Eugen Solowjow
Sergey Levine
OffRL
OnRL
25
14
0
27 Oct 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
31
35
0
19 Sep 2022
Walk the Random Walk: Learning to Discover and Reach Goals Without Supervision
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
Alahari Karteek
29
4
0
23 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
22
68
0
16 Jun 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
25
137
0
15 Jun 2022
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
Zhiwei Xu
Bin Zhang
Dapeng Li
Zeren Zhang
Guangchong Zhou
Hao Chen
Guoliang Fan
16
14
0
06 Jun 2022
Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning
Philippe Hansen-Estruch
Amy Zhang
Ashvin Nair
Patrick Yin
Sergey Levine
AI4CE
23
27
0
27 Apr 2022
When to Go, and When to Explore: The Benefit of Post-Exploration in Intrinsic Motivation
Zhao Yang
Thomas M. Moerland
Mike Preuss
Aske Plaat
21
1
0
29 Mar 2022
Goal-directed Planning and Goal Understanding by Active Inference: Evaluation Through Simulated and Physical Robot Experiments
Takazumi Matsumoto
Wataru Ohata
Fabien C. Y. Benureau
Jun Tani
16
11
0
21 Feb 2022
Open-Ended Reinforcement Learning with Neural Reward Functions
Robert Meier
Asier Mujika
37
7
0
16 Feb 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions
Minghuan Liu
Menghui Zhu
Weinan Zhang
24
131
0
20 Jan 2022
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
Dhruv Shah
Peng-Tao Xu
Yao Lu
Ted Xiao
Alexander Toshev
Sergey Levine
Brian Ichter
OffRL
29
41
0
04 Nov 2021
TRAIL: Near-Optimal Imitation Learning with Suboptimal Data
Mengjiao Yang
Sergey Levine
Ofir Nachum
OffRL
32
42
0
27 Oct 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching
Pierre-Alexandre Kamienny
Jean Tarbouriech
Sylvain Lamprier
A. Lazaric
Ludovic Denoyer
SSL
36
18
0
27 Oct 2021
The Information Geometry of Unsupervised Reinforcement Learning
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
53
31
0
06 Oct 2021
APS: Active Pretraining with Successor Features
Hao Liu
Pieter Abbeel
31
118
0
31 Aug 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
26
66
0
08 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
25
6
0
07 Jul 2021
Discovering Generalizable Skills via Automated Generation of Diverse Tasks
Kuan Fang
Yuke Zhu
Silvio Savarese
Li Fei-Fei
38
6
0
26 Jun 2021
Behavior From the Void: Unsupervised Active Pre-Training
Hao Liu
Pieter Abbeel
VLM
SSL
34
195
0
08 Mar 2021
Relative Variational Intrinsic Control
Kate Baumli
David Warde-Farley
S. Hansen
Volodymyr Mnih
10
42
0
14 Dec 2020
Self-supervised Visual Reinforcement Learning with Object-centric Representations
Andrii Zadaianchuk
Maximilian Seitzer
Georg Martius
SSL
OCL
16
41
0
29 Nov 2020
Masked Contrastive Representation Learning for Reinforcement Learning
Jinhua Zhu
Yingce Xia
Lijun Wu
Jiajun Deng
Wen-gang Zhou
Tao Qin
Houqiang Li
SSL
OffRL
31
55
0
15 Oct 2020
RODE: Learning Roles to Decompose Multi-Agent Tasks
Tonghan Wang
Tarun Gupta
Anuj Mahajan
Bei Peng
Shimon Whiteson
Chongjie Zhang
OffRL
16
202
0
04 Oct 2020
Plan2Vec: Unsupervised Representation Learning by Latent Plans
Ge Yang
Amy Zhang
Ari S. Morcos
Joelle Pineau
Pieter Abbeel
Roberto Calandra
SSL
OffRL
21
27
0
07 May 2020
Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill Discovery
Kristian Hartikainen
Xinyang Geng
Tuomas Haarnoja
Sergey Levine
SSL
38
74
0
18 Jul 2019
Unsupervised State Representation Learning in Atari
Ankesh Anand
Evan Racah
Sherjil Ozair
Yoshua Bengio
Marc-Alexandre Côté
R. Devon Hjelm
SSL
27
253
0
19 Jun 2019
Fast Task Inference with Variational Intrinsic Successor Features
S. Hansen
Will Dabney
André Barreto
T. Wiele
David Warde-Farley
Volodymyr Mnih
BDL
16
151
0
12 Jun 2019
Skew-Fit: State-Covering Self-Supervised Reinforcement Learning
Vitchyr H. Pong
Murtaza Dalal
Steven Lin
Ashvin Nair
Shikhar Bahl
Sergey Levine
OffRL
SSL
23
269
0
08 Mar 2019
CLIC: Curriculum Learning and Imitation for object Control in non-rewarding environments
Pierre Fournier
Olivier Sigaud
Cédric Colas
Mohamed Chetouani
OffRL
19
26
0
28 Jan 2019
Provably Efficient Maximum Entropy Exploration
Elad Hazan
Sham Kakade
Karan Singh
A. V. Soest
20
292
0
06 Dec 2018
Determinantal point processes for machine learning
Alex Kulesza
B. Taskar
162
1,122
0
25 Jul 2012