Unsupervised Control Through Non-Parametric Discriminative Rewards

28 November 2018

Papers citing "Unsupervised Control Through Non-Parametric Discriminative Rewards"

Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation Shaopeng Zhai Jie Wang Tianyi Zhang Fuxian Huang Qi Zhang Ming Zhou Jing Hou Yu Qiao Yu Liu LLMAG LM&Ro 31 1 0 12 Dec 2023
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills Seongun Kim Kyowoon Lee Jaesik Choi SSL DRL 41 7 0 30 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction Seohong Park Oleh Rybkin Sergey Levine OffRL 33 34 0 13 Oct 2023
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning David Yunis Justin Jung Falcon Z. Dai Matthew R. Walter OffRL 35 0 0 08 Sep 2023
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning Hongyu Ding Yuan-Yan Tang Qing Wu Bo Wang Chunlin Chen Zhi Wang 32 4 0 16 Jul 2023
Augmenting Autotelic Agents with Large Language Models Cédric Colas Laetitia Teodorescu Pierre-Yves Oudeyer Xingdi Yuan Marc-Alexandre Côté LLMAG LM&Ro 28 22 0 21 May 2023
Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping Lina Mezghani Sainbayar Sukhbaatar Piotr Bojanowski A. Lazaric Alahari Karteek OffRL 36 18 0 05 Jan 2023
Learning Robotic Navigation from Experience: Principles, Methods, and Recent Results Sergey Levine Dhruv Shah SSL 24 21 0 13 Dec 2022
Learning on the Job: Self-Rewarding Offline-to-Online Finetuning for Industrial Insertion of Novel Connectors from Vision Ashvin Nair Brian Zhu Gokul Narayanan Eugen Solowjow Sergey Levine OffRL OnRL 25 14 0 27 Oct 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey A. Aubret L. Matignon S. Hassas 31 35 0 19 Sep 2022
Walk the Random Walk: Learning to Discover and Reach Goals Without Supervision Lina Mezghani Sainbayar Sukhbaatar Piotr Bojanowski Alahari Karteek 29 4 0 23 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction Z. Guo S. Thakoor Miruna Pislar Bernardo Avila-Pires Florent Altché ... Yunhao Tang Michal Valko Rémi Munos M. G. Azar Bilal Piot 22 68 0 16 Jun 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning Benjamin Eysenbach Tianjun Zhang Ruslan Salakhutdinov Sergey Levine SSL OffRL 25 137 0 15 Jun 2022
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning Zhiwei Xu Bin Zhang Dapeng Li Zeren Zhang Guangchong Zhou Hao Chen Guoliang Fan 16 14 0 06 Jun 2022
Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning Philippe Hansen-Estruch Amy Zhang Ashvin Nair Patrick Yin Sergey Levine AI4CE 23 27 0 27 Apr 2022
When to Go, and When to Explore: The Benefit of Post-Exploration in Intrinsic Motivation Zhao Yang Thomas M. Moerland Mike Preuss Aske Plaat 21 1 0 29 Mar 2022
Goal-directed Planning and Goal Understanding by Active Inference: Evaluation Through Simulated and Physical Robot Experiments Takazumi Matsumoto Wataru Ohata Fabien C. Y. Benureau Jun Tani 16 11 0 21 Feb 2022
Open-Ended Reinforcement Learning with Neural Reward Functions Robert Meier Asier Mujika 37 7 0 16 Feb 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions Minghuan Liu Menghui Zhu Weinan Zhang 24 131 0 20 Jan 2022
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning Dhruv Shah Peng-Tao Xu Yao Lu Ted Xiao Alexander Toshev Sergey Levine Brian Ichter OffRL 29 41 0 04 Nov 2021
TRAIL: Near-Optimal Imitation Learning with Suboptimal Data Mengjiao Yang Sergey Levine Ofir Nachum OffRL 32 42 0 27 Oct 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching Pierre-Alexandre Kamienny Jean Tarbouriech Sylvain Lamprier A. Lazaric Ludovic Denoyer SSL 36 18 0 27 Oct 2021
The Information Geometry of Unsupervised Reinforcement Learning Benjamin Eysenbach Ruslan Salakhutdinov Sergey Levine SSL OffRL 53 31 0 06 Oct 2021
APS: Active Pretraining with Successor Features Hao Liu Pieter Abbeel 31 118 0 31 Aug 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision Vitchyr H. Pong Ashvin Nair Laura M. Smith Catherine Huang Sergey Levine OffRL 26 66 0 08 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research J. Luis E. Crawley B. Cameron OffRL 25 6 0 07 Jul 2021
Discovering Generalizable Skills via Automated Generation of Diverse Tasks Kuan Fang Yuke Zhu Silvio Savarese Li Fei-Fei 38 6 0 26 Jun 2021
Behavior From the Void: Unsupervised Active Pre-Training Hao Liu Pieter Abbeel VLM SSL 34 195 0 08 Mar 2021
Relative Variational Intrinsic Control Kate Baumli David Warde-Farley S. Hansen Volodymyr Mnih 10 42 0 14 Dec 2020
Self-supervised Visual Reinforcement Learning with Object-centric Representations Andrii Zadaianchuk Maximilian Seitzer Georg Martius SSL OCL 16 41 0 29 Nov 2020
Masked Contrastive Representation Learning for Reinforcement Learning Jinhua Zhu Yingce Xia Lijun Wu Jiajun Deng Wen-gang Zhou Tao Qin Houqiang Li SSL OffRL 31 55 0 15 Oct 2020
RODE: Learning Roles to Decompose Multi-Agent Tasks Tonghan Wang Tarun Gupta Anuj Mahajan Bei Peng Shimon Whiteson Chongjie Zhang OffRL 16 202 0 04 Oct 2020
Plan2Vec: Unsupervised Representation Learning by Latent Plans Ge Yang Amy Zhang Ari S. Morcos Joelle Pineau Pieter Abbeel Roberto Calandra SSL OffRL 21 27 0 07 May 2020
Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill Discovery Kristian Hartikainen Xinyang Geng Tuomas Haarnoja Sergey Levine SSL 38 74 0 18 Jul 2019
Unsupervised State Representation Learning in Atari Ankesh Anand Evan Racah Sherjil Ozair Yoshua Bengio Marc-Alexandre Côté R. Devon Hjelm SSL 27 253 0 19 Jun 2019
Fast Task Inference with Variational Intrinsic Successor Features S. Hansen Will Dabney André Barreto T. Wiele David Warde-Farley Volodymyr Mnih BDL 16 151 0 12 Jun 2019
Skew-Fit: State-Covering Self-Supervised Reinforcement Learning Vitchyr H. Pong Murtaza Dalal Steven Lin Ashvin Nair Shikhar Bahl Sergey Levine OffRL SSL 23 269 0 08 Mar 2019
CLIC: Curriculum Learning and Imitation for object Control in non-rewarding environments Pierre Fournier Olivier Sigaud Cédric Colas Mohamed Chetouani OffRL 19 26 0 28 Jan 2019
Provably Efficient Maximum Entropy Exploration Elad Hazan Sham Kakade Karan Singh A. V. Soest 20 292 0 06 Dec 2018
Determinantal point processes for machine learning Alex Kulesza B. Taskar 162 1,122 0 25 Jul 2012