v1v2v3 (latest)

#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

15 November 2016

Pieter Abbeel

Papers citing "#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning"

50 / 466 papers shown

Title
Curiosity in Hindsight: Intrinsic Exploration in Stochastic EnvironmentsInternational Conference on Machine Learning (ICML), 2022 Daniel Jarrett Corentin Tallec Florent Altché Thomas Mesnard Rémi Munos Michal Valko 189 6 0 18 Nov 2022
Foundation Models for Semantic Novelty in Reinforcement Learning Tarun Gupta Peter Karkus Tong Che Danfei Xu Marco Pavone VLM OffRL LRM 110 9 0 09 Nov 2022
Pretraining in Deep Reinforcement Learning: A Survey Zhihui Xie Zichuan Lin Junyou Li Shuai Li Deheng Ye OffRL OnRL AI4CE 158 30 0 08 Nov 2022
Curiosity-Driven Multi-Agent Exploration with Mixed Objectives Roben Delos Reyes Kyunghwan Son Jinhwan Jung Wan Ju Kang Yung Yi 151 5 0 29 Oct 2022
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning Henrique Donancio L. Vercouter H. Roclawski AI4CE 279 1 0 20 Oct 2022
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample ComplexityNeural Information Processing Systems (NeurIPS), 2022 Abhishek Gupta Aldo Pacchiano Yuexiang Zhai Sham Kakade Sergey Levine OffRL 189 91 0 18 Oct 2022
Symbol Guided Hindsight Priors for Reward Learning from Human Preferences Mudit Verma Katherine Metcalf 156 9 0 17 Oct 2022
Exploration via Elliptical Episodic BonusesNeural Information Processing Systems (NeurIPS), 2022 Mikael Henaff Roberta Raileanu Minqi Jiang Tim Rocktaschel OffRL 235 52 0 11 Oct 2022
LECO: Learnable Episodic Count for Task-Specific Intrinsic RewardNeural Information Processing Systems (NeurIPS), 2022 DaeJin Jo Sungwoong Kim D. W. Nam Taehwan Kwon Seungeun Rho Jongmin Kim Donghoon Lee OffRL 113 11 0 11 Oct 2022
Reward-Mixing MDPs with a Few Latent Contexts are Learnable Jeongyeol Kwon Yonathan Efroni Constantine Caramanis Shie Mannor 170 5 0 05 Oct 2022
Query The Agent: Improving sample efficiency through epistemic uncertainty estimation Julian Alverio Boris Katz Andrei Barbu 177 1 0 05 Oct 2022
Learning Minimally-Violating Continuous Control for Infeasible Linear Temporal Logic SpecificationsAmerican Control Conference (ACC), 2022 Mingyu Cai Makai Mann Zachary Serlin Kevin J. Leahy C. Vasile 302 14 0 03 Oct 2022
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel StatesIEEE Conference on Decision and Control (CDC), 2022 C. Banerjee Zhiyong Chen N. Noman 148 6 0 01 Oct 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey A. Aubret L. Matignon S. Hassas 214 50 0 19 Sep 2022
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning Mingqi Yuan Bo Li Xin Jin Wenjun Zeng 337 12 0 19 Sep 2022
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping Hao Sun Lei Han Rui Yang Xiaoteng Ma Jian Guo Bolei Zhou OffRL OnRL 159 11 0 15 Sep 2022
Self-supervised Sequential Information Bottleneck for Robust Exploration in Deep Reinforcement Learning Bang You Jingming Xie Youping Chen Jan Peters Oleg Arenz 98 3 0 12 Sep 2022
Go-Explore Complex 3D Game Environments for Automated Reachability TestingIEEE Transactions on Games (IEEE Trans. Games), 2022 Cong Lu Raluca Georgescu J. Verwey 131 8 0 01 Sep 2022
Normality-Guided Distributional Reinforcement Learning for Continuous Control Ju-Seung Byun Andrew Perrault OffRL 283 1 0 28 Aug 2022
Unsupervised Representation Learning in Deep Reinforcement Learning: A Review N. Botteghi M. Poel C. Brune SSL OffRL 305 22 0 27 Aug 2022
A Review of Uncertainty for Deep Reinforcement LearningArtificial Intelligence and Interactive Digital Entertainment Conference (AIIDE), 2022 Owen Lockwood Mei Si 178 65 0 18 Aug 2022
Reinforcement learning with experience replay and adaptation of action dispersion Pawel Wawrzyñski Wojciech Masarczyk M. Ostaszewski 75 1 0 30 Jul 2022
Annealed Training for Combinatorial Optimization on Graphs Haoran Sun E. Guha H. Dai 164 25 0 23 Jul 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning C. Steinparz Thomas Schmied Fabian Paischer Marius-Constantin Dinu Vihang Patil Angela Bitto-Nemling Hamid Eghbalzadeh Sepp Hochreiter CLL 212 16 0 12 Jul 2022
GAN-based Intrinsic Exploration For Sample Efficient Reinforcement LearningInternational Conference on Agents and Artificial Intelligence (ICAART), 2022 Dogay Kamar N. K. Üre Gözde B. Ünal 77 1 0 28 Jun 2022
Towards Understanding How Machines Can Learn Causal OverhypothesesAnnual Meeting of the Cognitive Science Society (CogSci), 2022 Eliza Kosoy David M. Chan Adrian Liu Jasmine Collins Bryanna Kaufmann Sandy Han Huang Jessica B. Hamrick John F. Canny Nan Rosemary Ke Alison Gopnik CML AI4CE 169 19 0 16 Jun 2022
BYOL-Explore: Exploration by Bootstrapped PredictionNeural Information Processing Systems (NeurIPS), 2022 Z. Guo S. Thakoor Miruna Pislar Bernardo Avila-Pires Florent Altché ... Yunhao Tang Michal Valko Rémi Munos M. G. Azar Bilal Piot 222 84 0 16 Jun 2022
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance Jakob J. Hollenstein Sayantan Auddy Matteo Saveriano Erwan Renaudo J. Piater 282 27 0 08 Jun 2022
Policy Gradient Algorithms with Monte Carlo Tree Learning for Non-Markov Decision Processes Tetsuro Morimura Kazuhiro Ota Kenshi Abe Peinan Zhang OffRL 255 0 0 02 Jun 2022
Graph Backup: Data Efficient Backup Exploiting Markovian Transitions Zhengyao Jiang Tianjun Zhang Robert Kirk Tim Rocktaschel Edward Grefenstette OffRL 81 2 0 31 May 2022
k-Means Maximum Entropy Exploration Alexander Nedergaard Matthew Cook 147 15 0 31 May 2022
Off-Beat Multi-Agent Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2022 Wei Qiu Weixun Wang Rongpin Wang Bo An Yujing Hu S. Obraztsova Zinovi Rabinovich Jianye Hao Yingfeng Chen Changjie Fan OffRL 125 2 0 27 May 2022
SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning Marco Bagatella Sammy Christen Otmar Hilliges OffRL 225 6 0 26 May 2022
Reward Uncertainty for Exploration in Preference-based Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2022 Xinran Liang Katherine Shu Kimin Lee Pieter Abbeel 116 71 0 24 May 2022
Concurrent Credit Assignment for Data-efficient Reinforcement LearningIEEE International Joint Conference on Neural Network (IJCNN), 2022 Emmanuel Daucé 72 2 0 24 May 2022
An Evaluation Study of Intrinsic Motivation Techniques applied to Reinforcement Learning over Hard Exploration EnvironmentsInternational Cross-Domain Conference on Machine Learning and Knowledge Extraction (CD-MAKE), 2022 Alain Andres Esther Villar-Rodriguez Javier Del Ser 147 11 0 23 May 2022
Complex behavior from intrinsic motivation to occupy action-state path spaceNature Communications (Nat Commun), 2022 Jorge Ramírez-Ruiz D. Grytskyy Chiara Mastrogiuseppe Yamen Habib R. Moreno-Bote 216 14 0 20 May 2022
ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated CharactersACM Transactions on Graphics (TOG), 2022 Xue Bin Peng Yunrong Guo L. Halper Sergey Levine Sanja Fidler 162 15 0 04 May 2022
Exploration in Deep Reinforcement Learning: A SurveyInformation Fusion (Inf. Fusion), 2022 Pawel Ladosz Lilian Weng Minwoo Kim H. Oh OffRL 272 479 0 02 May 2022
Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at ScaleComputer Vision and Pattern Recognition (CVPR), 2022 Ram Ramrakhya Eric Undersander Dhruv Batra Abhishek Das LM&Ro 303 130 0 07 Apr 2022
Off-Policy Evaluation with Online Adaptation for Robot Exploration in Challenging EnvironmentsIEEE Robotics and Automation Letters (RA-L), 2022 Yafei Hu Shaoshu Su Chen Wang John Keller Sebastian Scherer OffRL 177 15 0 07 Apr 2022
Curiosity Driven Self-supervised Tactile Exploration of Unknown Objects Yujie Lu Jianren Wang Vikash Kumar 181 4 0 31 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from VideosInternational Conference on Machine Learning (ICML), 2022 Younggyo Seo Kimin Lee Stephen James Pieter Abbeel SSL OnRL 262 133 0 25 Mar 2022
Intrinsically-Motivated Reinforcement Learning: A Brief Introduction Mingqi Yuan 172 2 0 03 Mar 2022
Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2022 Durgesh Kalwar Omkar Shelke Somjit Nath Hardik Meisheri H. Khadilkar 156 1 0 02 Mar 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share? Alain Andres Esther Villar-Rodriguez Javier Del Ser 139 11 0 24 Feb 2022
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in IntralogisticsApplied Sciences (Appl. Sci.), 2022 Honghu Xue Benedikt Hein M. Bakr Georg Schildbach Bengt Abel Elmar Rueckert 213 21 0 23 Feb 2022
Reinforcement Learning in Practice: Opportunities and Challenges Yuxi Li OffRL 211 18 0 23 Feb 2022
Learning Causal Overhypotheses through Exploration in Children and Computational ModelsCLEaR (CLEaR), 2022 Eliza Kosoy Adrian Liu Jasmine Collins David M. Chan Jessica B. Hamrick Nan Rosemary Ke Sandy H Huang Bryanna Kaufmann John F. Canny Alison Gopnik CML 166 11 0 21 Feb 2022
Improving Intrinsic Exploration with Language AbstractionsNeural Information Processing Systems (NeurIPS), 2022 Jesse Mu Victor Zhong Roberta Raileanu Minqi Jiang Noah D. Goodman Tim Rocktaschel Edward Grefenstette 228 72 0 17 Feb 2022