ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.12894
  4. Cited By
Exploration by Random Network Distillation

Exploration by Random Network Distillation

30 October 2018
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
ArXivPDFHTML

Papers citing "Exploration by Random Network Distillation"

50 / 277 papers shown
Title
Adaptive trajectory-constrained exploration strategy for deep
  reinforcement learning
Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
Guojian Wang
Faguo Wu
Xiao Zhang
Ning Guo
Zhiming Zheng
30
3
0
27 Dec 2023
Adaptive parameter sharing for multi-agent reinforcement learning
Adaptive parameter sharing for multi-agent reinforcement learning
Dapeng Li
Na Lou
Bin Zhang
Zhiwei Xu
Guoliang Fan
27
3
0
14 Dec 2023
Building Open-Ended Embodied Agent via Language-Policy Bidirectional
  Adaptation
Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation
Shaopeng Zhai
Jie Wang
Tianyi Zhang
Fuxian Huang
Qi Zhang
Ming Zhou
Jing Hou
Yu Qiao
Yu Liu
LLMAG
LM&Ro
37
1
0
12 Dec 2023
Agent-Aware Training for Agent-Agnostic Action Advising in Deep
  Reinforcement Learning
Agent-Aware Training for Agent-Agnostic Action Advising in Deep Reinforcement Learning
Yaoquan Wei
Shunyu Liu
Jie Song
Tongya Zheng
Kaixuan Chen
Yong Wang
Mingli Song
25
0
0
28 Nov 2023
Augmenting Unsupervised Reinforcement Learning with Self-Reference
Augmenting Unsupervised Reinforcement Learning with Self-Reference
Andrew Zhao
Erle Zhu
Rui Lu
Matthieu Lin
Yong-Jin Liu
Gao Huang
SSL
34
1
0
16 Nov 2023
Ensemble sampling for linear bandits: small ensembles suffice
Ensemble sampling for linear bandits: small ensembles suffice
David Janz
A. Litvak
Csaba Szepesvári
30
2
0
14 Nov 2023
Variational Curriculum Reinforcement Learning for Unsupervised Discovery
  of Skills
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
Seongun Kim
Kyowoon Lee
Jaesik Choi
SSL
DRL
41
7
0
30 Oct 2023
Neuro-Inspired Fragmentation and Recall to Overcome Catastrophic
  Forgetting in Curiosity
Neuro-Inspired Fragmentation and Recall to Overcome Catastrophic Forgetting in Curiosity
Jaedong Hwang
Zhang-Wei Hong
Eric Chen
Akhilan Boopathy
Pulkit Agrawal
Ila Fiete
CLL
35
5
0
26 Oct 2023
Iteratively Learn Diverse Strategies with State Distance Information
Iteratively Learn Diverse Strategies with State Distance Information
Wei Fu
Weihua Du
Jingwei Li
Sunli Chen
Jingzhao Zhang
Yi Wu
51
3
0
23 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
33
34
0
13 Oct 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate
  Exploration Bias
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRL
OnRL
31
1
0
12 Oct 2023
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement
  Learning
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
Trevor A. McInroe
Adam Jelley
Stefano V. Albrecht
Amos Storkey
OffRL
OnRL
20
6
0
09 Oct 2023
LESSON: Learning to Integrate Exploration Strategies for Reinforcement
  Learning via an Option Framework
LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework
Woojun Kim
Jeonghye Kim
Young-Jin Sung
20
5
0
05 Oct 2023
Intrinsic Language-Guided Exploration for Complex Long-Horizon Robotic
  Manipulation Tasks
Intrinsic Language-Guided Exploration for Complex Long-Horizon Robotic Manipulation Tasks
Wenke Huang
Filippos Christianos
Zhibin Li
37
8
0
28 Sep 2023
Zero-Shot Reinforcement Learning from Low Quality Data
Zero-Shot Reinforcement Learning from Low Quality Data
Scott Jeen
Tom Bewley
Jonathan M. Cullen
OffRL
OnRL
36
0
0
26 Sep 2023
Contrastive Initial State Buffer for Reinforcement Learning
Contrastive Initial State Buffer for Reinforcement Learning
Nico Messikommer
Yunlong Song
Davide Scaramuzza
OffRL
44
9
0
18 Sep 2023
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement
  Learning
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning
David Yunis
Justin Jung
Falcon Z. Dai
Matthew R. Walter
OffRL
39
0
0
08 Sep 2023
Adversarial Style Transfer for Robust Policy Optimization in Deep
  Reinforcement Learning
Adversarial Style Transfer for Robust Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
29
4
0
29 Aug 2023
Language Reward Modulation for Pretraining Reinforcement Learning
Language Reward Modulation for Pretraining Reinforcement Learning
Ademi Adeniji
Amber Xie
Carmelo Sferrazza
Younggyo Seo
Stephen James
Pieter Abbeel
39
26
0
23 Aug 2023
FoX: Formation-aware exploration in multi-agent reinforcement learning
FoX: Formation-aware exploration in multi-agent reinforcement learning
Yonghyeon Jo
Sunwoo Lee
Junghyuk Yum
Seungyul Han
35
5
0
22 Aug 2023
Goal-Conditioned Reinforcement Learning with Disentanglement-based
  Reachability Planning
Goal-Conditioned Reinforcement Learning with Disentanglement-based Reachability Planning
Zhifeng Qian
Mingyu You
Hongjun Zhou
Xuanhui Xu
Bin He
26
3
0
20 Jul 2023
Markov Decision Processes with Time-Varying Geometric Discounting
Markov Decision Processes with Time-Varying Geometric Discounting
Jiarui Gan
Ann-Kathrin Hennes
R. Majumdar
Debmalya Mandal
Goran Radanović
13
1
0
19 Jul 2023
Diverse Projection Ensembles for Distributional Reinforcement Learning
Diverse Projection Ensembles for Distributional Reinforcement Learning
Moritz A. Zanger
Wendelin Bohmer
M. Spaan
25
4
0
12 Jun 2023
Prediction Error-based Classification for Class-Incremental Learning
Prediction Error-based Classification for Class-Incremental Learning
Michal Zajkac
Tinne Tuytelaars
Gido M. van de Ven
CLL
25
8
0
30 May 2023
Provable and Practical: Efficient Exploration in Reinforcement Learning
  via Langevin Monte Carlo
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Haque Ishfaq
Qingfeng Lan
Pan Xu
A. R. Mahmood
Doina Precup
Anima Anandkumar
Kamyar Azizzadenesheli
BDL
OffRL
28
20
0
29 May 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control
  via Sample Multiple Reuse
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
31
9
0
29 May 2023
Visual Affordance Prediction for Guiding Robot Exploration
Visual Affordance Prediction for Guiding Robot Exploration
Homanga Bharadhwaj
Abhi Gupta
Shubham Tulsiani
44
12
0
28 May 2023
A Reminder of its Brittleness: Language Reward Shaping May Hinder
  Learning for Instruction Following Agents
A Reminder of its Brittleness: Language Reward Shaping May Hinder Learning for Instruction Following Agents
Sukai Huang
N. Lipovetzky
Trevor Cohn
30
2
0
26 May 2023
Semantically Aligned Task Decomposition in Multi-Agent Reinforcement
  Learning
Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning
Wenhao Li
Dan Qiao
Baoxiang Wang
Xiangfeng Wang
Bo Jin
H. Zha
35
5
0
18 May 2023
Demonstration-free Autonomous Reinforcement Learning via Implicit and
  Bidirectional Curriculum
Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum
Jigang Kim
Daesol Cho
H. J. Kim
22
3
0
17 May 2023
MIMEx: Intrinsic Rewards from Masked Input Modeling
MIMEx: Intrinsic Rewards from Masked Input Modeling
Toru Lin
Allan Jabri
OffRL
23
6
0
15 May 2023
Learning Achievement Structure for Structured Exploration in Domains
  with Sparse Reward
Learning Achievement Structure for Structured Exploration in Domains with Sparse Reward
Zihan Zhou
Animesh Garg
OffRL
19
3
0
30 Apr 2023
Accelerating exploration and representation learning with offline
  pre-training
Accelerating exploration and representation learning with offline pre-training
Bogdan Mazoure
Jake Bruce
Doina Precup
Rob Fergus
Ankit Anand
OffRL
31
5
0
31 Mar 2023
Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic
  Environments
Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic Environments
Hongyi Chen
Changliu Liu
OffRL
11
14
0
24 Mar 2023
Improved Sample Complexity for Reward-free Reinforcement Learning under
  Low-rank MDPs
Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs
Yuan Cheng
Ruiquan Huang
J. Yang
Yitao Liang
OffRL
41
8
0
20 Mar 2023
SVDE: Scalable Value-Decomposition Exploration for Cooperative
  Multi-Agent Reinforcement Learning
SVDE: Scalable Value-Decomposition Exploration for Cooperative Multi-Agent Reinforcement Learning
Shuhan Qi
Shuhao Zhang
Qiang-qiang Wang
Jia-jia Zhang
Jing Xiao
Xinyu Wang
34
0
0
16 Mar 2023
Fast Rates for Maximum Entropy Exploration
Fast Rates for Maximum Entropy Exploration
D. Tiapkin
Denis Belomestny
Daniele Calandriello
Eric Moulines
Rémi Munos
A. Naumov
Pierre Perrault
Yunhao Tang
Michal Valko
Pierre Menard
41
17
0
14 Mar 2023
Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent
  Reinforcement Learning
Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning
Ji-Yun Oh
Joonkee Kim
Minchan Jeong
Se-Young Yun
32
1
0
03 Mar 2023
Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement
  Learning
Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement Learning
Archit Sharma
Ahmed M. Ahmed
Rehaan Ahmad
Chelsea Finn
SSL
51
17
0
02 Mar 2023
Expert-Free Online Transfer Learning in Multi-Agent Reinforcement
  Learning
Expert-Free Online Transfer Learning in Multi-Agent Reinforcement Learning
A. Castagna
Ivana Dusparic
OffRL
18
2
0
02 Mar 2023
Failure-aware Policy Learning for Self-assessable Robotics Tasks
Failure-aware Policy Learning for Self-assessable Robotics Tasks
Kechun Xu
Runjian Chen
Shuqing Zhao
Zizhang Li
Hongxiang Yu
Ci Chen
Yue Wang
R. Xiong
20
1
0
25 Feb 2023
Self-supervised network distillation: an effective approach to
  exploration in sparse reward environments
Self-supervised network distillation: an effective approach to exploration in sparse reward environments
Matej Pecháč
M. Chovanec
Igor Farkaš
32
3
0
22 Feb 2023
Curiosity-driven Exploration in Sparse-reward Multi-agent Reinforcement
  Learning
Curiosity-driven Exploration in Sparse-reward Multi-agent Reinforcement Learning
Jiong Li
Pratik Gajane
37
4
0
21 Feb 2023
Guiding Pretraining in Reinforcement Learning with Large Language Models
Guiding Pretraining in Reinforcement Learning with Large Language Models
Yuqing Du
Olivia Watkins
Zihan Wang
Cédric Colas
Trevor Darrell
Pieter Abbeel
Abhishek Gupta
Jacob Andreas
LM&Ro
21
174
0
13 Feb 2023
Improving robot navigation in crowded environments using intrinsic rewards
Improving robot navigation in crowded environments using intrinsic rewards
Diego Martínez Baselga
L. Riazuelo
Luis Montano
45
12
0
13 Feb 2023
Investigating the role of model-based learning in exploration and
  transfer
Investigating the role of model-based learning in exploration and transfer
Jacob Walker
Eszter Vértes
Yazhe Li
Gabriel Dulac-Arnold
Ankesh Anand
T. Weber
Jessica B. Hamrick
OffRL
36
7
0
08 Feb 2023
Intrinsic Rewards from Self-Organizing Feature Maps for Exploration in
  Reinforcement Learning
Intrinsic Rewards from Self-Organizing Feature Maps for Exploration in Reinforcement Learning
Marius Lindegaard
Hjalmar Jacob Vinje
Odin Severinsen
22
2
0
06 Feb 2023
DITTO: Offline Imitation Learning with World Models
DITTO: Offline Imitation Learning with World Models
Branton DeMoss
Paul Duckworth
Nick Hawes
Ingmar Posner
Ingmar Posner
OffRL
21
18
0
06 Feb 2023
Reinforcing User Retention in a Billion Scale Short Video Recommender
  System
Reinforcing User Retention in a Billion Scale Short Video Recommender System
Qingpeng Cai
Shuchang Liu
Xueliang Wang
Tianyou Zuo
Wentao Xie
Bin Yang
Dong Zheng
Peng Jiang
Kun Gai
OffRL
22
41
0
03 Feb 2023
Diversity Through Exclusion (DTE): Niche Identification for
  Reinforcement Learning through Value-Decomposition
Diversity Through Exclusion (DTE): Niche Identification for Reinforcement Learning through Value-Decomposition
P. Sunehag
A. Vezhnevets
Edgar A. Duénez-Guzmán
Igor Mordach
Joel Z Leibo
26
2
0
02 Feb 2023
Previous
123456
Next