ResearchTrend.AI
Exploration by Random Network Distillation


30 October 2018
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov

Papers citing "Exploration by Random Network Distillation"

50 / 277 papers shown
Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models
Seungjae Lee
Daniel Ekpo
Haowen Liu
Furong Huang
Abhinav Shrivastava
Jia-Bin Huang
LM&Ro
40
0
0
12 May 2025
Interpretable Learning Dynamics in Unsupervised Reinforcement Learning
Shashwat Pandey
AI4CE
16
0
0
06 May 2025
Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning
Shangzhe Li
Zhiao Huang
Hao Su
66
0
0
04 May 2025
Bridging Deep Reinforcement Learning and Motion Planning for Model-Free Navigation in Cluttered Environments
Licheng Luo
Mingyu Cai
38
0
0
09 Apr 2025
Learning Bipedal Locomotion on Gear-Driven Humanoid Robot Using Foot-Mounted IMUs
Sotaro Katayama
Yuta Koda
Norio Nagatsuka
Masaya Kinoshita
53
0
0
01 Apr 2025
Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning
Yongshuai Liu
Xin Liu
93
1
0
26 Mar 2025
World Model Agents with Change-Based Intrinsic Motivation
Jeremias Ferrao
Rafael Cunha
OffRL
MoE
52
0
0
26 Mar 2025
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
Yongshuai Liu
Xin Liu
GAN
103
2
0
24 Mar 2025
Causally Aligned Curriculum Learning
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
CML
61
3
0
21 Mar 2025
Curiosity-Diffuser: Curiosity Guide Diffusion Models for Reliability
Zihao Liu
Xing Liu
Yizhai Zhang
Zhengxiong Liu
Panfeng Huang
69
0
0
19 Mar 2025
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Moritz A. Zanger
Pascal R. van der Vaart
Wendelin Böhmer
M. Spaan
UQCV
BDL
149
0
0
14 Mar 2025
Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning
Zhenghai Xue
Lang Feng
Jiacheng Xu
Kang Kang
Xiang Wen
Jingyi Wang
Shuicheng Yan
OffRL
53
0
0
10 Mar 2025
Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models
Nguyen H K. Do
Truc Nguyen
Malik Hassanaly
Raed Alharbi
Jung Taek Seo
My T. Thai
54
0
0
09 Mar 2025
Reducing Reward Dependence in RL Through Adaptive Confidence Discounting
Muhammed Yusuf Satici
David L. Roberts
OffRL
41
0
0
28 Feb 2025
Spatial-aware decision-making with ring attractors in reinforcement learning systems
Marcos Negre Saura
Richard Allmendinger
Theodore Papamarkou
Wei Pan
143
0
0
17 Feb 2025
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning
Wesley A. Suttle
A. Suresh
Carlos Nieto-Granda
OffRL
95
0
0
06 Feb 2025
CAIMAN: Causal Action Influence Detection for Sample-efficient Loco-manipulation
Yuanchen Yuan
Jin Cheng
Núria Armengol Urpí
Stelian Coros
74
1
0
02 Feb 2025
NBDI: A Simple and Efficient Termination Condition for Skill Extraction from Task-Agnostic Demonstrations
Myunsoo Kim
Hayeong Lee
Seong-Woong Shim
JunHo Seo
Byung-Jun Lee
LLMAG
37
0
0
22 Jan 2025
Adaptive Data Exploitation in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
175
0
0
22 Jan 2025
Learning to Assist Humans without Inferring Rewards
Vivek Myers
Evan Ellis
Sergey Levine
Benjamin Eysenbach
Anca Dragan
40
2
0
17 Jan 2025
Autonomous Algorithm for Training Autonomous Vehicles with Minimal Human Intervention
Sang-Hyun Lee
Daehyeok Kwon
Seung-Woo Seo
76
1
0
17 Jan 2025
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
Chunyu Xuan
Yazhe Niu
Yuan Pu
Shuai Hu
Yu Liu
Jing Yang
65
0
0
03 Jan 2025
Efficient Active Imitation Learning with Random Network Distillation
Emilien Biré
Anthony Kobanda
Ludovic Denoyer
Rémy Portelas
38
1
0
04 Nov 2024
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSL
OffRL
OnRL
57
0
0
23 Oct 2024
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
114
2
0
23 Oct 2024
HG2P: Hippocampus-inspired High-reward Graph and Model-Free Q-Gradient Penalty for Path Planning and Motion Control
Haoran Wang
Yaoru Sun
Zeshen Tang
Haibo Shi
Chenyuan Jiao
29
0
0
12 Oct 2024
Neuroplastic Expansion in Deep Reinforcement Learning
Jiashun Liu
J. Obando-Ceron
Aaron C. Courville
L. Pan
42
3
0
10 Oct 2024
DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
Zaid Khan
Elias Stengel-Eskin
Jaemin Cho
Joey Tianyi Zhou
VGen
43
1
0
08 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
38
1
0
07 Oct 2024
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction
Anthony GX-Chen
Kenneth Marino
Rob Fergus
OCL
56
1
0
21 Aug 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
31
5
0
06 Aug 2024
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
33
3
0
18 Jul 2024
Learning to Steer Markovian Agents under Model Uncertainty
Jiawei Huang
Vinzenz Thoma
Zebang Shen
H. Nax
Niao He
43
2
0
14 Jul 2024
Exploration by Learning Diverse Skills through Successor State Measures
Paul-Antoine Le Tolguenec
Yann Besse
Florent Teichteil-Königsbuch
Dennis G. Wilson
Emmanuel Rachelson
38
0
0
14 Jun 2024
World Models with Hints of Large Language Models for Goal Achieving
Zeyuan Liu
Ziyu Huan
Xiyao Wang
Jiafei Lyu
Jian Tao
Xiu Li
Furong Huang
Huazhe Xu
LM&Ro
LRM
AI4CE
46
1
0
11 Jun 2024
Language Guided Skill Discovery
Seungeun Rho
Laura Smith
Tianyu Li
Sergey Levine
Xue Bin Peng
Sehoon Ha
LM&Ro
42
4
0
07 Jun 2024
Open-Endedness is Essential for Artificial Superhuman Intelligence
Edward Hughes
Michael Dennis
Jack Parker-Holder
Feryal M. P. Behbahani
Aditi Mavalankar
Yuge Shi
Tom Schaul
Tim Rocktäschel
LRM
34
21
0
06 Jun 2024
Do's and Don'ts: Learning Desirable Skills with Instruction Videos
Hyunseung Kim
ByungKun Lee
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Jaegul Choo
39
1
0
01 Jun 2024
LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning
Hyungho Na
IL-Chul Moon
43
1
0
30 May 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Mingqi Yuan
Roger Creus Castanyer
Bo Li
Xin Jin
Glen Berseth
Wenjun Zeng
40
0
0
29 May 2024
Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning
Xin Liu
Yaran Chen
Dong Zhao
43
1
0
20 May 2024
Visual Episodic Memory-based Exploration
J. Vice
Natalie Ruiz-Sanchez
P. Douglas
G. Sukthankar
31
0
0
18 May 2024
Goal Exploration via Adaptive Skill Distribution for Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
26
0
0
19 Apr 2024
Grasper: A Generalist Pursuer for Pursuit-Evasion Problems
Pengdeng Li
Shuxin Li
Xinrun Wang
Jakub Cerny
Youzhi Zhang
Stephen McAleer
Hau Chan
Bo An
42
1
0
19 Apr 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
25
0
0
31 Mar 2024
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Hany Hamed
Subin Kim
Dongyeong Kim
Jaesik Yoon
Sungjin Ahn
47
4
0
29 Feb 2024
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Tianying Ji
Yongyuan Liang
Yan Zeng
Yu-Juan Luo
Guowei Xu
Jiawei Guo
Ruijie Zheng
Furong Huang
Gang Hua
Huazhe Xu
CML
48
11
0
22 Feb 2024
Joint Intrinsic Motivation for Coordinated Exploration in Multi-Agent Deep Reinforcement Learning
Maxime Toquebiau
Nicolas Bredeche
F. Benamar
Jae-Yun Jun
36
1
0
06 Feb 2024
Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing
Haobin Jiang
Ziluo Ding
Zongqing Lu
24
2
0
03 Feb 2024
RLPlanner: Reinforcement Learning based Floorplanning for Chiplets with Fast Thermal Analysis
Yuanyuan Duan
Xingchen Liu
Zhiping Yu
Hanming Wu
Leilai Shao
Xiaolei Zhu
15
7
0
28 Dec 2023