ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.05363
  4. Cited By
Curiosity-driven Exploration by Self-supervised Prediction

Curiosity-driven Exploration by Self-supervised Prediction

15 May 2017
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
    LRM
    SSL
ArXivPDFHTML

Papers citing "Curiosity-driven Exploration by Self-supervised Prediction"

50 / 458 papers shown
Title
Dynamic Sight Range Selection in Multi-Agent Reinforcement Learning
Dynamic Sight Range Selection in Multi-Agent Reinforcement Learning
Wei-Chen Liao
Ti-Rong Wu
I-Chen Wu
12
0
0
19 May 2025
Automatic Reward Shaping from Confounded Offline Data
Automatic Reward Shaping from Confounded Offline Data
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
OffRL
OnRL
33
1
0
16 May 2025
Parameter Estimation using Reinforcement Learning Causal Curiosity: Limits and Challenges
Parameter Estimation using Reinforcement Learning Causal Curiosity: Limits and Challenges
Miguel Arana-Catania
Weisi Guo
CML
30
0
0
13 May 2025
Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models
Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models
Seungjae Lee
Daniel Ekpo
Haowen Liu
Furong Huang
Abhinav Shrivastava
Jia-Bin Huang
LM&Ro
40
0
0
12 May 2025
Optimization of Infectious Disease Intervention Measures Based on Reinforcement Learning - Empirical analysis based on UK COVID-19 epidemic data
Optimization of Infectious Disease Intervention Measures Based on Reinforcement Learning - Empirical analysis based on UK COVID-19 epidemic data
Baida Zhang
Yakai Chen
Huichun Li
Zhenghu Zu
24
0
0
07 May 2025
ARDNS-FN-Quantum: A Quantum-Enhanced Reinforcement Learning Framework with Cognitive-Inspired Adaptive Exploration for Dynamic Environments
ARDNS-FN-Quantum: A Quantum-Enhanced Reinforcement Learning Framework with Cognitive-Inspired Adaptive Exploration for Dynamic Environments
Umberto Gonçalves de Sousa
19
0
0
07 May 2025
Interpretable Learning Dynamics in Unsupervised Reinforcement Learning
Interpretable Learning Dynamics in Unsupervised Reinforcement Learning
Shashwat Pandey
AI4CE
21
0
0
06 May 2025
Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning
Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning
Lang Feng
Weihao Tan
Zhiyi Lyu
Longtao Zheng
Haiyang Xu
M. Yan
Fei Huang
Jingyi Wang
29
0
0
01 May 2025
Curiosity Driven Exploration to Optimize Structure-Property Learning in Microscopy
Curiosity Driven Exploration to Optimize Structure-Property Learning in Microscopy
Aditya Vatsavai
Ganesh Narasimha
Yongtao Liu
Jan-Chi Yang
Hiroshu Funakubo
M. Ziatdinov
Rama K Vasudevan
19
0
0
28 Apr 2025
Wanting to be Understood
Wanting to be Understood
Chrisantha Fernando
Dylan Banarse
Simon Osindero
41
0
0
09 Apr 2025
Bridging Deep Reinforcement Learning and Motion Planning for Model-Free Navigation in Cluttered Environments
Bridging Deep Reinforcement Learning and Motion Planning for Model-Free Navigation in Cluttered Environments
Licheng Luo
Mingyu Cai
38
0
0
09 Apr 2025
World Model Agents with Change-Based Intrinsic Motivation
World Model Agents with Change-Based Intrinsic Motivation
Jeremias Ferrao
Rafael Cunha
OffRL
MoE
57
0
0
26 Mar 2025
Curiosity-Diffuser: Curiosity Guide Diffusion Models for Reliability
Curiosity-Diffuser: Curiosity Guide Diffusion Models for Reliability
Zihao Liu
Xing Liu
Yizhai Zhang
Zhengxiong Liu
Panfeng Huang
69
0
0
19 Mar 2025
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Moritz A. Zanger
Pascal R. van der Vaart
Wendelin Bohmer
M. Spaan
UQCV
BDL
182
0
0
14 Mar 2025
Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models
Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models
Nguyen H K. Do
Truc Nguyen
Malik Hassanaly
Raed Alharbi
Jung Taek Seo
My T. Thai
54
0
0
09 Mar 2025
M3HF: Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality
Ziyan Wang
Zhicheng Zhang
Fei Fang
Yali Du
44
0
0
03 Mar 2025
Reducing Reward Dependence in RL Through Adaptive Confidence Discounting
Reducing Reward Dependence in RL Through Adaptive Confidence Discounting
Muhammed Yusuf Satici
David L. Roberts
OffRL
46
0
0
28 Feb 2025
Towards a Reward-Free Reinforcement Learning Framework for Vehicle Control
Towards a Reward-Free Reinforcement Learning Framework for Vehicle Control
Jielong Yang
Daoyuan Huang
46
0
0
21 Feb 2025
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning
Wesley A. Suttle
A. Suresh
Carlos Nieto-Granda
OffRL
97
0
0
06 Feb 2025
CAIMAN: Causal Action Influence Detection for Sample-efficient Loco-manipulation
CAIMAN: Causal Action Influence Detection for Sample-efficient Loco-manipulation
Yuanchen Yuan
Jin Cheng
Núria Armengol Urpí
Stelian Coros
74
1
0
02 Feb 2025
Regularized Langevin Dynamics for Combinatorial Optimization
Regularized Langevin Dynamics for Combinatorial Optimization
Shengyu Feng
Yiming Yang
73
0
0
01 Feb 2025
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Zishun Yu
Tengyu Xu
Di Jin
Karthik Abinav Sankararaman
Yun He
...
Eryk Helenowski
Chen Zhu
Sinong Wang
Hao Ma
Han Fang
LRM
54
4
0
29 Jan 2025
Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
Wenzhang Liu
Lianjun Jin
Lu Ren
Chaoxu Mu
Changyin Sun
CML
50
0
0
24 Jan 2025
Boosting MCTS with Free Energy Minimization
Boosting MCTS with Free Energy Minimization
Mawaba Pascal Dao
Adrian Peter
76
0
0
22 Jan 2025
Adaptive Data Exploitation in Deep Reinforcement Learning
Adaptive Data Exploitation in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
213
0
0
22 Jan 2025
NBDI: A Simple and Effective Termination Condition for Skill Extraction from Task-Agnostic Demonstrations
NBDI: A Simple and Effective Termination Condition for Skill Extraction from Task-Agnostic Demonstrations
Myunsoo Kim
Hayeong Lee
Seong-Woong Shim
JunHo Seo
Byung-Jun Lee
LLMAG
37
0
0
22 Jan 2025
Autonomous Algorithm for Training Autonomous Vehicles with Minimal Human Intervention
Autonomous Algorithm for Training Autonomous Vehicles with Minimal Human Intervention
Sang-Hyun Lee
Daehyeok Kwon
Seung-Woo Seo
76
1
0
17 Jan 2025
Grounding Video Models to Actions through Goal Conditioned Exploration
Grounding Video Models to Actions through Goal Conditioned Exploration
Yunhao Luo
Yilun Du
LM&Ro
VGen
85
1
0
11 Nov 2024
Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration
Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration
Xingrui Yu
Zhenglin Wan
David Mark Bossens
Yueming Lyu
Qing Guo
Ivor W. Tsang
173
0
0
11 Nov 2024
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
A. Jain
Harley Wiltzer
Jesse Farebrother
Irina Rish
Glen Berseth
Sanjiban Choudhury
57
1
0
11 Nov 2024
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSL
OffRL
OnRL
59
0
0
23 Oct 2024
Prioritized Generative Replay
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
114
2
0
23 Oct 2024
Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control
Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control
Devdhar Patel
H. Siegelmann
OffRL
37
0
0
11 Oct 2024
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Jasmine Bayrooti
Carl Henrik Ek
Amanda Prorok
42
0
0
07 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
40
1
0
07 Oct 2024
Autonomous Network Defence using Reinforcement Learning
Autonomous Network Defence using Reinforcement Learning
Myles Foley
Chris Hicks
Kate Highnam
V. Mavroudis
AAML
19
29
0
26 Sep 2024
DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control
DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control
Zichen Jeff Cui
Hengkai Pan
Aadhithya Iyer
Siddhant Haldar
Lerrel Pinto
VGen
36
10
0
18 Sep 2024
Optimizing TD3 for 7-DOF Robotic Arm Grasping: Overcoming Suboptimality
  with Exploration-Enhanced Contrastive Learning
Optimizing TD3 for 7-DOF Robotic Arm Grasping: Overcoming Suboptimality with Exploration-Enhanced Contrastive Learning
Wen-Han Hsieh
Jen-Yuan Chang
23
0
0
26 Aug 2024
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction
Anthony GX-Chen
Kenneth Marino
Rob Fergus
OCL
56
1
0
21 Aug 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
39
6
0
06 Aug 2024
Automatic Environment Shaping is the Next Frontier in RL
Automatic Environment Shaping is the Next Frontier in RL
Younghyo Park
G. Margolis
Pulkit Agrawal
OffRL
40
3
0
23 Jul 2024
Random Latent Exploration for Deep Reinforcement Learning
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
33
3
0
18 Jul 2024
Wind Estimation in Unmanned Aerial Vehicles with Causal Machine Learning
Wind Estimation in Unmanned Aerial Vehicles with Causal Machine Learning
Abdulaziz Alwalan
Miguel Arana-Catania
24
0
0
01 Jul 2024
Exploration by Learning Diverse Skills through Successor State Measures
Exploration by Learning Diverse Skills through Successor State Measures
Paul-Antoine Le Tolguenec
Yann Besse
Florent Teichteil-Königsbuch
Dennis G. Wilson
Emmanuel Rachelson
40
0
0
14 Jun 2024
World Models with Hints of Large Language Models for Goal Achieving
World Models with Hints of Large Language Models for Goal Achieving
Zeyuan Liu
Ziyu Huan
Xiyao Wang
Jiafei Lyu
Jian Tao
Xiu Li
Furong Huang
Huazhe Xu
LM&Ro
LRM
AI4CE
46
1
0
11 Jun 2024
Open-Endedness is Essential for Artificial Superhuman Intelligence
Open-Endedness is Essential for Artificial Superhuman Intelligence
Edward Hughes
Michael Dennis
Jack Parker-Holder
Feryal M. P. Behbahani
Aditi Mavalankar
Yuge Shi
Tom Schaul
Tim Rocktaschel
LRM
40
22
0
06 Jun 2024
LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning
LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning
Hyungho Na
IL-Chul Moon
43
1
0
30 May 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Mingqi Yuan
Roger Creus Castanyer
Bo Li
Xin Jin
Glen Berseth
Wenjun Zeng
40
0
0
29 May 2024
A social path to human-like artificial intelligence
A social path to human-like artificial intelligence
Edgar A. Duénez-Guzmán
Suzanne Sadedin
Jane X. Wang
Kevin R. McKee
Joel Z Leibo
GNN
31
28
0
22 May 2024
Ensuring Ground Truth Accuracy in Healthcare with the EVINCE framework
Ensuring Ground Truth Accuracy in Healthcare with the EVINCE framework
Edward Y. Chang
44
0
0
20 May 2024
1234...8910
Next