v1v2 (latest)

Explicit Explore-Exploit Algorithms in Continuous State Spaces

Neural Information Processing Systems (NeurIPS), 2019

1 November 2019

Mikael Henaff

OffRL

ArXiv (abs)PDF HTML

Papers citing "Explicit Explore-Exploit Algorithms in Continuous State Spaces"

24 / 24 papers shown

Grounding Video Models to Actions through Goal Conditioned ExplorationInternational Conference on Learning Representations (ICLR), 2024

Yunhao Luo

Yilun Du

LM&Ro VGen

401

11 Nov 2024

Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024

380

03 Nov 2024

Beyond Optimism: Exploration With Partially Observable Rewards

216

20 Jun 2024

Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning

Amos Storkey

263

09 Oct 2023

Reward Model Ensembles Help Mitigate OveroptimizationInternational Conference on Learning Representations (ICLR), 2023

333

181

04 Oct 2023

A Study of Global and Episodic Bonuses for Exploration in Contextual MDPsInternational Conference on Machine Learning (ICML), 2023

Mikael Henaff

Minqi Jiang

Roberta Raileanu

185

05 Jun 2023

What model does MuZero learn?European Conference on Artificial Intelligence (ECAI), 2023

Jinke He

Thomas M. Moerland

F. Oliehoek

333

01 Jun 2023

Conditional Mutual Information for Disentangled Representations in Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023

220

23 May 2023

Planning Goals for ExplorationInternational Conference on Learning Representations (ICLR), 2023

217

23 Mar 2023

A Survey of Historical Learning: Learning Models with Learning History

Xiang Li

Lingfeng Yang

Jian Yang

248

23 Mar 2023

Curiosity in Hindsight: Intrinsic Exploration in Stochastic EnvironmentsInternational Conference on Machine Learning (ICML), 2022

246

18 Nov 2022

Exploration in Deep Reinforcement Learning: A SurveyInformation Fusion (Inf. Fusion), 2022

309

493

02 May 2022

Generative Planning for Temporally Coordinated Exploration in Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2022

Haichao Zhang

Wei Xu

Haonan Yu

246

24 Jan 2022

Multi-Stage Episodic Control for Strategic Exploration in Text GamesInternational Conference on Learning Representations (ICLR), 2022

296

04 Jan 2022

Explicit Explore, Exploit, or Escape (

E^4

): near-optimal safety-constrained reinforcement learning in polynomial timeMachine-mediated learning (ML), 2021

David M. Bossens

Nick Bishop

277

14 Nov 2021

Adaptive Discretization in Online Reinforcement LearningOperational Research (OR), 2021

252

29 Oct 2021

Imaginary Hindsight Experience Replay: Curious Model-based Learning for Sparse Reward Tasks

247

05 Oct 2021

Learning to Plan Optimistically: Uncertainty-Guided Deep Exploration via Latent Model EnsemblesConference on Robot Learning (CoRL), 2020

Tim Seyde

Wilko Schwarting

S. Karaman

Daniela Rus

262

27 Oct 2020

Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration

Priyank Agrawal

Jinglin Chen

Nan Jiang

315

23 Oct 2020

PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient LearningNeural Information Processing Systems (NeurIPS), 2020

267

119

16 Jul 2020

Adaptive Discretization for Model-Based Reinforcement Learning

230

01 Jul 2020

Meta-Model-Based Meta-Policy OptimizationAsian Conference on Machine Learning (ACML), 2020

407

04 Jun 2020

Planning to Explore via Self-Supervised World Models

Pieter Abbeel

335

468

12 May 2020

Ready Policy One: World Building Through Active LearningInternational Conference on Machine Learning (ICML), 2020

227

07 Feb 2020