ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2012.14755
11
13

Improved Sample Complexity for Incremental Autonomous Exploration in MDPs

29 December 2020
Jean Tarbouriech
Matteo Pirotta
Michal Valko
A. Lazaric
ArXivPDFHTML
Abstract

We investigate the exploration of an unknown environment when no reward function is provided. Building on the incremental exploration setting introduced by Lim and Auer [1], we define the objective of learning the set of ϵ\epsilonϵ-optimal goal-conditioned policies attaining all states that are incrementally reachable within LLL steps (in expectation) from a reference state s0s_0s0​. In this paper, we introduce a novel model-based approach that interleaves discovering new states from s0s_0s0​ and improving the accuracy of a model estimate that is used to compute goal-conditioned policies to reach newly discovered states. The resulting algorithm, DisCo, achieves a sample complexity scaling as O~(L5SL+ϵΓL+ϵAϵ−2)\tilde{O}(L^5 S_{L+\epsilon} \Gamma_{L+\epsilon} A \epsilon^{-2})O~(L5SL+ϵ​ΓL+ϵ​Aϵ−2), where AAA is the number of actions, SL+ϵS_{L+\epsilon}SL+ϵ​ is the number of states that are incrementally reachable from s0s_0s0​ in L+ϵL+\epsilonL+ϵ steps, and ΓL+ϵ\Gamma_{L+\epsilon}ΓL+ϵ​ is the branching factor of the dynamics over such states. This improves over the algorithm proposed in [1] in both ϵ\epsilonϵ and LLL at the cost of an extra ΓL+ϵ\Gamma_{L+\epsilon}ΓL+ϵ​ factor, which is small in most environments of interest. Furthermore, DisCo is the first algorithm that can return an ϵ/cmin⁡\epsilon/c_{\min}ϵ/cmin​-optimal policy for any cost-sensitive shortest-path problem defined on the LLL-reachable states with minimum cost cmin⁡c_{\min}cmin​. Finally, we report preliminary empirical results confirming our theoretical findings.

View on arXiv
Comments on this paper