ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.15670
25
4

Knowledge-Guided Exploration in Deep Reinforcement Learning

26 October 2022
Sahisnu Mazumder
Bing-Quan Liu
Shuai Wang
Yingxuan Zhu
Xiaotian Yin
Lifeng Liu
Jian Li
ArXivPDFHTML
Abstract

This paper proposes a new method to drastically speed up deep reinforcement learning (deep RL) training for problems that have the property of state-action permissibility (SAP). Two types of permissibility are defined under SAP. The first type says that after an action ata_tat​ is performed in a state sts_tst​ and the agent has reached the new state st+1s_{t+1}st+1​, the agent can decide whether ata_tat​ is permissible or not permissible in sts_tst​. The second type says that even without performing ata_tat​ in sts_tst​, the agent can already decide whether ata_tat​ is permissible or not in sts_tst​. An action is not permissible in a state if the action can never lead to an optimal solution and thus should not be tried (over and over again). We incorporate the proposed SAP property and encode action permissibility knowledge into two state-of-the-art deep RL algorithms to guide their state-action exploration together with a virtual stopping strategy. Results show that the SAP-based guidance can markedly speed up RL training.

View on arXiv
Comments on this paper