Adaptive Data Exploitation in Deep Reinforcement Learning

22 January 2025

Mingqi Yuan

Main: 8 Pages

37 Figures

Bibliography: 3 Pages

4 Tables

Appendix: 29 Pages

Abstract

We introduce ADEPT (Adaptive Data ExPloiTation), a simple yet powerful framework for improving **data efficiency** and **generalization** in deep reinforcement learning (RL). Specifically, ADEPT uses multi-armed bandit (MAB) algorithms to adaptively manage how sampled data is used across learning stages, optimizing data utilization while mitigating overfitting. Moreover, ADEPT can significantly reduce computational overhead and accelerate a wide range of RL algorithms. We test ADEPT on benchmarks including Procgen, MiniGrid, and PyBullet. Extensive simulations demonstrate that ADEPT can achieve superior performance with remarkable computational efficiency, offering a practical solution to data-efficient RL. Our code is available at this https URL.
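To make the bandit idea in the abstract concrete, here is a minimal sketch of a UCB1 bandit choosing how many times to reuse each batch of sampled data. It is an illustration under stated assumptions, not the paper's implementation: the abstract does not specify what the arms are or how they are rewarded, so the arm set (candidate update-epoch counts), the `UCBBandit` class, and the `toy_reward` signal are all hypothetical.

```python
import math
import random


class UCBBandit:
    """UCB1 bandit over a fixed set of arms (illustrative sketch only)."""

    def __init__(self, arms, exploration=2.0):
        self.arms = list(arms)                  # assumed: candidate reuse (epoch) counts
        self.exploration = exploration          # weight of the exploration bonus
        self.counts = [0] * len(self.arms)      # number of pulls per arm
        self.values = [0.0] * len(self.arms)    # running mean reward per arm
        self.total = 0                          # total number of pulls

    def select(self):
        # Play every arm once before applying the UCB rule.
        for i, count in enumerate(self.counts):
            if count == 0:
                return i
        scores = [
            self.values[i]
            + math.sqrt(self.exploration * math.log(self.total) / self.counts[i])
            for i in range(len(self.arms))
        ]
        return max(range(len(self.arms)), key=scores.__getitem__)

    def update(self, index, reward):
        # Incremental update of the arm's mean reward.
        self.counts[index] += 1
        self.total += 1
        self.values[index] += (reward - self.values[index]) / self.counts[index]


def toy_reward(epochs, rng):
    # Hypothetical stand-in for a learning-progress signal; it peaks at 2 epochs.
    return 1.0 - 0.3 * abs(epochs - 2) + rng.gauss(0.0, 0.05)


def run(rounds=500, seed=0):
    rng = random.Random(seed)
    bandit = UCBBandit(arms=[1, 2, 3, 5])
    for _ in range(rounds):
        index = bandit.select()
        epochs = bandit.arms[index]
        # In a real training loop, the agent would be updated for `epochs`
        # passes over the current batch here, and the reward would be
        # measured from that update rather than simulated.
        bandit.update(index, toy_reward(epochs, rng))
    return bandit
```

Calling `run()` returns the bandit after 500 simulated rounds; its `counts` attribute shows how often each candidate epoch count was chosen. In an actual RL pipeline, the simulated reward would be replaced by whatever progress measure the method defines.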
