Ensemble Bootstrapping for Q-Learning

International Conference on Machine Learning (ICML), 2021

28 February 2021

Papers citing "Ensemble Bootstrapping for Q-Learning"

21 / 21 papers shown

An Arbitration Control for an Ensemble of Diversified DQN variants in Continual Reinforcement Learning

Wonseo Jang

Dongjae Kim

245

05 Sep 2025

Scaling DRL for Decision Making: A Survey on Data, Network, and Training Budget Strategies

242

05 Aug 2025

Broad Critic Deep Actor Reinforcement Learning for Continuous Control

IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024

Tianang Sun

549

24 Nov 2024

Batch Ensemble for Variance Dependent Regret in Stochastic Bandits

AAAI Conference on Artificial Intelligence (AAAI), 2024

Asaf B. Cassel

Orin Levy

Yishay Mansour

OffRL

227

13 Sep 2024

Coverage Analysis of Multi-Environment Q-Learning Algorithms for Wireless Network Optimization

International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), 2024

Talha Bozkus

Urbashi Mitra

305

29 Aug 2024

Mixture of Experts in a Mixture of RL settings

Jakob Foerster

Pablo Samuel Castro

389

26 Jun 2024

Oracle-Efficient Reinforcement Learning for Max Value Ensembles

262

27 May 2024

The Curse of Diversity in Ensemble-Based Exploration

354

07 May 2024

Dissecting Deep RL with High Update Ratios: Combatting Value Divergence

Marcel Hussing

C. Voelcker

Igor Gilitschenski

Amir-massoud Farahmand

Eric Eaton

456

09 Mar 2024

Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale Wireless Networks

Talha Bozkus

Urbashi Mitra

307

12 Feb 2024

Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization

Talha Bozkus

Urbashi Mitra

OffRL

322

08 Feb 2024

Learning Uncertainty-Aware Temporally-Extended Actions

171

08 Feb 2024

Intentionally-underestimated Value Function at Terminal State for Temporal-difference Learning with Mis-designed Reward

Results in Control and Optimization (RCO), 2023

Taisuke Kobayashi

227

24 Aug 2023

Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback

Neural Information Processing Systems (NeurIPS), 2023

Hang Wang

Sen Lin

Junshan Zhang

213

20 Jun 2023

Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic

International Conference on Machine Learning (ICML), 2023

494

05 Jun 2023

Graph Exploration for Effective Multi-agent Q-Learning

IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023

Ainur Zhaikhan

Ali H. Sayed

326

19 Apr 2023

Ensemble Reinforcement Learning: A Survey

Applied Soft Computing (Appl. Soft Comput.), 2023

Yanjie Song

Ponnuthurai Nagaratnam Suganthan

Witold Pedrycz

319

05 Mar 2023

Factors of Influence of the Overestimation Bias of Q-Learning

Julius Wagenbach

M. Sabatelli

326

11 Oct 2022

Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks

International Conference on Machine Learning (ICML), 2022

Pieter Abbeel

324

16 Sep 2022

A Review of Uncertainty for Deep Reinforcement Learning

Artificial Intelligence and Interactive Digital Entertainment Conference (AIIDE), 2022

Owen Lockwood

Mei Si

286

18 Aug 2022

Balancing Value Underestimation and Overestimation with Realistic Actor-Critic

408

19 Oct 2021