ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.00445
  4. Cited By
Ensemble Bootstrapping for Q-Learning
v1v2 (latest)

Ensemble Bootstrapping for Q-Learning

International Conference on Machine Learning (ICML), 2021
28 February 2021
Oren Peer
Chen Tessler
Nadav Merlis
Ron Meir
ArXiv (abs)PDFHTMLGithub (1668★)

Papers citing "Ensemble Bootstrapping for Q-Learning"

21 / 21 papers shown
An Arbitration Control for an Ensemble of Diversified DQN variants in Continual Reinforcement Learning
An Arbitration Control for an Ensemble of Diversified DQN variants in Continual Reinforcement Learning
Wonseo Jang
Dongjae Kim
245
0
0
05 Sep 2025
Scaling DRL for Decision Making: A Survey on Data, Network, and Training Budget Strategies
Scaling DRL for Decision Making: A Survey on Data, Network, and Training Budget Strategies
Yi Ma
Hongyao Tang
Chenjun Xiao
Yaodong Yang
Wei Wei
Jianye Hao
Jiye Liang
OffRL
242
0
0
05 Aug 2025
Broad Critic Deep Actor Reinforcement Learning for Continuous Control
Broad Critic Deep Actor Reinforcement Learning for Continuous ControlIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Shiron Thalagala
Pak Kin Wong
Xiaozheng Wang
Tianang Sun
OffRL
549
2
0
24 Nov 2024
Batch Ensemble for Variance Dependent Regret in Stochastic Bandits
Batch Ensemble for Variance Dependent Regret in Stochastic BanditsAAAI Conference on Artificial Intelligence (AAAI), 2024
Asaf B. Cassel
Orin Levy
Yishay Mansour
OffRL
227
3
0
13 Sep 2024
Coverage Analysis of Multi-Environment Q-Learning Algorithms for
  Wireless Network Optimization
Coverage Analysis of Multi-Environment Q-Learning Algorithms for Wireless Network OptimizationInternational Workshop on Signal Processing Advances in Wireless Communications (SPAWC), 2024
Talha Bozkus
Urbashi Mitra
305
3
0
29 Aug 2024
Mixture of Experts in a Mixture of RL settings
Mixture of Experts in a Mixture of RL settings
Timon Willi
J. Obando-Ceron
Jakob Foerster
Karolina Dziugaite
Pablo Samuel Castro
MoE
389
18
0
26 Jun 2024
Oracle-Efficient Reinforcement Learning for Max Value Ensembles
Oracle-Efficient Reinforcement Learning for Max Value Ensembles
Marcel Hussing
Michael Kearns
Aaron Roth
S. B. Sengupta
Jessica Sorrell
262
1
0
27 May 2024
The Curse of Diversity in Ensemble-Based Exploration
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. DÓro
Evgenii Nikishin
Rameswar Panda
354
9
0
07 May 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
456
16
0
09 Mar 2024
Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale
  Wireless Networks
Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale Wireless Networks
Talha Bozkus
Urbashi Mitra
307
10
0
12 Feb 2024
Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy
  Optimization
Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization
Talha Bozkus
Urbashi Mitra
OffRL
322
8
0
08 Feb 2024
Learning Uncertainty-Aware Temporally-Extended Actions
Learning Uncertainty-Aware Temporally-Extended Actions
Joongkyu Lee
Seung Joon Park
Yunhao Tang
Min-hwan Oh
171
3
0
08 Feb 2024
Intentionally-underestimated Value Function at Terminal State for
  Temporal-difference Learning with Mis-designed Reward
Intentionally-underestimated Value Function at Terminal State for Temporal-difference Learning with Mis-designed RewardResults in Control and Optimization (RCO), 2023
Taisuke Kobayashi
227
5
0
24 Aug 2023
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error
  Feedback
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error FeedbackNeural Information Processing Systems (NeurIPS), 2023
Hang Wang
Sen Lin
Junshan Zhang
213
27
0
20 Jun 2023
Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy
  Actor-Critic
Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-CriticInternational Conference on Machine Learning (ICML), 2023
Tianying Ji
Yuping Luo
Gang Hua
Xianyuan Zhan
Jianwei Zhang
Huazhe Xu
OffRLOnRL
494
23
0
05 Jun 2023
Graph Exploration for Effective Multi-agent Q-Learning
Graph Exploration for Effective Multi-agent Q-LearningIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Ainur Zhaikhan
Ali H. Sayed
326
3
0
19 Apr 2023
Ensemble Reinforcement Learning: A Survey
Ensemble Reinforcement Learning: A SurveyApplied Soft Computing (Appl. Soft Comput.), 2023
Yanjie Song
Ponnuthurai Nagaratnam Suganthan
Witold Pedrycz
Junwei Ou
Yongming He
Yihao Chen
Yutong Wu
OffRL
319
62
0
05 Mar 2023
Factors of Influence of the Overestimation Bias of Q-Learning
Factors of Influence of the Overestimation Bias of Q-Learning
Julius Wagenbach
M. Sabatelli
326
3
0
11 Oct 2022
Reducing Variance in Temporal-Difference Value Estimation via Ensemble
  of Deep Networks
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep NetworksInternational Conference on Machine Learning (ICML), 2022
Litian Liang
Yaosheng Xu
Alexander Shmakov
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
OOD
324
26
0
16 Sep 2022
A Review of Uncertainty for Deep Reinforcement Learning
A Review of Uncertainty for Deep Reinforcement LearningArtificial Intelligence and Interactive Digital Entertainment Conference (AIIDE), 2022
Owen Lockwood
Mei Si
286
79
0
18 Aug 2022
Balancing Value Underestimation and Overestimation with Realistic
  Actor-Critic
Balancing Value Underestimation and Overestimation with Realistic Actor-Critic
Sicen Li
Qinyun Tang
G. Wang
Xinmeng Ma
Li-quan Wang
OffRL
408
4
0
19 Oct 2021
1
Page 1 of 1