Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks

International Conference on Machine Learning (ICML), 2022
16 September 2022
Litian Liang
Yaosheng Xu
Alexander Shmakov
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
    OOD

Papers citing "Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks"

9 / 9 papers shown
RAPID: An Efficient Reinforcement Learning Algorithm for Small Language Models
Lianghuan Huang
Sagnik Anupam
Insup Lee
Shuo Li
Osbert Bastani
167
1
0
03 Oct 2025
Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL
Ömer Veysel Çağatan
Barış Akgün
OffRL
499
1
0
22 Oct 2024
Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning
Seyeon Kim
Joonhun Lee
Namhoon Cho
Sungjun Han
Seungeon Baek
482
0
0
05 Aug 2024
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. D'Oro
Evgenii Nikishin
Rameswar Panda
249
6
0
07 May 2024
REValueD: Regularised Ensemble Value-Decomposition for Factorisable Markov Decision Processes
International Conference on Learning Representations (ICLR), 2024
David Ireland
Giovanni Montana
300
5
0
16 Jan 2024
On the Importance of Exploration for Generalization in Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Yiding Jiang
J. Zico Kolter
Roberta Raileanu
UQCV
OffRL
203
38
0
08 Jun 2023
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Adaptive Agents and Multi-Agent Systems (AAMAS), 2023
Lukas Schafer
Oliver Slumbers
Alexander Shmakov
Yali Du
Stefano V. Albrecht
D. Mguni
646
7
0
07 Feb 2023
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Dmitry Akimov
Sergey Kolesnikov
OffRL
263
19
0
20 Nov 2022
An Experimental Comparison Between Temporal Difference and Residual Gradient with Neural Network Approximation
Shuyu Yin
Yaoyu Zhang
Peilin Liu
Z. Xu
241
2
0
25 May 2022