ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1302.7175
  4. Cited By
Estimating the Maximum Expected Value: An Analysis of (Nested) Cross
  Validation and the Maximum Sample Average
v1v2 (latest)

Estimating the Maximum Expected Value: An Analysis of (Nested) Cross Validation and the Maximum Sample Average

28 February 2013
H. V. Hasselt
ArXiv (abs)PDFHTML

Papers citing "Estimating the Maximum Expected Value: An Analysis of (Nested) Cross Validation and the Maximum Sample Average"

9 / 9 papers shown
Anti-Overestimation Dialogue Policy Learning for Task-Completion
  Dialogue System
Anti-Overestimation Dialogue Policy Learning for Task-Completion Dialogue System
T. Chang
Wenpeng Yin
Marie-Francine Moens
OffRL
209
12
0
24 Jul 2022
Calibration Matters: Tackling Maximization Bias in Large-scale
  Advertising Recommendation Systems
Calibration Matters: Tackling Maximization Bias in Large-scale Advertising Recommendation SystemsInternational Conference on Learning Representations (ICLR), 2022
Yewen Fan
Nian Si
Kun Zhang
337
6
0
19 May 2022
Action Candidate Driven Clipped Double Q-learning for Discrete and
  Continuous Action Tasks
Action Candidate Driven Clipped Double Q-learning for Discrete and Continuous Action TasksIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Qianliang Wu
Jin Xie
Jian Yang
OffRL
179
29
0
22 Mar 2022
Action Candidate Based Clipped Double Q-learning for Discrete and
  Continuous Action Tasks
Action Candidate Based Clipped Double Q-learning for Discrete and Continuous Action TasksAAAI Conference on Artificial Intelligence (AAAI), 2021
Qianliang Wu
Jin Xie
Zhiqiang Wang
OffRL
223
19
0
03 May 2021
Regularized Softmax Deep Multi-Agent $Q$-Learning
Regularized Softmax Deep Multi-Agent QQQ-LearningNeural Information Processing Systems (NeurIPS), 2021
L. Pan
Tabish Rashid
Bei Peng
Longbo Huang
Shimon Whiteson
271
45
0
22 Mar 2021
Cross Learning in Deep Q-Networks
Cross Learning in Deep Q-Networks
Xing Wang
A. Vinel
99
3
0
29 Sep 2020
Controlling Overestimation Bias with Truncated Mixture of Continuous
  Distributional Quantile Critics
Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics
Arsenii Kuznetsov
Pavel Shvechikov
Alexander Grishin
Dmitry Vetrov
527
263
0
08 May 2020
Deep Reinforcement Learning with Weighted Q-Learning
Deep Reinforcement Learning with Weighted Q-Learning
Andrea Cini
Carlo DÉramo
Jan Peters
Cesare Alippi
OffRL
231
11
0
20 Mar 2020
Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling
Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling
E. Kaufmann
Wouter M. Koolen
Aurélien Garivier
246
28
0
04 Jun 2018
1
Page 1 of 1