ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1812.00456
  4. Cited By
Revisiting the Softmax Bellman Operator: New Benefits and New
  Perspective

Revisiting the Softmax Bellman Operator: New Benefits and New Perspective

2 December 2018
Zhao Song
Ronald E. Parr
Lawrence Carin
ArXivPDFHTML

Papers citing "Revisiting the Softmax Bellman Operator: New Benefits and New Perspective"

2 / 2 papers shown
Title
Reducing Variance in Temporal-Difference Value Estimation via Ensemble
  of Deep Networks
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Litian Liang
Yaosheng Xu
Stephen Marcus McAleer
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
OOD
32
17
0
16 Sep 2022
VIREL: A Variational Inference Framework for Reinforcement Learning
VIREL: A Variational Inference Framework for Reinforcement Learning
M. Fellows
Anuj Mahajan
Tim G. J. Rudner
Shimon Whiteson
DRL
38
54
0
03 Nov 2018
1