Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1812.00456
Cited By
Revisiting the Softmax Bellman Operator: New Benefits and New Perspective
2 December 2018
Zhao Song
Ronald E. Parr
Lawrence Carin
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Revisiting the Softmax Bellman Operator: New Benefits and New Perspective"
2 / 2 papers shown
Title
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Litian Liang
Yaosheng Xu
Stephen Marcus McAleer
Dailin Hu
Alexander Ihler
Pieter Abbeel
Roy Fox
OOD
32
17
0
16 Sep 2022
VIREL: A Variational Inference Framework for Reinforcement Learning
M. Fellows
Anuj Mahajan
Tim G. J. Rudner
Shimon Whiteson
DRL
38
54
0
03 Nov 2018
1