v1v2 (latest)
Regularization of Soft Actor-Critic Algorithms with Automatic
Temperature Adjustment
Abstract
This work presents a comprehensive analysis to regularize the Soft Actor-Critic (SAC) algorithm with automatic temperature adjustment. The the policy evaluation, the policy improvement and the temperature adjustment are reformulated, addressing certain modification and enhancing the clarity of the original theory in a more explicit manner.
View on arXivComments on this paper
