Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains

International Joint Conference on Artificial Intelligence (IJCAI), 2019

10 June 2019

Papers citing "Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains"

3 / 3 papers shown

139

17 May 2024

Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees

International Conference on Artificial Intelligence and Statistics (AISTATS), 2022

282

10 Dec 2022

165

26 Nov 2019