Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.06851
Cited By
Policy Gradient Algorithms Implicitly Optimize by Continuation
11 May 2023
Adrien Bolland
Gilles Louppe
D. Ernst
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Policy Gradient Algorithms Implicitly Optimize by Continuation"
5 / 5 papers shown
Title
Learning Optimal Deterministic Policies with Stochastic Policy Gradients
Alessandro Montenegro
Marco Mussi
Alberto Maria Metelli
Matteo Papini
25
2
0
03 May 2024
Behind the Myth of Exploration in Policy Gradients
Adrien Bolland
Gaspard Lambrechts
Damien Ernst
24
0
0
31 Jan 2024
On learning history based policies for controlling Markov decision processes
Gandharv Patil
Aditya Mahajan
Doina Precup
OffRL
8
5
0
06 Nov 2022
On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces
Amrit Singh Bedi
Souradip Chakraborty
Anjaly Parayil
Brian M. Sadler
Pratap Tokekar
Alec Koppel
28
17
0
28 Jan 2022
Variational Optimization
J. Staines
David Barber
DRL
53
52
0
18 Dec 2012
1