ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.06851
  4. Cited By
Policy Gradient Algorithms Implicitly Optimize by Continuation

Policy Gradient Algorithms Implicitly Optimize by Continuation

11 May 2023
Adrien Bolland
Gilles Louppe
D. Ernst
ArXivPDFHTML

Papers citing "Policy Gradient Algorithms Implicitly Optimize by Continuation"

5 / 5 papers shown
Title
Learning Optimal Deterministic Policies with Stochastic Policy Gradients
Learning Optimal Deterministic Policies with Stochastic Policy Gradients
Alessandro Montenegro
Marco Mussi
Alberto Maria Metelli
Matteo Papini
25
2
0
03 May 2024
Behind the Myth of Exploration in Policy Gradients
Behind the Myth of Exploration in Policy Gradients
Adrien Bolland
Gaspard Lambrechts
Damien Ernst
24
0
0
31 Jan 2024
On learning history based policies for controlling Markov decision
  processes
On learning history based policies for controlling Markov decision processes
Gandharv Patil
Aditya Mahajan
Doina Precup
OffRL
8
5
0
06 Nov 2022
On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces
On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces
Amrit Singh Bedi
Souradip Chakraborty
Anjaly Parayil
Brian M. Sadler
Pratap Tokekar
Alec Koppel
28
17
0
28 Jan 2022
Variational Optimization
Variational Optimization
J. Staines
David Barber
DRL
55
52
0
18 Dec 2012
1