Policy Gradient using Weak Derivatives for Reinforcement Learning

IEEE Conference on Decision and Control (CDC), 2019
9 April 2020
Sujay Bhatt
Alec Koppel
Vikram Krishnamurthy

Papers citing "Policy Gradient using Weak Derivatives for Reinforcement Learning"

6 / 6 papers shown
Behind the Myth of Exploration in Policy Gradients
Adrien Bolland
Gaspard Lambrechts
Damien Ernst
474
3
0
31 Jan 2024
A Fisher-Rao gradient flow for entropy-regularised Markov decision processes in Polish spaces
B. Kerimkulov
J. Leahy
David Siska
Lukasz Szpruch
Yufei Zhang
459
17
0
04 Oct 2023
An Analysis of Measure-Valued Derivatives for Policy Gradients
João Carvalho
Jan Peters
OffRL
98
0
0
08 Mar 2022
On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces
International Conference on Machine Learning (ICML), 2022
Amrit Singh Bedi
Souradip Chakraborty
Anjaly Parayil
Brian M Sadler
Erfaun Noorani
Alec Koppel
394
20
0
28 Jan 2022
An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients
IEEE International Joint Conference on Neural Network (IJCNN), 2021
João Carvalho
Davide Tateo
Fabio Muratore
Jan Peters
OffRL
157
7
0
20 Jul 2021
On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control
Amrit Singh Bedi
Anjaly Parayil
Junyu Zhang
Mengdi Wang
Alec Koppel
222
20
0
15 Jun 2021