Nonparametric Stochastic Compositional Gradient Descent for Q-Learning in Continuous Markov Decision Problems

19 April 2018
Alec Koppel
Ekaterina V. Tolstaya
Ethan Stump
Alejandro Ribeiro
arXiv: 1804.07323

Papers citing "Nonparametric Stochastic Compositional Gradient Descent for Q-Learning in Continuous Markov Decision Problems"

5 / 5 papers shown
Matrix Low-Rank Trust Region Policy Optimization
Sergio Rozada
Antonio G. Marques
43
0
0
27 May 2024
Stability and Generalization of Stochastic Compositional Gradient Descent Algorithms
Minghao Yang
Xiyuan Wei
Tianbao Yang
Yiming Ying
42
1
0
07 Jul 2023
Policy Gradient using Weak Derivatives for Reinforcement Learning
Sujay Bhatt
Alec Koppel
Vikram Krishnamurthy
13
12
0
09 Apr 2020
On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation
Harshat Kumar
Alec Koppel
Alejandro Ribeiro
104
80
0
18 Oct 2019
Learning from Conditional Distributions via Dual Embeddings
Bo Dai
Niao He
Yunpeng Pan
Byron Boots
Le Song
35
21
0
15 Jul 2016