ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1504.06043
  4. Cited By
Stability of Stochastic Approximations with `Controlled Markov' Noise
  and Temporal Difference Learning
v1v2 (latest)

Stability of Stochastic Approximations with `Controlled Markov' Noise and Temporal Difference Learning

23 April 2015
Arunselvan Ramaswamy
S. Bhatnagar
ArXiv (abs)PDFHTML

Papers citing "Stability of Stochastic Approximations with `Controlled Markov' Noise and Temporal Difference Learning"

7 / 7 papers shown
Title
STOPS: Short-Term-based Volatility-controlled Policy Search and its
  Global Convergence
STOPS: Short-Term-based Volatility-controlled Policy Search and its Global Convergence
Liang Xu
Daoming Lyu
Yangchen Pan
Aiwen Jiang
Bo Liu
95
0
0
24 Jan 2022
Schedule Based Temporal Difference Algorithms
Schedule Based Temporal Difference Algorithms
Rohan Deb
Meet Gandhi
S. Bhatnagar
28
0
0
23 Nov 2021
Gradient Temporal Difference with Momentum: Stability and Convergence
Gradient Temporal Difference with Momentum: Stability and Convergence
Rohan Deb
S. Bhatnagar
55
5
0
22 Nov 2021
The ODE Method for Asymptotic Statistics in Stochastic Approximation and
  Reinforcement Learning
The ODE Method for Asymptotic Statistics in Stochastic Approximation and Reinforcement Learning
Vivek Borkar
Shuhang Chen
Adithya M. Devraj
Ioannis Kontoyiannis
Sean P. Meyn
79
32
0
27 Oct 2021
Zap Q-Learning With Nonlinear Function Approximation
Zap Q-Learning With Nonlinear Function Approximation
Shuhang Chen
Adithya M. Devraj
Fan Lu
Ana Bušić
Sean P. Meyn
67
20
0
11 Oct 2019
Two Time-scale Off-Policy TD Learning: Non-asymptotic Analysis over
  Markovian Samples
Two Time-scale Off-Policy TD Learning: Non-asymptotic Analysis over Markovian Samples
Tengyu Xu
Shaofeng Zou
Yingbin Liang
76
73
0
26 Sep 2019
Finite-Sample Analysis of Nonlinear Stochastic Approximation with
  Applications in Reinforcement Learning
Finite-Sample Analysis of Nonlinear Stochastic Approximation with Applications in Reinforcement Learning
Zaiwei Chen
Sheng Zhang
Thinh T. Doan
John-Paul Clarke
S. T. Maguluri
172
59
0
27 May 2019
1