ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.03900
  4. Cited By
Q-learning with Nearest Neighbors

Q-learning with Nearest Neighbors

12 February 2018
Devavrat Shah
Qiaomin Xie
    OffRL
ArXivPDFHTML

Papers citing "Q-learning with Nearest Neighbors"

19 / 19 papers shown
Title
Convergence Rates for Stochastic Approximation: Biased Noise with
  Unbounded Variance, and Applications
Convergence Rates for Stochastic Approximation: Biased Noise with Unbounded Variance, and Applications
Rajeeva Laxman Karandikar
M. Vidyasagar
30
8
0
05 Dec 2023
Memory-Consistent Neural Networks for Imitation Learning
Memory-Consistent Neural Networks for Imitation Learning
Kaustubh Sridhar
Souradeep Dutta
Dinesh Jayaraman
James Weimer
Insup Lee
44
8
0
09 Oct 2023
q-Learning in Continuous Time
q-Learning in Continuous Time
Yanwei Jia
X. Zhou
OffRL
51
70
0
02 Jul 2022
The Efficacy of Pessimism in Asynchronous Q-Learning
The Efficacy of Pessimism in Asynchronous Q-Learning
Yuling Yan
Gen Li
Yuxin Chen
Jianqing Fan
OffRL
78
40
0
14 Mar 2022
A Statistical Analysis of Polyak-Ruppert Averaged Q-learning
A Statistical Analysis of Polyak-Ruppert Averaged Q-learning
Xiang Li
Wenhao Yang
Jiadong Liang
Zhihua Zhang
Michael I. Jordan
45
15
0
29 Dec 2021
The Surprising Effectiveness of Representation Learning for Visual
  Imitation
The Surprising Effectiveness of Representation Learning for Visual Imitation
Jyothish Pari
Nur Muhammad (Mahi) Shafiullah
Sridhar Pandian Arunachalam
Lerrel Pinto
SSL
25
161
0
02 Dec 2021
Q-Learning for MDPs with General Spaces: Convergence and Near Optimality
  via Quantization under Weak Continuity
Q-Learning for MDPs with General Spaces: Convergence and Near Optimality via Quantization under Weak Continuity
A. D. Kara
Naci Saldi
S. Yüksel
38
27
0
12 Nov 2021
Adaptive Discretization in Online Reinforcement Learning
Adaptive Discretization in Online Reinforcement Learning
Sean R. Sinclair
Siddhartha Banerjee
Chao Yu
OffRL
45
15
0
29 Oct 2021
Coarse-Grained Smoothness for RL in Metric Spaces
Coarse-Grained Smoothness for RL in Metric Spaces
Giorgio Giannone
Kavosh Asadi
Cameron Allen
Sam Lobel
George Konidaris
Michael Littman
47
3
0
23 Oct 2021
Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis
Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis
Gen Li
Changxiao Cai
Ee
Yuting Wei
Yuejie Chi
OffRL
55
75
0
12 Feb 2021
Finite-Time Analysis for Double Q-learning
Finite-Time Analysis for Double Q-learning
Huaqing Xiong
Linna Zhao
Yingbin Liang
Wei Zhang
25
31
0
29 Sep 2020
Adaptive Discretization for Model-Based Reinforcement Learning
Adaptive Discretization for Model-Based Reinforcement Learning
Sean R. Sinclair
Tianyu Wang
Gauri Jain
Siddhartha Banerjee
Chao Yu
OffRL
19
21
0
01 Jul 2020
Multi-Agent Reinforcement Learning in Stochastic Networked Systems
Multi-Agent Reinforcement Learning in Stochastic Networked Systems
Yiheng Lin
Guannan Qu
Longbo Huang
Adam Wierman
34
38
0
11 Jun 2020
Breaking the Sample Size Barrier in Model-Based Reinforcement Learning
  with a Generative Model
Breaking the Sample Size Barrier in Model-Based Reinforcement Learning with a Generative Model
Gen Li
Yuting Wei
Yuejie Chi
Yuxin Chen
34
125
0
26 May 2020
On Reinforcement Learning for Turn-based Zero-sum Markov Games
On Reinforcement Learning for Turn-based Zero-sum Markov Games
Devavrat Shah
Varun Somani
Qiaomin Xie
Zhi Xu
21
11
0
25 Feb 2020
Mean-Field Controls with Q-learning for Cooperative MARL: Convergence
  and Complexity Analysis
Mean-Field Controls with Q-learning for Cooperative MARL: Convergence and Complexity Analysis
Haotian Gu
Xin Guo
Xiaoli Wei
Renyuan Xu
32
65
0
10 Feb 2020
Learning When-to-Treat Policies
Learning When-to-Treat Policies
Xinkun Nie
Emma Brunskill
Stefan Wager
CML
OffRL
11
89
0
23 May 2019
Non-Asymptotic Analysis of Monte Carlo Tree Search
Non-Asymptotic Analysis of Monte Carlo Tree Search
Devavrat Shah
Qiaomin Xie
Zhi Xu
19
9
0
14 Feb 2019
Finite-Sample Analysis for SARSA with Linear Function Approximation
Finite-Sample Analysis for SARSA with Linear Function Approximation
Shaofeng Zou
Tengyu Xu
Yingbin Liang
32
146
0
06 Feb 2019
1