Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1901.00137
Cited By
A Theoretical Analysis of Deep Q-Learning
1 January 2019
Jianqing Fan
Zhuoran Yang
Yuchen Xie
Zhaoran Wang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Theoretical Analysis of Deep Q-Learning"
29 / 79 papers shown
Title
On the Complexity of Computing Markov Perfect Equilibrium in General-Sum Stochastic Games
Xiaotie Deng
Ningyuan Li
D. Mguni
Jun Wang
Yaodong Yang
21
46
0
04 Sep 2021
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning
Andrea Zanette
Martin J. Wainwright
Emma Brunskill
OffRL
29
111
0
19 Aug 2021
Towards General Function Approximation in Zero-Sum Markov Games
Baihe Huang
Jason D. Lee
Zhaoran Wang
Zhuoran Yang
33
47
0
30 Jul 2021
Deeply-Debiased Off-Policy Interval Estimation
C. Shi
Runzhe Wan
Victor Chernozhukov
R. Song
OffRL
17
35
0
10 May 2021
Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning
Yaqi Duan
Chi Jin
Zhiyuan Li
OffRL
20
47
0
25 Mar 2021
Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis
Gen Li
Changxiao Cai
Ee
Yuting Wei
Yuejie Chi
OffRL
48
75
0
12 Feb 2021
A Lyapunov Theory for Finite-Sample Guarantees of Asynchronous Q-Learning and TD-Learning Variants
Zaiwei Chen
S. T. Maguluri
Sanjay Shakkottai
Karthikeyan Shanmugam
OffRL
89
53
0
02 Feb 2021
Fast Rates for the Regret of Offline Reinforcement Learning
Yichun Hu
Nathan Kallus
Masatoshi Uehara
OffRL
11
30
0
31 Jan 2021
A novel policy for pre-trained Deep Reinforcement Learning for Speech Emotion Recognition
Thejan Rajapakshe
R. Rana
Sara Khalifa
Björn W. Schuller
Jiajun Liu
OffRL
21
11
0
04 Jan 2021
Is Pessimism Provably Efficient for Offline RL?
Ying Jin
Zhuoran Yang
Zhaoran Wang
OffRL
27
346
0
30 Dec 2020
Logistic Q-Learning
Joan Bas-Serrano
Sebastian Curi
Andreas Krause
Gergely Neu
14
40
0
21 Oct 2020
Deep Reinforcement Learning for Adaptive Network Slicing in 5G for Intelligent Vehicular Systems and Smart Cities
A. Nassar
Y. Yilmaz
AI4CE
19
55
0
19 Oct 2020
Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
Zuyue Fu
Zhuoran Yang
Zhaoran Wang
15
42
0
02 Aug 2020
The Kolmogorov-Arnold representation theorem revisited
Johannes Schmidt-Hieber
30
125
0
31 Jul 2020
Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity
Kaipeng Zhang
Sham Kakade
Tamer Bacsar
Lin F. Yang
47
119
0
15 Jul 2020
Breaking the Curse of Many Agents: Provable Mean Embedding Q-Iteration for Mean-Field Reinforcement Learning
Lingxiao Wang
Zhuoran Yang
Zhaoran Wang
27
26
0
21 Jun 2020
Breaking the Sample Size Barrier in Model-Based Reinforcement Learning with a Generative Model
Gen Li
Yuting Wei
Yuejie Chi
Yuxin Chen
34
124
0
26 May 2020
Deep Reinforcement Learning for QoS-Constrained Resource Allocation in Multiservice Networks
J. Saraiva
I. M. Braga
V. F. Monteiro
F. Lima
T. Maciel
W. Freitas
F. Cavalcanti
6
9
0
03 Mar 2020
On Reinforcement Learning for Turn-based Zero-sum Markov Games
Devavrat Shah
Varun Somani
Qiaomin Xie
Zhi Xu
13
11
0
25 Feb 2020
Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium
Qiaomin Xie
Yudong Chen
Zhaoran Wang
Zhuoran Yang
39
124
0
17 Feb 2020
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kaipeng Zhang
Zhuoran Yang
Tamer Basar
55
1,181
0
24 Nov 2019
Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy
Boyi Liu
Qi Cai
Zhuoran Yang
Zhaoran Wang
24
108
0
25 Jun 2019
Feature-Based Q-Learning for Two-Player Stochastic Games
Zeyu Jia
Lin F. Yang
Mengdi Wang
11
45
0
02 Jun 2019
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games
Kaipeng Zhang
Zhuoran Yang
Tamer Basar
27
125
0
31 May 2019
Non-Asymptotic Analysis of Monte Carlo Tree Search
Devavrat Shah
Qiaomin Xie
Zhi Xu
11
9
0
14 Feb 2019
Finite-Sample Analysis for SARSA with Linear Function Approximation
Shaofeng Zou
Tengyu Xu
Yingbin Liang
13
146
0
06 Feb 2019
VIREL: A Variational Inference Framework for Reinforcement Learning
M. Fellows
Anuj Mahajan
Tim G. J. Rudner
Shimon Whiteson
DRL
32
53
0
03 Nov 2018
Norm-Based Capacity Control in Neural Networks
Behnam Neyshabur
Ryota Tomioka
Nathan Srebro
127
577
0
27 Feb 2015
Q-learning with censored data
Y. Goldberg
Michael R. Kosorok
OffRL
57
137
0
30 May 2012
Previous
1
2