Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1902.10250
Cited By
Diagnosing Bottlenecks in Deep Q-learning Algorithms
26 February 2019
Justin Fu
Aviral Kumar
Matthew Soh
Sergey Levine
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Diagnosing Bottlenecks in Deep Q-learning Algorithms"
22 / 22 papers shown
Title
Monitor and Recover: A Paradigm for Future Research on Distribution Shift in Learning-Enabled Cyber-Physical Systems
Vivian Lin
Insup Lee
33
0
0
18 Apr 2025
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
Kun Wu
Yinuo Zhao
Zhihao Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feiferi Feng
OffRL
100
1
0
22 Dec 2024
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
22
2
0
08 Nov 2024
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation
Jaehyun Park
Yunho Kim
Sejin Kim
Byung-Jun Lee
Sundong Kim
OffRL
30
1
0
15 Oct 2024
Domain Adaptation for Offline Reinforcement Learning with Limited Samples
Weiqin Chen
Sandipan Mishra
Santiago Paternain
OffRL
40
2
0
22 Aug 2024
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
Toshinori Kitamura
Tadashi Kozuno
Yunhao Tang
Nino Vieillard
Michal Valko
...
Olivier Pietquin
M. Geist
Csaba Szepesvári
Wataru Kumagai
Yutaka Matsuo
OffRL
30
2
0
22 May 2023
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
Ghada Sokar
Rishabh Agarwal
Pablo Samuel Castro
Utku Evci
CLL
51
88
0
24 Feb 2023
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation
Chengqian Gao
Kelvin Xu
Liu Liu
Deheng Ye
P. Zhao
Zhiqiang Xu
OffRL
37
2
0
19 Oct 2022
Multi-objective Optimization of Notifications Using Offline Reinforcement Learning
Prakruthi Prabhakar
Yiping Yuan
Guangyu Yang
Wensheng Sun
A. Muralidharan
OffRL
28
6
0
07 Jul 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Scott Fujimoto
D. Meger
Doina Precup
Ofir Nachum
S. Gu
30
32
0
28 Jan 2022
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
Aviral Kumar
Rishabh Agarwal
Tengyu Ma
Aaron Courville
George Tucker
Sergey Levine
OffRL
31
65
0
09 Dec 2021
Convergence Results For Q-Learning With Experience Replay
Liran Szlak
Ohad Shamir
OffRL
29
5
0
08 Dec 2021
The Impact of Data Distribution on Q-learning with Function Approximation
Pedro P. Santos
Diogo S. Carvalho
A. Sardinha
Francisco S. Melo
OffRL
11
2
0
23 Nov 2021
On the Estimation Bias in Double Q-Learning
Zhizhou Ren
Guangxiang Zhu
Haotian Hu
Beining Han
Jian-Hai Chen
Chongjie Zhang
16
17
0
29 Sep 2021
Dual Behavior Regularized Reinforcement Learning
Chapman Siu
Jason M. Traish
R. Xu
OffRL
15
1
0
19 Sep 2021
Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment
Michael Chang
Sid Kaushik
Sergey Levine
Thomas L. Griffiths
28
8
0
28 Jun 2021
Reverb: A Framework For Experience Replay
Albin Cassirer
Gabriel Barth-Maron
E. Brevdo
Sabela Ramos
Toby Boyd
Thibault Sottiaux
M. Kroiss
VLM
OffRL
21
38
0
09 Feb 2021
COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning
Avi Singh
Albert Yu
Jonathan Yang
Jesse Zhang
Aviral Kumar
Sergey Levine
SSL
OffRL
OnRL
35
103
0
27 Oct 2020
Learning Off-Policy with Online Planning
Harshit S. Sikchi
Wenxuan Zhou
David Held
OffRL
31
45
0
23 Aug 2020
Trade-off on Sim2Real Learning: Real-world Learning Faster than Simulations
Jingyi Huang
Yizheng Zhang
F. Giardina
A. Rosendo
OffRL
22
1
0
21 Jul 2020
Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization
Jianhao Wang
Zhizhou Ren
Beining Han
Jianing Ye
Chongjie Zhang
OffRL
21
32
0
31 May 2020
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics
Denis Steckelmacher
Hélène Plisnier
D. Roijers
A. Nowé
OffRL
18
17
0
11 Mar 2019
1