ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.10250
  4. Cited By
Diagnosing Bottlenecks in Deep Q-learning Algorithms

Diagnosing Bottlenecks in Deep Q-learning Algorithms

26 February 2019
Justin Fu
Aviral Kumar
Matthew Soh
Sergey Levine
    OffRL
ArXivPDFHTML

Papers citing "Diagnosing Bottlenecks in Deep Q-learning Algorithms"

22 / 22 papers shown
Title
Monitor and Recover: A Paradigm for Future Research on Distribution Shift in Learning-Enabled Cyber-Physical Systems
Monitor and Recover: A Paradigm for Future Research on Distribution Shift in Learning-Enabled Cyber-Physical Systems
Vivian Lin
Insup Lee
33
0
0
18 Apr 2025
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
Kun Wu
Yinuo Zhao
Zhihao Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feiferi Feng
OffRL
100
1
0
22 Dec 2024
Acceleration for Deep Reinforcement Learning using Parallel and
  Distributed Computing: A Survey
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
22
2
0
08 Nov 2024
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive
  Revaluation
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation
Jaehyun Park
Yunho Kim
Sejin Kim
Byung-Jun Lee
Sundong Kim
OffRL
30
1
0
15 Oct 2024
Domain Adaptation for Offline Reinforcement Learning with Limited Samples
Domain Adaptation for Offline Reinforcement Learning with Limited Samples
Weiqin Chen
Sandipan Mishra
Santiago Paternain
OffRL
40
2
0
22 Aug 2024
Regularization and Variance-Weighted Regression Achieves Minimax
  Optimality in Linear MDPs: Theory and Practice
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
Toshinori Kitamura
Tadashi Kozuno
Yunhao Tang
Nino Vieillard
Michal Valko
...
Olivier Pietquin
M. Geist
Csaba Szepesvári
Wataru Kumagai
Yutaka Matsuo
OffRL
30
2
0
22 May 2023
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
Ghada Sokar
Rishabh Agarwal
Pablo Samuel Castro
Utku Evci
CLL
51
88
0
24 Feb 2023
Robust Offline Reinforcement Learning with Gradient Penalty and
  Constraint Relaxation
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation
Chengqian Gao
Kelvin Xu
Liu Liu
Deheng Ye
P. Zhao
Zhiqiang Xu
OffRL
37
2
0
19 Oct 2022
Multi-objective Optimization of Notifications Using Offline
  Reinforcement Learning
Multi-objective Optimization of Notifications Using Offline Reinforcement Learning
Prakruthi Prabhakar
Yiping Yuan
Guangyu Yang
Wensheng Sun
A. Muralidharan
OffRL
28
6
0
07 Jul 2022
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement
  for Value Error
Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error
Scott Fujimoto
D. Meger
Doina Precup
Ofir Nachum
S. Gu
30
32
0
28 Jan 2022
DR3: Value-Based Deep Reinforcement Learning Requires Explicit
  Regularization
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
Aviral Kumar
Rishabh Agarwal
Tengyu Ma
Aaron Courville
George Tucker
Sergey Levine
OffRL
31
65
0
09 Dec 2021
Convergence Results For Q-Learning With Experience Replay
Convergence Results For Q-Learning With Experience Replay
Liran Szlak
Ohad Shamir
OffRL
29
5
0
08 Dec 2021
The Impact of Data Distribution on Q-learning with Function
  Approximation
The Impact of Data Distribution on Q-learning with Function Approximation
Pedro P. Santos
Diogo S. Carvalho
A. Sardinha
Francisco S. Melo
OffRL
11
2
0
23 Nov 2021
On the Estimation Bias in Double Q-Learning
On the Estimation Bias in Double Q-Learning
Zhizhou Ren
Guangxiang Zhu
Haotian Hu
Beining Han
Jian-Hai Chen
Chongjie Zhang
16
17
0
29 Sep 2021
Dual Behavior Regularized Reinforcement Learning
Dual Behavior Regularized Reinforcement Learning
Chapman Siu
Jason M. Traish
R. Xu
OffRL
15
1
0
19 Sep 2021
Modularity in Reinforcement Learning via Algorithmic Independence in
  Credit Assignment
Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment
Michael Chang
Sid Kaushik
Sergey Levine
Thomas L. Griffiths
28
8
0
28 Jun 2021
Reverb: A Framework For Experience Replay
Reverb: A Framework For Experience Replay
Albin Cassirer
Gabriel Barth-Maron
E. Brevdo
Sabela Ramos
Toby Boyd
Thibault Sottiaux
M. Kroiss
VLM
OffRL
21
38
0
09 Feb 2021
COG: Connecting New Skills to Past Experience with Offline Reinforcement
  Learning
COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning
Avi Singh
Albert Yu
Jonathan Yang
Jesse Zhang
Aviral Kumar
Sergey Levine
SSL
OffRL
OnRL
35
103
0
27 Oct 2020
Learning Off-Policy with Online Planning
Learning Off-Policy with Online Planning
Harshit S. Sikchi
Wenxuan Zhou
David Held
OffRL
31
45
0
23 Aug 2020
Trade-off on Sim2Real Learning: Real-world Learning Faster than
  Simulations
Trade-off on Sim2Real Learning: Real-world Learning Faster than Simulations
Jingyi Huang
Yizheng Zhang
F. Giardina
A. Rosendo
OffRL
22
1
0
21 Jul 2020
Towards Understanding Cooperative Multi-Agent Q-Learning with Value
  Factorization
Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization
Jianhao Wang
Zhizhou Ren
Beining Han
Jianing Ye
Chongjie Zhang
OffRL
21
32
0
31 May 2020
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy
  Critics
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics
Denis Steckelmacher
Hélène Plisnier
D. Roijers
A. Nowé
OffRL
18
17
0
11 Mar 2019
1