ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.13203
  4. Cited By
A Finite Sample Complexity Bound for Distributionally Robust Q-learning

A Finite Sample Complexity Bound for Distributionally Robust Q-learning

26 February 2023
Shengbo Wang
Nian Si
Jose H. Blanchet
Zhengyuan Zhou
    OOD
    OffRL
ArXivPDFHTML

Papers citing "A Finite Sample Complexity Bound for Distributionally Robust Q-learning"

19 / 19 papers shown
Title
Efficient Learning for Entropy-Regularized Markov Decision Processes via Multilevel Monte Carlo
Efficient Learning for Entropy-Regularized Markov Decision Processes via Multilevel Monte Carlo
Matthieu Meunier
C. Reisinger
Yufei Zhang
34
0
0
27 Mar 2025
Planning and Learning in Average Risk-aware MDPs
Planning and Learning in Average Risk-aware MDPs
Weikai Wang
Erick Delage
48
0
0
22 Mar 2025
Learning a Single Neuron Robustly to Distributional Shifts and
  Adversarial Label Noise
Learning a Single Neuron Robustly to Distributional Shifts and Adversarial Label Noise
Shuyao Li
Sushrut Karmalkar
Ilias Diakonikolas
Jelena Diakonikolas
OOD
47
0
0
11 Nov 2024
Robust Q-Learning for finite ambiguity sets
Robust Q-Learning for finite ambiguity sets
Cécile Decker
Julian Sester
21
0
0
05 Jul 2024
Model-Free Robust Reinforcement Learning with Sample Complexity Analysis
Model-Free Robust Reinforcement Learning with Sample Complexity Analysis
Yudan Wang
Shaofeng Zou
Yue Wang
OOD
18
1
0
24 Jun 2024
Statistical Learning of Distributionally Robust Stochastic Control in
  Continuous State Spaces
Statistical Learning of Distributionally Robust Stochastic Control in Continuous State Spaces
Shengbo Wang
Nian Si
Jose H. Blanchet
Zhengyuan Zhou
24
0
0
17 Jun 2024
Continuous-time Risk-sensitive Reinforcement Learning via Quadratic
  Variation Penalty
Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty
Yanwei Jia
31
2
0
19 Apr 2024
Distributionally Robust Reinforcement Learning with Interactive Data
  Collection: Fundamental Hardness and Near-Optimal Algorithm
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm
Miao Lu
Han Zhong
Tong Zhang
Jose H. Blanchet
OffRL
OOD
68
4
0
04 Apr 2024
Sample Complexity of Offline Distributionally Robust Linear Markov
  Decision Processes
Sample Complexity of Offline Distributionally Robust Linear Markov Decision Processes
He Wang
Laixi Shi
Yuejie Chi
OffRL
29
6
0
19 Mar 2024
On the Foundation of Distributionally Robust Reinforcement Learning
On the Foundation of Distributionally Robust Reinforcement Learning
Shengbo Wang
Nian Si
Jose H. Blanchet
Zhengyuan Zhou
OffRL
24
16
0
15 Nov 2023
Distributionally Robust Model-based Reinforcement Learning with Large
  State Spaces
Distributionally Robust Model-based Reinforcement Learning with Large State Spaces
Shyam Sundhar Ramesh
Pier Giuseppe Sessa
Yifan Hu
Andreas Krause
Ilija Bogunovic
OOD
31
10
0
05 Sep 2023
Natural Actor-Critic for Robust Reinforcement Learning with Function
  Approximation
Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Ruida Zhou
Tao-Wen Liu
Min Cheng
D. Kalathil
P. R. Kumar
Chao Tian
35
19
0
17 Jul 2023
Seeing is not Believing: Robust Reinforcement Learning against Spurious
  Correlation
Seeing is not Believing: Robust Reinforcement Learning against Spurious Correlation
Wenhao Ding
Laixi Shi
Yuejie Chi
Ding Zhao
OOD
27
18
0
15 Jul 2023
Sample Complexity of Variance-reduced Distributionally Robust Q-learning
Sample Complexity of Variance-reduced Distributionally Robust Q-learning
Shengbo Wang
Nian Si
Jose H. Blanchet
Zhengyuan Zhou
OOD
13
12
0
28 May 2023
The Curious Price of Distributional Robustness in Reinforcement Learning
  with a Generative Model
The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model
Laixi Shi
Gen Li
Yuting Wei
Yuxin Chen
M. Geist
Yuejie Chi
OOD
25
23
0
26 May 2023
Double Pessimism is Provably Efficient for Distributionally Robust
  Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage
Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage
Jose H. Blanchet
Miao Lu
Tong Zhang
Han Zhong
OffRL
37
29
0
16 May 2023
Robust Markov Decision Processes without Model Estimation
Robust Markov Decision Processes without Model Estimation
Wenhao Yang
Hanfengzhai Wang
Tadashi Kozuno
S. Jordan
Zhihua Zhang
8
2
0
02 Feb 2023
Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous
  Unobserved Confounders
Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous Unobserved Confounders
David Bruns-Smith
Angela Zhou
OffRL
13
9
0
01 Feb 2023
CAD2RL: Real Single-Image Flight without a Single Real Image
CAD2RL: Real Single-Image Flight without a Single Real Image
Fereshteh Sadeghi
Sergey Levine
SSL
216
809
0
13 Nov 2016
1