ResearchTrend.AI
Exploration in Structured Reinforcement Learning

3 June 2018
Jungseul Ok
Alexandre Proutière
Damianos Tranos

Papers citing "Exploration in Structured Reinforcement Learning"

14 / 14 papers shown
Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition
Zhong Zheng
Haochen Zhang
Lingzhou Xue
OffRL
78
2
0
10 Oct 2024
Reduce, Reuse, Recycle: Categories for Compositional Reinforcement Learning
Georgios Bakirtzis
M. Savvas
Ruihan Zhao
Sandeep P. Chinchali
Ufuk Topcu
40
2
0
23 Aug 2024
Blending Data-Driven Priors in Dynamic Games
Justin Lidard
Haimin Hu
Asher Hancock
Zixu Zhang
Albert Gimó Contreras
...
Deepak Gopinath
Guy Rosman
Naomi Ehrich Leonard
María Santos
J. F. Fisac
OffRL
43
5
0
21 Feb 2024
Categorical semantics of compositional reinforcement learning
Georgios Bakirtzis
M. Savvas
Ufuk Topcu
CoGe
40
4
0
29 Aug 2022
Logarithmic regret bounds for continuous-time average-reward Markov decision processes
Xuefeng Gao
X. Zhou
39
8
0
23 May 2022
Offline Reinforcement Learning Under Value and Density-Ratio Realizability: The Power of Gaps
Jinglin Chen
Nan Jiang
OffRL
21
33
0
25 Mar 2022
Gap-Dependent Unsupervised Exploration for Reinforcement Learning
Jingfeng Wu
Vladimir Braverman
Lin F. Yang
30
12
0
11 Aug 2021
Improved Corruption Robust Algorithms for Episodic Reinforcement Learning
Yifang Chen
S. Du
Kevin G. Jamieson
24
22
0
13 Feb 2021
Task-Optimal Exploration in Linear Dynamical Systems
Andrew Wagenmaker
Max Simchowitz
Kevin G. Jamieson
14
18
0
10 Feb 2021
$Q$-learning with Logarithmic Regret
Kunhe Yang
Lin F. Yang
S. Du
43
59
0
16 Jun 2020
Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic Optimality
Kwang-Sung Jun
Chicheng Zhang
10
10
0
15 Jun 2020
Adaptive Exploration in Linear Contextual Bandit
Botao Hao
Tor Lattimore
Csaba Szepesvári
14
73
0
15 Oct 2019
From self-tuning regulators to reinforcement learning and back again
Nikolai Matni
Alexandre Proutière
Anders Rantzer
Stephen Tu
19
88
0
27 Jun 2019
Regret Bounds for Reinforcement Learning via Markov Chain Concentration
R. Ortner
22
46
0
06 Aug 2018