Exploration in Structured Reinforcement Learning

Exploration in Structured Reinforcement Learning

3 June 2018

Jungseul Ok

Alexandre Proutière

Damianos Tranos

Papers citing "Exploration in Structured Reinforcement Learning"

14 / 14 papers shown

Title
Gap-Dependent Bounds for Q-Learning using Reference-Advantage Decomposition Zhong Zheng Haochen Zhang Lingzhou Xue OffRL 78 2 0 10 Oct 2024
Reduce, Reuse, Recycle: Categories for Compositional Reinforcement Learning Georgios Bakirtzis M. Savvas Ruihan Zhao Sandeep P. Chinchali Ufuk Topcu 40 2 0 23 Aug 2024
Blending Data-Driven Priors in Dynamic Games Justin Lidard Haimin Hu Asher Hancock Zixu Zhang Albert Gimó Contreras ... Deepak Gopinath Guy Rosman Naomi Ehrich Leonard María Santos J. F. Fisac OffRL 43 5 0 21 Feb 2024
Categorical semantics of compositional reinforcement learning Georgios Bakirtzis M. Savvas Ufuk Topcu CoGe 40 4 0 29 Aug 2022
Logarithmic regret bounds for continuous-time average-reward Markov decision processes Xuefeng Gao X. Zhou 39 8 0 23 May 2022
Offline Reinforcement Learning Under Value and Density-Ratio Realizability: The Power of Gaps Jinglin Chen Nan Jiang OffRL 21 33 0 25 Mar 2022
Gap-Dependent Unsupervised Exploration for Reinforcement Learning Jingfeng Wu Vladimir Braverman Lin F. Yang 30 12 0 11 Aug 2021
Improved Corruption Robust Algorithms for Episodic Reinforcement Learning Yifang Chen S. Du Kevin G. Jamieson 24 22 0 13 Feb 2021
Task-Optimal Exploration in Linear Dynamical Systems Andrew Wagenmaker Max Simchowitz Kevin G. Jamieson 14 18 0 10 Feb 2021
$Q$ -learning with Logarithmic Regret Kunhe Yang Lin F. Yang S. Du 43 59 0 16 Jun 2020
Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic Optimality Kwang-Sung Jun Chicheng Zhang 10 10 0 15 Jun 2020
Adaptive Exploration in Linear Contextual Bandit Botao Hao Tor Lattimore Csaba Szepesvári 14 73 0 15 Oct 2019
From self-tuning regulators to reinforcement learning and back again Nikolai Matni Alexandre Proutière Anders Rantzer Stephen Tu 19 88 0 27 Jun 2019
Regret Bounds for Reinforcement Learning via Markov Chain Concentration R. Ortner 22 46 0 06 Aug 2018