v1v2 (latest)

Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound

International Conference on Machine Learning (ICML), 2019

24 May 2019

Papers citing "Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound"

50 / 226 papers shown

Title
Stability of Q-Learning Through Design and Optimism Sean P. Meyn 237 10 0 05 Jul 2023
Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDPInternational Conference on Machine Learning (ICML), 2023 Jiacheng Guo Zihao Li Huazheng Wang Mengdi Wang Zhuoran Yang Xuezhou Zhang 208 7 0 21 Jun 2023
On the Model-Misspecification in Reinforcement LearningInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023 Yunfan Li Lin F. Yang 254 6 0 19 Jun 2023
Low-Switching Policy Gradient with Exploration via Online Sensitivity SamplingInternational Conference on Machine Learning (ICML), 2023 Yunfan Li Yiran Wang Y. Cheng Lin F. Yang OffRL 189 6 0 15 Jun 2023
Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023 Amin Karbasi Nikki Lijing Kuang Yi-An Ma Siddharth Mitra OffRL 174 6 0 15 Jun 2023
Kernelized Reinforcement Learning with Order Optimal Regret BoundsNeural Information Processing Systems (NeurIPS), 2023 Sattar Vakili Julia Olkhovskaya 286 13 0 13 Jun 2023
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte CarloInternational Conference on Learning Representations (ICLR), 2023 Haque Ishfaq Qingfeng Lan Pan Xu A. R. Mahmood Doina Precup Anima Anandkumar Kamyar Azizzadenesheli BDL OffRL 300 27 0 29 May 2023
Matrix Estimation for Offline Reinforcement Learning with Low-Rank Structure Xumei Xi Chao Yu Yudong Chen OffRL 140 0 0 24 May 2023
What can online reinforcement learning with function approximation benefit from general coverage conditions?International Conference on Machine Learning (ICML), 2023 Fanghui Liu Luca Viano Volkan Cevher OffRL 223 4 0 25 Apr 2023
Provably Feedback-Efficient Reinforcement Learning via Active Reward LearningNeural Information Processing Systems (NeurIPS), 2023 Dingwen Kong Lin F. Yang 229 16 0 18 Apr 2023
Stochastic Nonlinear Control via Finite-dimensional Spectral Dynamic EmbeddingIEEE Conference on Decision and Control (CDC), 2023 Zhaolin Ren Tongzheng Ren Haitong Ma Na Li Bo Dai 273 12 0 08 Apr 2023
Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPsInternational Conference on Machine Learning (ICML), 2023 Junkai Zhang Weitong Zhang Quanquan Gu 251 5 0 17 Mar 2023
Variance-aware robust reinforcement learning with linear function approximation under heavy-tailed rewards Xiang Li Qiang Sun 217 9 0 09 Mar 2023
Finite-sample Guarantees for Nash Q-learning with Linear Function ApproximationConference on Uncertainty in Artificial Intelligence (UAI), 2023 Pedro Cisneros-Velarde Oluwasanmi Koyejo 226 1 0 01 Mar 2023
Optimistic Planning by Regularized Dynamic ProgrammingInternational Conference on Machine Learning (ICML), 2023 Antoine Moulin Gergely Neu 255 8 0 27 Feb 2023
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function ApproximationInternational Conference on Learning Representations (ICLR), 2023 Thanh Nguyen-Tang R. Arora OffRL 194 6 0 24 Feb 2023
Provably Efficient Reinforcement Learning via Surprise BoundInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023 Hanlin Zhu Ruosong Wang Jason D. Lee OffRL 151 5 0 22 Feb 2023
Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case RegretInternational Conference on Machine Learning (ICML), 2023 Han Zhong Jiachen Hu Yecheng Xue Tongyang Li Liwei Wang 205 11 0 21 Feb 2023
Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational EfficiencyAnnual Conference Computational Learning Theory (COLT), 2023 Heyang Zhao Jiafan He Dongruo Zhou Tong Zhang Quanquan Gu 239 37 0 21 Feb 2023
Improved Regret Bounds for Linear Adversarial MDPs via Linear Optimization Fang-yuan Kong Xiangcheng Zhang Baoxiang Wang Shuai Li 186 14 0 14 Feb 2023
Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function ApproximationAnnual Conference Computational Learning Theory (COLT), 2023 Qiwen Cui Jianchao Tan S. Du 315 28 0 07 Feb 2023
Online Reinforcement Learning with Uncertain Episode LengthsAAAI Conference on Artificial Intelligence (AAAI), 2023 Debmalya Mandal Goran Radanović Jiarui Gan Adish Singla R. Majumdar OffRL 163 9 0 07 Feb 2023
Sample Complexity of Kernel-Based Q-LearningInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023 Sing-Yuan Yeh Fu-Chieh Chang Chang-Wei Yueh Pei-Yuan Wu A. Bernacchia Sattar Vakili OffRL 259 6 0 01 Feb 2023
Learning in POMDPs is Sample-Efficient with Hindsight ObservabilityInternational Conference on Machine Learning (ICML), 2023 Jonathan Lee Alekh Agarwal Christoph Dann Tong Zhang 243 23 0 31 Jan 2023
Improved Regret for Efficient Online Reinforcement Learning with Linear Function ApproximationInternational Conference on Machine Learning (ICML), 2023 Uri Sherman Tomer Koren Yishay Mansour 309 13 0 30 Jan 2023
Refined Regret for Adversarial MDPs with Linear Function ApproximationInternational Conference on Machine Learning (ICML), 2023 Yan Dai Haipeng Luo Chen-Yu Wei Julian Zimmert 252 13 0 30 Jan 2023
STEERING: Stein Information Directed Exploration for Model-Based Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023 Souradip Chakraborty Amrit Singh Bedi Alec Koppel Mengdi Wang Furong Huang Dinesh Manocha 158 9 0 28 Jan 2023
Multi-Agent Congestion Cost Minimization With Linear Function ApproximationsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023 Prashant Trivedi N. Hemachandra 200 1 0 26 Jan 2023
Model-Based Reinforcement Learning with Multinomial Logistic Function ApproximationAAAI Conference on Artificial Intelligence (AAAI), 2022 Taehyun Hwang Min Hwan Oh 206 11 0 27 Dec 2022
Latent Variable Representation for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2022 Zhaolin Ren Chenjun Xiao Tianjun Zhang Na Li Zhaoran Wang Sujay Sanghavi Dale Schuurmans Bo Dai OffRL 202 11 0 17 Dec 2022
Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision ProcessesInternational Conference on Machine Learning (ICML), 2022 Jiafan He Heyang Zhao Dongruo Zhou Quanquan Gu OffRL 417 63 0 12 Dec 2022
Causal Deep Reinforcement Learning Using Observational DataInternational Joint Conference on Artificial Intelligence (IJCAI), 2022 Wenxuan Zhu Chao Yu Qiaosheng Zhang CML OffRL 159 8 0 28 Nov 2022
Linear Reinforcement Learning with Ball Structure Action SpaceInternational Conference on Algorithmic Learning Theory (ALT), 2022 Zeyu Jia Randy Jia Dhruv Madeka Dean Phillips Foster 134 1 0 14 Nov 2022
Leveraging Offline Data in Online Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022 Andrew Wagenmaker Aldo Pacchiano OffRL OnRL 235 44 0 09 Nov 2022
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample ComplexityNeural Information Processing Systems (NeurIPS), 2022 Abhishek Gupta Aldo Pacchiano Yuexiang Zhai Sham Kakade Sergey Levine OffRL 189 93 0 18 Oct 2022
Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration and PlanningAAAI Conference on Artificial Intelligence (AAAI), 2022 Reda Ouhamma D. Basu Odalric-Ambrym Maillard OffRL 162 12 0 05 Oct 2022
Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient Ming Yin Mengdi Wang Yu Wang OffRL 269 12 0 03 Oct 2022
A General Framework for Sample-Efficient Function Approximation in Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2022 Zixiang Chen C. J. Li An Yuan Quanquan Gu Michael I. Jordan OffRL 212 30 0 30 Sep 2022
Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022 Shen Zhang 134 6 0 16 Sep 2022
Understanding Deep Neural Function Approximation in Reinforcement Learning via $ε$ -Greedy ExplorationNeural Information Processing Systems (NeurIPS), 2022 Fanghui Liu Luca Viano Volkan Cevher 293 23 0 15 Sep 2022
Dynamic Regret of Online Markov Decision ProcessesInternational Conference on Machine Learning (ICML), 2022 Peng Zhao Longfei Li Zhi Zhou OffRL 184 19 0 26 Aug 2022
Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium C. J. Li Dongruo Zhou Quanquan Gu Sai Li 147 2 0 10 Aug 2022
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022 Delin Qu Lingxiao Wang Chenjia Bai Zhuoran Yang Zhaoran Wang SSL OffRL 372 32 0 29 Jul 2022
Making Linear MDPs Practical via Contrastive Representation LearningInternational Conference on Machine Learning (ICML), 2022 Tianjun Zhang Zhaolin Ren Mengjiao Yang Joseph E. Gonzalez Dale Schuurmans Bo Dai 197 52 0 14 Jul 2022
PAC Reinforcement Learning for Predictive State RepresentationsInternational Conference on Learning Representations (ICLR), 2022 Wenhao Zhan Masatoshi Uehara Wen Sun Jason D. Lee 344 43 0 12 Jul 2022
Model Selection in Reinforcement Learning with General Function Approximations Avishek Ghosh Sayak Ray Chowdhury 114 3 0 06 Jul 2022
Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization Kaixuan Huang Yuehua Wu Xuezhou Zhang Shenyinying Tu Qingyun Wu Mengdi Wang Huazheng Wang 106 1 0 29 Jun 2022
Joint Representation Training in Sequential Tasks with Shared Structure Aldo Pacchiano Ofir Nachum Nilseh Tripuraneni Peter L. Bartlett 243 5 0 24 Jun 2022
Provably Efficient Reinforcement Learning in Partially Observable Dynamical SystemsNeural Information Processing Systems (NeurIPS), 2022 Masatoshi Uehara Ayush Sekhari Jason D. Lee Nathan Kallus Wen Sun OffRL 271 40 0 24 Jun 2022
Nearly Minimax Optimal Reinforcement Learning with Linear Function ApproximationInternational Conference on Machine Learning (ICML), 2022 Pihe Hu Yu Chen Longbo Huang 411 36 0 23 Jun 2022