Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1905.10389
Cited By
v1
v2 (latest)
Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound
International Conference on Machine Learning (ICML), 2019
24 May 2019
Lin F. Yang
Mengdi Wang
OffRL
GP
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound"
50 / 226 papers shown
Title
Stability of Q-Learning Through Design and Optimism
Sean P. Meyn
229
10
0
05 Jul 2023
Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP
International Conference on Machine Learning (ICML), 2023
Jiacheng Guo
Zihao Li
Huazheng Wang
Mengdi Wang
Zhuoran Yang
Xuezhou Zhang
200
7
0
21 Jun 2023
On the Model-Misspecification in Reinforcement Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Yunfan Li
Lin F. Yang
254
6
0
19 Jun 2023
Low-Switching Policy Gradient with Exploration via Online Sensitivity Sampling
International Conference on Machine Learning (ICML), 2023
Yunfan Li
Yiran Wang
Y. Cheng
Lin F. Yang
OffRL
189
6
0
15 Jun 2023
Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Amin Karbasi
Nikki Lijing Kuang
Yi-An Ma
Siddharth Mitra
OffRL
174
6
0
15 Jun 2023
Kernelized Reinforcement Learning with Order Optimal Regret Bounds
Neural Information Processing Systems (NeurIPS), 2023
Sattar Vakili
Julia Olkhovskaya
286
13
0
13 Jun 2023
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
International Conference on Learning Representations (ICLR), 2023
Haque Ishfaq
Qingfeng Lan
Pan Xu
A. R. Mahmood
Doina Precup
Anima Anandkumar
Kamyar Azizzadenesheli
BDL
OffRL
296
27
0
29 May 2023
Matrix Estimation for Offline Reinforcement Learning with Low-Rank Structure
Xumei Xi
Chao Yu
Yudong Chen
OffRL
140
0
0
24 May 2023
What can online reinforcement learning with function approximation benefit from general coverage conditions?
International Conference on Machine Learning (ICML), 2023
Fanghui Liu
Luca Viano
Volkan Cevher
OffRL
223
4
0
25 Apr 2023
Provably Feedback-Efficient Reinforcement Learning via Active Reward Learning
Neural Information Processing Systems (NeurIPS), 2023
Dingwen Kong
Lin F. Yang
225
16
0
18 Apr 2023
Stochastic Nonlinear Control via Finite-dimensional Spectral Dynamic Embedding
IEEE Conference on Decision and Control (CDC), 2023
Zhaolin Ren
Tongzheng Ren
Haitong Ma
Na Li
Bo Dai
273
12
0
08 Apr 2023
Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs
International Conference on Machine Learning (ICML), 2023
Junkai Zhang
Weitong Zhang
Quanquan Gu
175
5
0
17 Mar 2023
Variance-aware robust reinforcement learning with linear function approximation under heavy-tailed rewards
Xiang Li
Qiang Sun
217
9
0
09 Mar 2023
Finite-sample Guarantees for Nash Q-learning with Linear Function Approximation
Conference on Uncertainty in Artificial Intelligence (UAI), 2023
Pedro Cisneros-Velarde
Oluwasanmi Koyejo
226
1
0
01 Mar 2023
Optimistic Planning by Regularized Dynamic Programming
International Conference on Machine Learning (ICML), 2023
Antoine Moulin
Gergely Neu
255
8
0
27 Feb 2023
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation
International Conference on Learning Representations (ICLR), 2023
Thanh Nguyen-Tang
R. Arora
OffRL
194
6
0
24 Feb 2023
Provably Efficient Reinforcement Learning via Surprise Bound
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Hanlin Zhu
Ruosong Wang
Jason D. Lee
OffRL
151
5
0
22 Feb 2023
Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret
International Conference on Machine Learning (ICML), 2023
Han Zhong
Jiachen Hu
Yecheng Xue
Tongyang Li
Liwei Wang
197
11
0
21 Feb 2023
Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency
Annual Conference Computational Learning Theory (COLT), 2023
Heyang Zhao
Jiafan He
Dongruo Zhou
Tong Zhang
Quanquan Gu
235
37
0
21 Feb 2023
Improved Regret Bounds for Linear Adversarial MDPs via Linear Optimization
Fang-yuan Kong
Xiangcheng Zhang
Baoxiang Wang
Shuai Li
186
14
0
14 Feb 2023
Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation
Annual Conference Computational Learning Theory (COLT), 2023
Qiwen Cui
Jianchao Tan
S. Du
307
28
0
07 Feb 2023
Online Reinforcement Learning with Uncertain Episode Lengths
AAAI Conference on Artificial Intelligence (AAAI), 2023
Debmalya Mandal
Goran Radanović
Jiarui Gan
Adish Singla
R. Majumdar
OffRL
163
9
0
07 Feb 2023
Sample Complexity of Kernel-Based Q-Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Sing-Yuan Yeh
Fu-Chieh Chang
Chang-Wei Yueh
Pei-Yuan Wu
A. Bernacchia
Sattar Vakili
OffRL
259
6
0
01 Feb 2023
Learning in POMDPs is Sample-Efficient with Hindsight Observability
International Conference on Machine Learning (ICML), 2023
Jonathan Lee
Alekh Agarwal
Christoph Dann
Tong Zhang
231
23
0
31 Jan 2023
Improved Regret for Efficient Online Reinforcement Learning with Linear Function Approximation
International Conference on Machine Learning (ICML), 2023
Uri Sherman
Tomer Koren
Yishay Mansour
301
13
0
30 Jan 2023
Refined Regret for Adversarial MDPs with Linear Function Approximation
International Conference on Machine Learning (ICML), 2023
Yan Dai
Haipeng Luo
Chen-Yu Wei
Julian Zimmert
252
13
0
30 Jan 2023
STEERING: Stein Information Directed Exploration for Model-Based Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Souradip Chakraborty
Amrit Singh Bedi
Alec Koppel
Mengdi Wang
Furong Huang
Dinesh Manocha
158
9
0
28 Jan 2023
Multi-Agent Congestion Cost Minimization With Linear Function Approximations
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Prashant Trivedi
N. Hemachandra
200
1
0
26 Jan 2023
Model-Based Reinforcement Learning with Multinomial Logistic Function Approximation
AAAI Conference on Artificial Intelligence (AAAI), 2022
Taehyun Hwang
Min Hwan Oh
206
11
0
27 Dec 2022
Latent Variable Representation for Reinforcement Learning
International Conference on Learning Representations (ICLR), 2022
Zhaolin Ren
Chenjun Xiao
Tianjun Zhang
Na Li
Zhaoran Wang
Sujay Sanghavi
Dale Schuurmans
Bo Dai
OffRL
202
11
0
17 Dec 2022
Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes
International Conference on Machine Learning (ICML), 2022
Jiafan He
Heyang Zhao
Dongruo Zhou
Quanquan Gu
OffRL
353
63
0
12 Dec 2022
Causal Deep Reinforcement Learning Using Observational Data
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Wenxuan Zhu
Chao Yu
Qiaosheng Zhang
CML
OffRL
151
8
0
28 Nov 2022
Linear Reinforcement Learning with Ball Structure Action Space
International Conference on Algorithmic Learning Theory (ALT), 2022
Zeyu Jia
Randy Jia
Dhruv Madeka
Dean Phillips Foster
130
1
0
14 Nov 2022
Leveraging Offline Data in Online Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Andrew Wagenmaker
Aldo Pacchiano
OffRL
OnRL
235
44
0
09 Nov 2022
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity
Neural Information Processing Systems (NeurIPS), 2022
Abhishek Gupta
Aldo Pacchiano
Yuexiang Zhai
Sham Kakade
Sergey Levine
OffRL
189
92
0
18 Oct 2022
Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration and Planning
AAAI Conference on Artificial Intelligence (AAAI), 2022
Reda Ouhamma
D. Basu
Odalric-Ambrym Maillard
OffRL
158
12
0
05 Oct 2022
Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient
Ming Yin
Mengdi Wang
Yu Wang
OffRL
269
12
0
03 Oct 2022
A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning
International Conference on Learning Representations (ICLR), 2022
Zixiang Chen
C. J. Li
An Yuan
Quanquan Gu
Michael I. Jordan
OffRL
212
30
0
30 Sep 2022
Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Shen Zhang
134
6
0
16 Sep 2022
Understanding Deep Neural Function Approximation in Reinforcement Learning via
ε
ε
ε
-Greedy Exploration
Neural Information Processing Systems (NeurIPS), 2022
Fanghui Liu
Luca Viano
Volkan Cevher
285
23
0
15 Sep 2022
Dynamic Regret of Online Markov Decision Processes
International Conference on Machine Learning (ICML), 2022
Peng Zhao
Longfei Li
Zhi Zhou
OffRL
184
19
0
26 Aug 2022
Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium
C. J. Li
Dongruo Zhou
Quanquan Gu
Sai Li
139
2
0
10 Aug 2022
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
International Conference on Machine Learning (ICML), 2022
Delin Qu
Lingxiao Wang
Chenjia Bai
Zhuoran Yang
Zhaoran Wang
SSL
OffRL
372
32
0
29 Jul 2022
Making Linear MDPs Practical via Contrastive Representation Learning
International Conference on Machine Learning (ICML), 2022
Tianjun Zhang
Zhaolin Ren
Mengjiao Yang
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
181
52
0
14 Jul 2022
PAC Reinforcement Learning for Predictive State Representations
International Conference on Learning Representations (ICLR), 2022
Wenhao Zhan
Masatoshi Uehara
Wen Sun
Jason D. Lee
328
43
0
12 Jul 2022
Model Selection in Reinforcement Learning with General Function Approximations
Avishek Ghosh
Sayak Ray Chowdhury
114
3
0
06 Jul 2022
Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization
Kaixuan Huang
Yuehua Wu
Xuezhou Zhang
Shenyinying Tu
Qingyun Wu
Mengdi Wang
Huazheng Wang
98
1
0
29 Jun 2022
Joint Representation Training in Sequential Tasks with Shared Structure
Aldo Pacchiano
Ofir Nachum
Nilseh Tripuraneni
Peter L. Bartlett
235
5
0
24 Jun 2022
Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems
Neural Information Processing Systems (NeurIPS), 2022
Masatoshi Uehara
Ayush Sekhari
Jason D. Lee
Nathan Kallus
Wen Sun
OffRL
259
40
0
24 Jun 2022
Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation
International Conference on Machine Learning (ICML), 2022
Pihe Hu
Yu Chen
Longbo Huang
311
36
0
23 Jun 2022
Previous
1
2
3
4
5
Next