Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1905.10389
Cited By
v1
v2 (latest)
Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound
International Conference on Machine Learning (ICML), 2019
24 May 2019
Lin F. Yang
Mengdi Wang
OffRL
GP
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound"
50 / 226 papers shown
Title
Model-based RL with Optimistic Posterior Sampling: Structural Conditions and Sample Complexity
Neural Information Processing Systems (NeurIPS), 2022
Alekh Agarwal
Tong Zhang
256
26
0
15 Jun 2022
Overcoming the Long Horizon Barrier for Sample-Efficient Reinforcement Learning with Latent Low-Rank Structure
Measurement and Modeling of Computer Systems (SIGMETRICS), 2022
Tyler Sam
Yudong Chen
Chao Yu
OffRL
392
8
0
07 Jun 2022
Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization
Neural Information Processing Systems (NeurIPS), 2022
Hui Yuan
Chengzhuo Ni
Huazheng Wang
Xuezhou Zhang
Le Cong
Csaba Szepesvári
Mengdi Wang
156
2
0
05 Jun 2022
Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning
AAAI Conference on Artificial Intelligence (AAAI), 2022
Souradip Chakraborty
Amrit Singh Bedi
Alec Koppel
Brian M. Sadler
Furong Huang
Erfaun Noorani
Tianyi Zhou
213
10
0
02 Jun 2022
Stabilizing Q-learning with Linear Architectures for Provably Efficient Learning
International Conference on Machine Learning (ICML), 2022
Andrea Zanette
Martin J. Wainwright
OOD
266
5
0
01 Jun 2022
Byzantine-Robust Online and Offline Distributed Reinforcement Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Yiding Chen
Xuezhou Zhang
Jianchao Tan
Mengdi Wang
Xiaojin Zhu
OffRL
272
22
0
01 Jun 2022
Provable Benefits of Representational Transfer in Reinforcement Learning
Annual Conference Computational Learning Theory (COLT), 2022
Alekh Agarwal
Yuda Song
Wen Sun
Kaiwen Wang
Mengdi Wang
Xuezhou Zhang
OffRL
259
37
0
29 May 2022
Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPs
Neural Information Processing Systems (NeurIPS), 2022
Dongruo Zhou
Quanquan Gu
205
52
0
23 May 2022
Provably Efficient Kernelized Q-Learning
Shuang Liu
H. Su
MLT
241
4
0
21 Apr 2022
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency
International Conference on Machine Learning (ICML), 2022
Qi Cai
Zhuoran Yang
Zhaoran Wang
161
16
0
20 Apr 2022
Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling
Annual Conference Computational Learning Theory (COLT), 2022
Alekh Agarwal
Tong Zhang
155
9
0
15 Mar 2022
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism
International Conference on Learning Representations (ICLR), 2022
Ming Yin
Yaqi Duan
Mengdi Wang
Yu Wang
OffRL
228
68
0
11 Mar 2022
A Complete Characterization of Linear Estimators for Offline Policy Evaluation
Journal of machine learning research (JMLR), 2022
Juan C. Perdomo
A. Krishnamurthy
Peter L. Bartlett
Sham Kakade
OffRL
236
4
0
08 Mar 2022
Target Network and Truncation Overcome The Deadly Triad in
Q
Q
Q
-Learning
SIAM Journal on Mathematics of Data Science (SIMODS), 2022
Zaiwei Chen
John-Paul Clarke
S. T. Maguluri
179
26
0
05 Mar 2022
Learning Dynamic Mechanisms in Unknown Environments: A Reinforcement Learning Approach
Delin Qu
Boxiang Lyu
Qing-xin Meng
Zhaoran Wang
Zhuoran Yang
Sai Li
207
9
0
25 Feb 2022
Provably Efficient Primal-Dual Reinforcement Learning for CMDPs with Non-stationary Objectives and Constraints
Yuhao Ding
Javad Lavaei
315
12
0
28 Jan 2022
STOPS: Short-Term-based Volatility-controlled Policy Search and its Global Convergence
Liang Xu
Daoming Lyu
Yangchen Pan
Aiwen Jiang
Bo Liu
373
0
0
24 Jan 2022
Meta Learning MDPs with Linear Transition Models
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Robert Muller
Aldo Pacchiano
189
4
0
21 Jan 2022
Exponential Family Model-Based Reinforcement Learning via Score Matching
Neural Information Processing Systems (NeurIPS), 2021
Gen Li
Junbo Li
Anmol Kabra
Nathan Srebro
Zhaoran Wang
Zhuoran Yang
217
5
0
28 Dec 2021
Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic
Neural Information Processing Systems (NeurIPS), 2021
Yufeng Zhang
Siyu Chen
Zhuoran Yang
Sai Li
Zhaoran Wang
225
5
0
27 Dec 2021
Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopic Followers?
Han Zhong
Zhuoran Yang
Zhaoran Wang
Sai Li
245
33
0
27 Dec 2021
Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP
International Conference on Machine Learning (ICML), 2021
Liyu Chen
Rahul Jain
Haipeng Luo
154
15
0
18 Dec 2021
Recent Advances in Reinforcement Learning in Finance
B. Hambly
Renyuan Xu
Huining Yang
OffRL
465
229
0
08 Dec 2021
Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization
International Conference on Learning Representations (ICLR), 2021
Thanh Nguyen-Tang
Sunil R. Gupta
A. Nguyen
Svetha Venkatesh
OffRL
182
33
0
27 Nov 2021
A Free Lunch from the Noise: Provable and Practical Exploration for Representation Learning
Conference on Uncertainty in Artificial Intelligence (UAI), 2021
Zhaolin Ren
Tianjun Zhang
Csaba Szepesvári
Bo Dai
213
22
0
22 Nov 2021
Safe Policy Optimization with Local Generalized Linear Function Approximations
Akifumi Wachi
Yunyue Wei
Yanan Sui
OffRL
172
11
0
09 Nov 2021
Improved Regret Analysis for Variance-Adaptive Linear Bandits and Horizon-Free Linear Mixture MDPs
Neural Information Processing Systems (NeurIPS), 2021
Yeoneung Kim
Insoon Yang
Kwang-Sung Jun
183
43
0
05 Nov 2021
Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
Neural Information Processing Systems (NeurIPS), 2021
Matteo Papini
Andrea Tirinzoni
Aldo Pacchiano
Marcello Restelli
A. Lazaric
Matteo Pirotta
203
21
0
27 Oct 2021
Learning Stochastic Shortest Path with Linear Function Approximation
International Conference on Machine Learning (ICML), 2021
Steffen Czolbe
Jiafan He
Adrian Dalca
Quanquan Gu
306
33
0
25 Oct 2021
Locally Differentially Private Reinforcement Learning for Linear Mixture Markov Decision Processes
Chonghua Liao
Jiafan He
Quanquan Gu
204
18
0
19 Oct 2021
On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game
Delin Qu
Jieping Ye
Zhaoran Wang
Zhuoran Yang
OffRL
221
25
0
19 Oct 2021
Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs
Han Zhong
Zhuoran Yang
Zhaoran Wang
Csaba Szepesvári
295
21
0
18 Oct 2021
Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation
Weitong Zhang
Dongruo Zhou
Quanquan Gu
OffRL
253
31
0
12 Oct 2021
Representation Learning for Online and Offline RL in Low-rank MDPs
International Conference on Learning Representations (ICLR), 2021
Masatoshi Uehara
Xuezhou Zhang
Wen Sun
OffRL
389
134
0
09 Oct 2021
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning
Andrea Zanette
Martin J. Wainwright
Emma Brunskill
OffRL
238
128
0
19 Aug 2021
Provably Efficient Generative Adversarial Imitation Learning for Online and Offline Setting with Linear Function Approximation
Zhihan Liu
Yufeng Zhang
Zuyue Fu
Zhuoran Yang
Zhaoran Wang
OffRL
111
7
0
19 Aug 2021
Efficient Local Planning with Linear Function Approximation
International Conference on Algorithmic Learning Theory (ALT), 2021
Dong Yin
Botao Hao
Yasin Abbasi-Yadkori
N. Lazić
Csaba Szepesvári
333
21
0
12 Aug 2021
Towards General Function Approximation in Zero-Sum Markov Games
International Conference on Learning Representations (ICLR), 2021
Baihe Huang
Jason D. Lee
Zhaoran Wang
Zhuoran Yang
218
49
0
30 Jul 2021
Policy Optimization in Adversarial MDPs: Improved Exploration via Dilated Bonuses
Neural Information Processing Systems (NeurIPS), 2021
Haipeng Luo
Chen-Yu Wei
Chung-Wei Lee
222
48
0
18 Jul 2021
PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided Exploration
International Conference on Machine Learning (ICML), 2021
Yuda Song
Wen Sun
237
23
0
15 Jul 2021
Going Beyond Linear RL: Sample Efficient Neural Function Approximation
Baihe Huang
Kaixuan Huang
Sham Kakade
Jason D. Lee
Qi Lei
Runzhe Wang
Jiaqi Yang
162
9
0
14 Jul 2021
Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage
Masatoshi Uehara
Wen Sun
OffRL
329
159
0
13 Jul 2021
Model Selection for Generic Reinforcement Learning
Avishek Ghosh
Sayak Ray Chowdhury
Kannan Ramchandran
166
1
0
13 Jul 2021
Gap-Dependent Bounds for Two-Player Markov Games
Zehao Dou
Zhuoran Yang
Zhaoran Wang
S. Du
105
8
0
01 Jul 2021
Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL
Conference on Uncertainty in Artificial Intelligence (UAI), 2021
Weitong Zhang
Jiafan He
Dongruo Zhou
Amy Zhang
Quanquan Gu
OffRL
211
12
0
22 Jun 2021
Agnostic Reinforcement Learning with Low-Rank MDPs and Rich Observations
Neural Information Processing Systems (NeurIPS), 2021
Christoph Dann
Yishay Mansour
M. Mohri
Ayush Sekhari
Karthik Sridharan
OffRL
121
12
0
22 Jun 2021
Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity
International Conference on Machine Learning (ICML), 2021
Dhruv Malik
Aldo Pacchiano
Vishwak Srinivasan
Yuanzhi Li
141
7
0
15 Jun 2021
Online Sub-Sampling for Reinforcement Learning with General Function Approximation
Dingwen Kong
Ruslan Salakhutdinov
Ruosong Wang
Lin F. Yang
OffRL
187
1
0
14 Jun 2021
Multi-facet Contextual Bandits: A Neural Network Perspective
Knowledge Discovery and Data Mining (KDD), 2021
Yikun Ban
Jingrui He
C. Cook
309
29
0
06 Jun 2021
Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model
Neural Information Processing Systems (NeurIPS), 2021
Bingyan Wang
Yuling Yan
Jianqing Fan
389
22
0
28 May 2021
Previous
1
2
3
4
5
Next