ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.10389
  4. Cited By
Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and
  Regret Bound
v1v2 (latest)

Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound

International Conference on Machine Learning (ICML), 2019
24 May 2019
Lin F. Yang
Mengdi Wang
    OffRLGP
ArXiv (abs)PDFHTML

Papers citing "Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound"

50 / 226 papers shown
Title
Stability of Q-Learning Through Design and Optimism
Stability of Q-Learning Through Design and Optimism
Sean P. Meyn
229
10
0
05 Jul 2023
Provably Efficient Representation Learning with Tractable Planning in
  Low-Rank POMDP
Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDPInternational Conference on Machine Learning (ICML), 2023
Jiacheng Guo
Zihao Li
Huazheng Wang
Mengdi Wang
Zhuoran Yang
Xuezhou Zhang
200
7
0
21 Jun 2023
On the Model-Misspecification in Reinforcement Learning
On the Model-Misspecification in Reinforcement LearningInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Yunfan Li
Lin F. Yang
254
6
0
19 Jun 2023
Low-Switching Policy Gradient with Exploration via Online Sensitivity
  Sampling
Low-Switching Policy Gradient with Exploration via Online Sensitivity SamplingInternational Conference on Machine Learning (ICML), 2023
Yunfan Li
Yiran Wang
Y. Cheng
Lin F. Yang
OffRL
189
6
0
15 Jun 2023
Langevin Thompson Sampling with Logarithmic Communication: Bandits and
  Reinforcement Learning
Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023
Amin Karbasi
Nikki Lijing Kuang
Yi-An Ma
Siddharth Mitra
OffRL
174
6
0
15 Jun 2023
Kernelized Reinforcement Learning with Order Optimal Regret Bounds
Kernelized Reinforcement Learning with Order Optimal Regret BoundsNeural Information Processing Systems (NeurIPS), 2023
Sattar Vakili
Julia Olkhovskaya
286
13
0
13 Jun 2023
Provable and Practical: Efficient Exploration in Reinforcement Learning
  via Langevin Monte Carlo
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte CarloInternational Conference on Learning Representations (ICLR), 2023
Haque Ishfaq
Qingfeng Lan
Pan Xu
A. R. Mahmood
Doina Precup
Anima Anandkumar
Kamyar Azizzadenesheli
BDLOffRL
296
27
0
29 May 2023
Matrix Estimation for Offline Reinforcement Learning with Low-Rank
  Structure
Matrix Estimation for Offline Reinforcement Learning with Low-Rank Structure
Xumei Xi
Chao Yu
Yudong Chen
OffRL
140
0
0
24 May 2023
What can online reinforcement learning with function approximation
  benefit from general coverage conditions?
What can online reinforcement learning with function approximation benefit from general coverage conditions?International Conference on Machine Learning (ICML), 2023
Fanghui Liu
Luca Viano
Volkan Cevher
OffRL
223
4
0
25 Apr 2023
Provably Feedback-Efficient Reinforcement Learning via Active Reward
  Learning
Provably Feedback-Efficient Reinforcement Learning via Active Reward LearningNeural Information Processing Systems (NeurIPS), 2023
Dingwen Kong
Lin F. Yang
225
16
0
18 Apr 2023
Stochastic Nonlinear Control via Finite-dimensional Spectral Dynamic Embedding
Stochastic Nonlinear Control via Finite-dimensional Spectral Dynamic EmbeddingIEEE Conference on Decision and Control (CDC), 2023
Zhaolin Ren
Tongzheng Ren
Haitong Ma
Na Li
Bo Dai
273
12
0
08 Apr 2023
Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs
Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPsInternational Conference on Machine Learning (ICML), 2023
Junkai Zhang
Weitong Zhang
Quanquan Gu
175
5
0
17 Mar 2023
Variance-aware robust reinforcement learning with linear function
  approximation under heavy-tailed rewards
Variance-aware robust reinforcement learning with linear function approximation under heavy-tailed rewards
Xiang Li
Qiang Sun
217
9
0
09 Mar 2023
Finite-sample Guarantees for Nash Q-learning with Linear Function
  Approximation
Finite-sample Guarantees for Nash Q-learning with Linear Function ApproximationConference on Uncertainty in Artificial Intelligence (UAI), 2023
Pedro Cisneros-Velarde
Oluwasanmi Koyejo
226
1
0
01 Mar 2023
Optimistic Planning by Regularized Dynamic Programming
Optimistic Planning by Regularized Dynamic ProgrammingInternational Conference on Machine Learning (ICML), 2023
Antoine Moulin
Gergely Neu
255
8
0
27 Feb 2023
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function
  Approximation
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function ApproximationInternational Conference on Learning Representations (ICLR), 2023
Thanh Nguyen-Tang
R. Arora
OffRL
194
6
0
24 Feb 2023
Provably Efficient Reinforcement Learning via Surprise Bound
Provably Efficient Reinforcement Learning via Surprise BoundInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Hanlin Zhu
Ruosong Wang
Jason D. Lee
OffRL
151
5
0
22 Feb 2023
Provably Efficient Exploration in Quantum Reinforcement Learning with
  Logarithmic Worst-Case Regret
Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case RegretInternational Conference on Machine Learning (ICML), 2023
Han Zhong
Jiachen Hu
Yecheng Xue
Tongyang Li
Liwei Wang
197
11
0
21 Feb 2023
Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement
  Learning: Adaptivity and Computational Efficiency
Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational EfficiencyAnnual Conference Computational Learning Theory (COLT), 2023
Heyang Zhao
Jiafan He
Dongruo Zhou
Tong Zhang
Quanquan Gu
235
37
0
21 Feb 2023
Improved Regret Bounds for Linear Adversarial MDPs via Linear
  Optimization
Improved Regret Bounds for Linear Adversarial MDPs via Linear Optimization
Fang-yuan Kong
Xiangcheng Zhang
Baoxiang Wang
Shuai Li
186
14
0
14 Feb 2023
Breaking the Curse of Multiagents in a Large State Space: RL in Markov
  Games with Independent Linear Function Approximation
Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function ApproximationAnnual Conference Computational Learning Theory (COLT), 2023
Qiwen Cui
Jianchao Tan
S. Du
307
28
0
07 Feb 2023
Online Reinforcement Learning with Uncertain Episode Lengths
Online Reinforcement Learning with Uncertain Episode LengthsAAAI Conference on Artificial Intelligence (AAAI), 2023
Debmalya Mandal
Goran Radanović
Jiarui Gan
Adish Singla
R. Majumdar
OffRL
163
9
0
07 Feb 2023
Sample Complexity of Kernel-Based Q-Learning
Sample Complexity of Kernel-Based Q-LearningInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Sing-Yuan Yeh
Fu-Chieh Chang
Chang-Wei Yueh
Pei-Yuan Wu
A. Bernacchia
Sattar Vakili
OffRL
259
6
0
01 Feb 2023
Learning in POMDPs is Sample-Efficient with Hindsight Observability
Learning in POMDPs is Sample-Efficient with Hindsight ObservabilityInternational Conference on Machine Learning (ICML), 2023
Jonathan Lee
Alekh Agarwal
Christoph Dann
Tong Zhang
231
23
0
31 Jan 2023
Improved Regret for Efficient Online Reinforcement Learning with Linear
  Function Approximation
Improved Regret for Efficient Online Reinforcement Learning with Linear Function ApproximationInternational Conference on Machine Learning (ICML), 2023
Uri Sherman
Tomer Koren
Yishay Mansour
301
13
0
30 Jan 2023
Refined Regret for Adversarial MDPs with Linear Function Approximation
Refined Regret for Adversarial MDPs with Linear Function ApproximationInternational Conference on Machine Learning (ICML), 2023
Yan Dai
Haipeng Luo
Chen-Yu Wei
Julian Zimmert
252
13
0
30 Jan 2023
STEERING: Stein Information Directed Exploration for Model-Based
  Reinforcement Learning
STEERING: Stein Information Directed Exploration for Model-Based Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023
Souradip Chakraborty
Amrit Singh Bedi
Alec Koppel
Mengdi Wang
Furong Huang
Dinesh Manocha
158
9
0
28 Jan 2023
Multi-Agent Congestion Cost Minimization With Linear Function
  Approximations
Multi-Agent Congestion Cost Minimization With Linear Function ApproximationsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Prashant Trivedi
N. Hemachandra
200
1
0
26 Jan 2023
Model-Based Reinforcement Learning with Multinomial Logistic Function
  Approximation
Model-Based Reinforcement Learning with Multinomial Logistic Function ApproximationAAAI Conference on Artificial Intelligence (AAAI), 2022
Taehyun Hwang
Min Hwan Oh
206
11
0
27 Dec 2022
Latent Variable Representation for Reinforcement Learning
Latent Variable Representation for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2022
Zhaolin Ren
Chenjun Xiao
Tianjun Zhang
Na Li
Zhaoran Wang
Sujay Sanghavi
Dale Schuurmans
Bo Dai
OffRL
202
11
0
17 Dec 2022
Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision
  Processes
Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision ProcessesInternational Conference on Machine Learning (ICML), 2022
Jiafan He
Heyang Zhao
Dongruo Zhou
Quanquan Gu
OffRL
353
63
0
12 Dec 2022
Causal Deep Reinforcement Learning Using Observational Data
Causal Deep Reinforcement Learning Using Observational DataInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Wenxuan Zhu
Chao Yu
Qiaosheng Zhang
CMLOffRL
151
8
0
28 Nov 2022
Linear Reinforcement Learning with Ball Structure Action Space
Linear Reinforcement Learning with Ball Structure Action SpaceInternational Conference on Algorithmic Learning Theory (ALT), 2022
Zeyu Jia
Randy Jia
Dhruv Madeka
Dean Phillips Foster
130
1
0
14 Nov 2022
Leveraging Offline Data in Online Reinforcement Learning
Leveraging Offline Data in Online Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Andrew Wagenmaker
Aldo Pacchiano
OffRLOnRL
235
44
0
09 Nov 2022
Unpacking Reward Shaping: Understanding the Benefits of Reward
  Engineering on Sample Complexity
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample ComplexityNeural Information Processing Systems (NeurIPS), 2022
Abhishek Gupta
Aldo Pacchiano
Yuexiang Zhai
Sham Kakade
Sergey Levine
OffRL
189
92
0
18 Oct 2022
Bilinear Exponential Family of MDPs: Frequentist Regret Bound with
  Tractable Exploration and Planning
Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration and PlanningAAAI Conference on Artificial Intelligence (AAAI), 2022
Reda Ouhamma
D. Basu
Odalric-Ambrym Maillard
OffRL
158
12
0
05 Oct 2022
Offline Reinforcement Learning with Differentiable Function
  Approximation is Provably Efficient
Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient
Ming Yin
Mengdi Wang
Yu Wang
OffRL
269
12
0
03 Oct 2022
A General Framework for Sample-Efficient Function Approximation in
  Reinforcement Learning
A General Framework for Sample-Efficient Function Approximation in Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2022
Zixiang Chen
C. J. Li
An Yuan
Quanquan Gu
Michael I. Jordan
OffRL
212
30
0
30 Sep 2022
Conservative Dual Policy Optimization for Efficient Model-Based
  Reinforcement Learning
Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Shen Zhang
134
6
0
16 Sep 2022
Understanding Deep Neural Function Approximation in Reinforcement
  Learning via $ε$-Greedy Exploration
Understanding Deep Neural Function Approximation in Reinforcement Learning via εεε-Greedy ExplorationNeural Information Processing Systems (NeurIPS), 2022
Fanghui Liu
Luca Viano
Volkan Cevher
285
23
0
15 Sep 2022
Dynamic Regret of Online Markov Decision Processes
Dynamic Regret of Online Markov Decision ProcessesInternational Conference on Machine Learning (ICML), 2022
Peng Zhao
Longfei Li
Zhi Zhou
OffRL
184
19
0
26 Aug 2022
Learning Two-Player Mixture Markov Games: Kernel Function Approximation
  and Correlated Equilibrium
Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium
C. J. Li
Dongruo Zhou
Quanquan Gu
Sai Li
139
2
0
10 Aug 2022
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning
  in Online Reinforcement Learning
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Delin Qu
Lingxiao Wang
Chenjia Bai
Zhuoran Yang
Zhaoran Wang
SSLOffRL
372
32
0
29 Jul 2022
Making Linear MDPs Practical via Contrastive Representation Learning
Making Linear MDPs Practical via Contrastive Representation LearningInternational Conference on Machine Learning (ICML), 2022
Tianjun Zhang
Zhaolin Ren
Mengjiao Yang
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
181
52
0
14 Jul 2022
PAC Reinforcement Learning for Predictive State Representations
PAC Reinforcement Learning for Predictive State RepresentationsInternational Conference on Learning Representations (ICLR), 2022
Wenhao Zhan
Masatoshi Uehara
Wen Sun
Jason D. Lee
328
43
0
12 Jul 2022
Model Selection in Reinforcement Learning with General Function
  Approximations
Model Selection in Reinforcement Learning with General Function Approximations
Avishek Ghosh
Sayak Ray Chowdhury
114
3
0
06 Jul 2022
Provably Efficient Reinforcement Learning for Online Adaptive Influence
  Maximization
Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization
Kaixuan Huang
Yuehua Wu
Xuezhou Zhang
Shenyinying Tu
Qingyun Wu
Mengdi Wang
Huazheng Wang
98
1
0
29 Jun 2022
Joint Representation Training in Sequential Tasks with Shared Structure
Joint Representation Training in Sequential Tasks with Shared Structure
Aldo Pacchiano
Ofir Nachum
Nilseh Tripuraneni
Peter L. Bartlett
235
5
0
24 Jun 2022
Provably Efficient Reinforcement Learning in Partially Observable
  Dynamical Systems
Provably Efficient Reinforcement Learning in Partially Observable Dynamical SystemsNeural Information Processing Systems (NeurIPS), 2022
Masatoshi Uehara
Ayush Sekhari
Jason D. Lee
Nathan Kallus
Wen Sun
OffRL
259
40
0
24 Jun 2022
Nearly Minimax Optimal Reinforcement Learning with Linear Function
  Approximation
Nearly Minimax Optimal Reinforcement Learning with Linear Function ApproximationInternational Conference on Machine Learning (ICML), 2022
Pihe Hu
Yu Chen
Longbo Huang
311
36
0
23 Jun 2022
Previous
12345
Next