Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1905.10389
Cited By
v1
v2 (latest)
Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound
International Conference on Machine Learning (ICML), 2019
24 May 2019
Lin F. Yang
Mengdi Wang
OffRL
GP
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound"
50 / 226 papers shown
Title
Distributionally Robust Online Markov Game with Linear Function Approximation
Zewu Zheng
Yuanyuan Lin
OOD
OffRL
264
0
0
11 Nov 2025
Reinforcement Learning Using known Invariances
Alexandru Cioba
Aya Kayal
Laura Toni
Sattar Vakili
A. Bernacchia
112
0
0
05 Nov 2025
No-Regret Thompson Sampling for Finite-Horizon Markov Decision Processes with Gaussian Processes
Jasmine Bayrooti
Sattar Vakili
Amanda Prorok
Carl Henrik Ek
116
0
0
23 Oct 2025
Generalized Kernelized Bandits: A Novel Self-Normalized Bernstein-Like Dimension-Free Inequality and Regret Bounds
Alberto Maria Metelli
Simone Drago
Marco Mussi
101
2
0
03 Aug 2025
Statistical and Algorithmic Foundations of Reinforcement Learning
Yuejie Chi
Yuxin Chen
Yuting Wei
OffRL
197
2
0
19 Jul 2025
Learning Task-Agnostic Motifs to Capture the Continuous Nature of Animal Behavior
Jiyi Wang
Jingyang Ke
Bo Dai
Anqi Wu
144
0
0
18 Jun 2025
The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability
Jiachen Hu
Rui Ai
Han Zhong
Xiaoyu Chen
L. Wang
Zhaoran Wang
Zhuoran Yang
193
0
0
11 Jun 2025
Generalized Linear Markov Decision Process
Sinian Zhang
Kaicheng Zhang
Ziping Xu
Tianxi Cai
D. Zhou
192
0
0
01 Jun 2025
Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation
Neural Information Processing Systems (NeurIPS), 2024
Long-Fei Li
Yu Zhang
Peng Zhao
Zhi Zhou
479
8
0
17 Jan 2025
Efficient, Low-Regret, Online Reinforcement Learning for Linear MDPs
Philips George John
Arnab Bhattacharyya
Silviu Maniu
Dimitrios Myrisiotis
Zhenan Wu
OffRL
210
0
0
16 Nov 2024
Demystifying Linear MDPs and Novel Dynamics Aggregation Framework
International Conference on Learning Representations (ICLR), 2024
Joongkyu Lee
Min-hwan Oh
183
5
0
31 Oct 2024
Primal-Dual Spectral Representation for Off-policy Evaluation
International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Yang Hu
Tianyi Chen
Na Li
Kai Wang
Bo Dai
OffRL
258
3
0
23 Oct 2024
Learning Infinite-Horizon Average-Reward Linear Mixture MDPs of Bounded Span
International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Woojin Chae
Kihyuk Hong
Yufan Zhang
Ambuj Tewari
Dabeen Lee
162
1
0
19 Oct 2024
Neural Combinatorial Clustered Bandits for Recommendation Systems
AAAI Conference on Artificial Intelligence (AAAI), 2024
Baran Atalar
Carlee Joe-Wong
CML
OffRL
156
3
0
18 Oct 2024
Upper and Lower Bounds for Distributionally Robust Off-Dynamics Reinforcement Learning
Zhishuai Liu
Weixin Wang
Pan Xu
340
13
0
30 Sep 2024
BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning
Hao-ming Lin
Wenhao Ding
Jian Chen
Laixi Shi
Jiacheng Zhu
Yue Liu
Ding Zhao
OffRL
CML
437
3
0
15 Jul 2024
Spectral Representation for Causal Estimation with Hidden Confounders
Zhaolin Ren
Haotian Sun
Antoine Moulin
Arthur Gretton
Bo Dai
CML
226
9
0
15 Jul 2024
Diffusion Spectral Representation for Reinforcement Learning
Dmitry Shribak
Chen-Xiao Gao
Yitong Li
Chenjun Xiao
Bo Dai
DiffM
320
8
0
23 Jun 2024
More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling
Haque Ishfaq
Yixin Tan
Yu Yang
Qingfeng Lan
Jianfeng Lu
A. Rupam Mahmood
Doina Precup
Pan Xu
170
8
0
18 Jun 2024
Linear Bellman Completeness Suffices for Efficient Online Reinforcement Learning with Few Actions
Noah Golowich
Ankur Moitra
OffRL
217
1
0
17 Jun 2024
Graph Neural Thompson Sampling
Shuang Wu
Arash A. Amini
306
1
0
15 Jun 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert D. Nowak
596
5
0
07 Jun 2024
Bayesian Design Principles for Offline-to-Online Reinforcement Learning
Haotian Hu
Yiqin Yang
Jianing Ye
Chengjie Wu
Ziqing Mai
Yujing Hu
Tangjie Lv
Changjie Fan
Qianchuan Zhao
Chongjie Zhang
OffRL
OnRL
215
7
0
31 May 2024
Efficient Duple Perturbation Robustness in Low-rank MDPs
Yang Hu
Haitong Ma
Bo Dai
Na Li
158
0
0
11 Apr 2024
Skill Transfer and Discovery for Sim-to-Real Learning: A Representation-Based Viewpoint
Haitong Ma
Tongzheng Ren
Bo Dai
Na Li
169
3
0
07 Apr 2024
Sample Complexity of Offline Distributionally Robust Linear Markov Decision Processes
He Wang
Laixi Shi
Yuejie Chi
OffRL
366
15
0
19 Mar 2024
RL in Markov Games with Independent Function Approximation: Improved Sample Complexity Bound under the Local Access Model
International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Junyi Fan
Yuxuan Han
Jialin Zeng
Jian-Feng Cai
Yang Wang
Yang Xiang
Jiheng Zhang
400
1
0
18 Mar 2024
Distributionally Robust Off-Dynamics Reinforcement Learning: Provable Efficiency with Linear Function Approximation
Zhishuai Liu
Pan Xu
OOD
OffRL
260
20
0
23 Feb 2024
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning
Zihao Li
Boyi Liu
Zhuoran Yang
Zhaoran Wang
Mengdi Wang
277
2
0
16 Feb 2024
Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption
Chen Ye
Jiafan He
Quanquan Gu
Tong Zhang
279
10
0
14 Feb 2024
Refined Sample Complexity for Markov Games with Independent Linear Function Approximation
Annual Conference Computational Learning Theory (COLT), 2024
Yan Dai
Qiwen Cui
S. S. Du
310
1
0
11 Feb 2024
Information-Theoretic State Variable Selection for Reinforcement Learning
Charles Westphal
Stephen Hailes
Mirco Musolesi
195
4
0
21 Jan 2024
Long-term Safe Reinforcement Learning with Binary Feedback
AAAI Conference on Artificial Intelligence (AAAI), 2024
Akifumi Wachi
Wataru Hashimoto
Kazumune Hashimoto
OffRL
330
6
0
08 Jan 2024
Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization
AAAI Conference on Artificial Intelligence (AAAI), 2024
Jiahao Qiu
Hui Yuan
Jinghong Zhang
Wentao Chen
Huazheng Wang
Mengdi Wang
178
2
0
08 Jan 2024
Risk-sensitive Markov Decision Process and Learning under General Utility Functions
Social Science Research Network (SSRN), 2023
Zhengqi Wu
Renyuan Xu
192
4
0
22 Nov 2023
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Hongming Zhang
Zhaolin Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
357
6
0
20 Nov 2023
Data-Guided Regulator for Adaptive Nonlinear Control
Niyousha Rahimi
M. Mesbahi
208
0
0
20 Nov 2023
Low-Rank MDPs with Continuous Action Spaces
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Andrew Bennett
Nathan Kallus
Miruna Oprescu
229
2
0
06 Nov 2023
Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation
Neural Information Processing Systems (NeurIPS), 2023
Nikki Lijing Kuang
Ming Yin
Mengdi Wang
Yu Wang
Yian Ma
281
6
0
29 Oct 2023
Unsupervised Behavior Extraction via Random Intent Priors
Neural Information Processing Systems (NeurIPS), 2023
Haotian Hu
Yiqin Yang
Jianing Ye
Ziqing Mai
Chongjie Zhang
OffRL
253
14
0
28 Oct 2023
Uncertainty-aware transfer across tasks using hybrid model-based successor feature reinforcement learning
Parvin Malekzadeh
Ming Hou
Konstantinos N. Plataniotis
262
1
0
16 Oct 2023
Bi-Level Offline Policy Optimization with Limited Exploration
Neural Information Processing Systems (NeurIPS), 2023
Wenzhuo Zhou
OffRL
283
5
0
10 Oct 2023
Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning
International Conference on Learning Representations (ICLR), 2023
Qiwei Di
Heyang Zhao
Jiafan He
Quanquan Gu
OffRL
217
8
0
02 Oct 2023
Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Zhihan Liu
Hao Hu
Shenao Zhang
Hongyi Guo
Shuqi Ke
Boyi Liu
Zhaoran Wang
LLMAG
LRM
456
45
0
29 Sep 2023
Stackelberg Batch Policy Learning
Wenzhuo Zhou
Annie Qu
OffRL
246
1
0
28 Sep 2023
Rate-Optimal Policy Optimization for Linear Markov Decision Processes
International Conference on Machine Learning (ICML), 2023
Uri Sherman
Alon Cohen
Tomer Koren
Yishay Mansour
367
8
0
28 Aug 2023
Model-based Offline Reinforcement Learning with Count-based Conservatism
International Conference on Machine Learning (ICML), 2023
Byeongchang Kim
Min Hwan Oh
OffRL
179
14
0
21 Jul 2023
Online Network Source Optimization with Graph-Kernel MAB
Laura Toni
P. Frossard
277
1
0
07 Jul 2023
Sequential Neural Barriers for Scalable Dynamic Obstacle Avoidance
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Hong-Den Yu
Chiaki Hirayama
Chenning Yu
Sylvia Herbert
Sicun Gao
203
21
0
06 Jul 2023
Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback
International Conference on Learning Representations (ICLR), 2023
Yu Chen
Yihan Du
Pihe Hu
Si-Yi Wang
De-hui Wu
Longbo Huang
291
11
0
06 Jul 2023
1
2
3
4
5
Next