ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.10389
  4. Cited By
Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and
  Regret Bound
v1v2 (latest)

Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound

International Conference on Machine Learning (ICML), 2019
24 May 2019
Lin F. Yang
Mengdi Wang
    OffRLGP
ArXiv (abs)PDFHTML

Papers citing "Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound"

50 / 226 papers shown
Title
Distributionally Robust Online Markov Game with Linear Function Approximation
Distributionally Robust Online Markov Game with Linear Function Approximation
Zewu Zheng
Yuanyuan Lin
OODOffRL
220
0
0
11 Nov 2025
Reinforcement Learning Using known Invariances
Reinforcement Learning Using known Invariances
Alexandru Cioba
Aya Kayal
Laura Toni
Sattar Vakili
A. Bernacchia
68
0
0
05 Nov 2025
No-Regret Thompson Sampling for Finite-Horizon Markov Decision Processes with Gaussian Processes
No-Regret Thompson Sampling for Finite-Horizon Markov Decision Processes with Gaussian Processes
Jasmine Bayrooti
Sattar Vakili
Amanda Prorok
Carl Henrik Ek
100
0
0
23 Oct 2025
Generalized Kernelized Bandits: A Novel Self-Normalized Bernstein-Like Dimension-Free Inequality and Regret Bounds
Generalized Kernelized Bandits: A Novel Self-Normalized Bernstein-Like Dimension-Free Inequality and Regret Bounds
Alberto Maria Metelli
Simone Drago
Marco Mussi
89
2
0
03 Aug 2025
Statistical and Algorithmic Foundations of Reinforcement Learning
Statistical and Algorithmic Foundations of Reinforcement Learning
Yuejie Chi
Yuxin Chen
Yuting Wei
OffRL
173
2
0
19 Jul 2025
Learning Task-Agnostic Motifs to Capture the Continuous Nature of Animal Behavior
Learning Task-Agnostic Motifs to Capture the Continuous Nature of Animal Behavior
Jiyi Wang
Jingyang Ke
Bo Dai
Anqi Wu
136
0
0
18 Jun 2025
The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability
The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability
Jiachen Hu
Rui Ai
Han Zhong
Xiaoyu Chen
L. Wang
Zhaoran Wang
Zhuoran Yang
165
0
0
11 Jun 2025
Generalized Linear Markov Decision Process
Generalized Linear Markov Decision Process
Sinian Zhang
Kaicheng Zhang
Ziping Xu
Tianxi Cai
D. Zhou
172
0
0
01 Jun 2025
Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation
Provably Efficient Reinforcement Learning with Multinomial Logit Function ApproximationNeural Information Processing Systems (NeurIPS), 2024
Long-Fei Li
Yu Zhang
Peng Zhao
Zhi Zhou
467
8
0
17 Jan 2025
Efficient, Low-Regret, Online Reinforcement Learning for Linear MDPs
Philips George John
Arnab Bhattacharyya
Silviu Maniu
Dimitrios Myrisiotis
Zhenan Wu
OffRL
178
0
0
16 Nov 2024
Demystifying Linear MDPs and Novel Dynamics Aggregation Framework
Demystifying Linear MDPs and Novel Dynamics Aggregation FrameworkInternational Conference on Learning Representations (ICLR), 2024
Joongkyu Lee
Min-hwan Oh
155
5
0
31 Oct 2024
Primal-Dual Spectral Representation for Off-policy Evaluation
Primal-Dual Spectral Representation for Off-policy EvaluationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Yang Hu
Tianyi Chen
Na Li
Kai Wang
Bo Dai
OffRL
246
2
0
23 Oct 2024
Learning Infinite-Horizon Average-Reward Linear Mixture MDPs of Bounded
  Span
Learning Infinite-Horizon Average-Reward Linear Mixture MDPs of Bounded SpanInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Woojin Chae
Kihyuk Hong
Yufan Zhang
Ambuj Tewari
Dabeen Lee
158
1
0
19 Oct 2024
Neural Combinatorial Clustered Bandits for Recommendation Systems
Neural Combinatorial Clustered Bandits for Recommendation SystemsAAAI Conference on Artificial Intelligence (AAAI), 2024
Baran Atalar
Carlee Joe-Wong
CMLOffRL
152
3
0
18 Oct 2024
Upper and Lower Bounds for Distributionally Robust Off-Dynamics
  Reinforcement Learning
Upper and Lower Bounds for Distributionally Robust Off-Dynamics Reinforcement Learning
Zhishuai Liu
Weixin Wang
Pan Xu
290
12
0
30 Sep 2024
BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning
BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning
Hao-ming Lin
Wenhao Ding
Jian Chen
Laixi Shi
Jiacheng Zhu
Yue Liu
Ding Zhao
OffRLCML
409
3
0
15 Jul 2024
Spectral Representation for Causal Estimation with Hidden Confounders
Spectral Representation for Causal Estimation with Hidden Confounders
Zhaolin Ren
Haotian Sun
Antoine Moulin
Arthur Gretton
Bo Dai
CML
206
7
0
15 Jul 2024
Diffusion Spectral Representation for Reinforcement Learning
Diffusion Spectral Representation for Reinforcement Learning
Dmitry Shribak
Chen-Xiao Gao
Yitong Li
Chenjun Xiao
Bo Dai
DiffM
288
7
0
23 Jun 2024
More Efficient Randomized Exploration for Reinforcement Learning via
  Approximate Sampling
More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling
Haque Ishfaq
Yixin Tan
Yu Yang
Qingfeng Lan
Jianfeng Lu
A. Rupam Mahmood
Doina Precup
Pan Xu
158
8
0
18 Jun 2024
Linear Bellman Completeness Suffices for Efficient Online Reinforcement
  Learning with Few Actions
Linear Bellman Completeness Suffices for Efficient Online Reinforcement Learning with Few Actions
Noah Golowich
Ankur Moitra
OffRL
193
1
0
17 Jun 2024
Graph Neural Thompson Sampling
Graph Neural Thompson Sampling
Shuang Wu
Arash A. Amini
282
0
0
15 Jun 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert D. Nowak
548
5
0
07 Jun 2024
Bayesian Design Principles for Offline-to-Online Reinforcement Learning
Bayesian Design Principles for Offline-to-Online Reinforcement Learning
Haotian Hu
Yiqin Yang
Jianing Ye
Chengjie Wu
Ziqing Mai
Yujing Hu
Tangjie Lv
Changjie Fan
Qianchuan Zhao
Chongjie Zhang
OffRLOnRL
195
7
0
31 May 2024
Efficient Duple Perturbation Robustness in Low-rank MDPs
Efficient Duple Perturbation Robustness in Low-rank MDPs
Yang Hu
Haitong Ma
Bo Dai
Na Li
158
0
0
11 Apr 2024
Skill Transfer and Discovery for Sim-to-Real Learning: A
  Representation-Based Viewpoint
Skill Transfer and Discovery for Sim-to-Real Learning: A Representation-Based Viewpoint
Haitong Ma
Tongzheng Ren
Bo Dai
Na Li
157
3
0
07 Apr 2024
Sample Complexity of Offline Distributionally Robust Linear Markov
  Decision Processes
Sample Complexity of Offline Distributionally Robust Linear Markov Decision Processes
He Wang
Laixi Shi
Yuejie Chi
OffRL
321
14
0
19 Mar 2024
RL in Markov Games with Independent Function Approximation: Improved
  Sample Complexity Bound under the Local Access Model
RL in Markov Games with Independent Function Approximation: Improved Sample Complexity Bound under the Local Access ModelInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Junyi Fan
Yuxuan Han
Jialin Zeng
Jian-Feng Cai
Yang Wang
Yang Xiang
Jiheng Zhang
328
1
0
18 Mar 2024
Distributionally Robust Off-Dynamics Reinforcement Learning: Provable
  Efficiency with Linear Function Approximation
Distributionally Robust Off-Dynamics Reinforcement Learning: Provable Efficiency with Linear Function Approximation
Zhishuai Liu
Pan Xu
OODOffRL
216
19
0
23 Feb 2024
Double Duality: Variational Primal-Dual Policy Optimization for
  Constrained Reinforcement Learning
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning
Zihao Li
Boyi Liu
Zhuoran Yang
Zhaoran Wang
Mengdi Wang
261
2
0
16 Feb 2024
Towards Robust Model-Based Reinforcement Learning Against Adversarial
  Corruption
Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption
Chen Ye
Jiafan He
Quanquan Gu
Tong Zhang
243
10
0
14 Feb 2024
Refined Sample Complexity for Markov Games with Independent Linear
  Function Approximation
Refined Sample Complexity for Markov Games with Independent Linear Function ApproximationAnnual Conference Computational Learning Theory (COLT), 2024
Yan Dai
Qiwen Cui
S. S. Du
270
1
0
11 Feb 2024
Information-Theoretic State Variable Selection for Reinforcement
  Learning
Information-Theoretic State Variable Selection for Reinforcement Learning
Charles Westphal
Stephen Hailes
Mirco Musolesi
191
4
0
21 Jan 2024
Long-term Safe Reinforcement Learning with Binary Feedback
Long-term Safe Reinforcement Learning with Binary FeedbackAAAI Conference on Artificial Intelligence (AAAI), 2024
Akifumi Wachi
Wataru Hashimoto
Kazumune Hashimoto
OffRL
306
6
0
08 Jan 2024
Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization
Tree Search-Based Evolutionary Bandits for Protein Sequence OptimizationAAAI Conference on Artificial Intelligence (AAAI), 2024
Jiahao Qiu
Hui Yuan
Jinghong Zhang
Wentao Chen
Huazheng Wang
Mengdi Wang
149
2
0
08 Jan 2024
Risk-sensitive Markov Decision Process and Learning under General
  Utility Functions
Risk-sensitive Markov Decision Process and Learning under General Utility FunctionsSocial Science Research Network (SSRN), 2023
Zhengqi Wu
Renyuan Xu
172
4
0
22 Nov 2023
Provable Representation with Efficient Planning for Partial Observable
  Reinforcement Learning
Provable Representation with Efficient Planning for Partial Observable Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023
Hongming Zhang
Zhaolin Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
345
5
0
20 Nov 2023
Data-Guided Regulator for Adaptive Nonlinear Control
Data-Guided Regulator for Adaptive Nonlinear Control
Niyousha Rahimi
M. Mesbahi
200
0
0
20 Nov 2023
Low-Rank MDPs with Continuous Action Spaces
Low-Rank MDPs with Continuous Action SpacesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Andrew Bennett
Nathan Kallus
Miruna Oprescu
213
2
0
06 Nov 2023
Posterior Sampling with Delayed Feedback for Reinforcement Learning with
  Linear Function Approximation
Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function ApproximationNeural Information Processing Systems (NeurIPS), 2023
Nikki Lijing Kuang
Ming Yin
Mengdi Wang
Yu Wang
Yian Ma
265
6
0
29 Oct 2023
Unsupervised Behavior Extraction via Random Intent Priors
Unsupervised Behavior Extraction via Random Intent PriorsNeural Information Processing Systems (NeurIPS), 2023
Haotian Hu
Yiqin Yang
Jianing Ye
Ziqing Mai
Chongjie Zhang
OffRL
217
14
0
28 Oct 2023
Uncertainty-aware transfer across tasks using hybrid model-based
  successor feature reinforcement learning
Uncertainty-aware transfer across tasks using hybrid model-based successor feature reinforcement learning
Parvin Malekzadeh
Ming Hou
Konstantinos N. Plataniotis
246
1
0
16 Oct 2023
Bi-Level Offline Policy Optimization with Limited Exploration
Bi-Level Offline Policy Optimization with Limited ExplorationNeural Information Processing Systems (NeurIPS), 2023
Wenzhuo Zhou
OffRL
267
5
0
10 Oct 2023
Pessimistic Nonlinear Least-Squares Value Iteration for Offline
  Reinforcement Learning
Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2023
Qiwei Di
Heyang Zhao
Jiafan He
Quanquan Gu
OffRL
193
6
0
02 Oct 2023
Reason for Future, Act for Now: A Principled Framework for Autonomous
  LLM Agents with Provable Sample Efficiency
Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Zhihan Liu
Hao Hu
Shenao Zhang
Hongyi Guo
Shuqi Ke
Boyi Liu
Zhaoran Wang
LLMAGLRM
388
44
0
29 Sep 2023
Stackelberg Batch Policy Learning
Stackelberg Batch Policy Learning
Wenzhuo Zhou
Annie Qu
OffRL
182
1
0
28 Sep 2023
Rate-Optimal Policy Optimization for Linear Markov Decision Processes
Rate-Optimal Policy Optimization for Linear Markov Decision ProcessesInternational Conference on Machine Learning (ICML), 2023
Uri Sherman
Alon Cohen
Tomer Koren
Yishay Mansour
319
8
0
28 Aug 2023
Model-based Offline Reinforcement Learning with Count-based Conservatism
Model-based Offline Reinforcement Learning with Count-based ConservatismInternational Conference on Machine Learning (ICML), 2023
Byeongchang Kim
Min Hwan Oh
OffRL
155
14
0
21 Jul 2023
Online Network Source Optimization with Graph-Kernel MAB
Online Network Source Optimization with Graph-Kernel MAB
Laura Toni
P. Frossard
255
1
0
07 Jul 2023
Sequential Neural Barriers for Scalable Dynamic Obstacle Avoidance
Sequential Neural Barriers for Scalable Dynamic Obstacle AvoidanceIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Hong-Den Yu
Chiaki Hirayama
Chenning Yu
Sylvia Herbert
Sicun Gao
183
20
0
06 Jul 2023
Provably Efficient Iterated CVaR Reinforcement Learning with Function
  Approximation and Human Feedback
Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human FeedbackInternational Conference on Learning Representations (ICLR), 2023
Yu Chen
Yihan Du
Pihe Hu
Si-Yi Wang
De-hui Wu
Longbo Huang
271
10
0
06 Jul 2023
12345
Next