ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.04652
  4. Cited By
Representation Learning for Online and Offline RL in Low-rank MDPs

Representation Learning for Online and Offline RL in Low-rank MDPs

9 October 2021
Masatoshi Uehara
Xuezhou Zhang
Wen Sun
    OffRL
ArXivPDFHTML

Papers citing "Representation Learning for Online and Offline RL in Low-rank MDPs"

50 / 104 papers shown
Title
Offline Action-Free Learning of Ex-BMDPs by Comparing Diverse Datasets
Offline Action-Free Learning of Ex-BMDPs by Comparing Diverse Datasets
Alexander Levine
Peter Stone
Amy Zhang
OffRL
52
0
0
26 Mar 2025
Accelerating Multi-Task Temporal Difference Learning under Low-Rank Representation
Yitao Bai
Sihan Zeng
Justin Romberg
Thinh T. Doan
OffRL
31
0
0
03 Mar 2025
Towards Understanding the Benefit of Multitask Representation Learning in Decision Process
Rui Lu
Yang Yue
Andrew Zhao
S. Du
Gao Huang
OffRL
52
1
0
01 Mar 2025
Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol
Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol
Pai Liu
Lingfeng Zhao
Shivangi Agarwal
Jinghan Liu
Audrey Huang
P. Amortila
Nan Jiang
OODD
OffRL
93
0
0
11 Feb 2025
Demystifying Linear MDPs and Novel Dynamics Aggregation Framework
Demystifying Linear MDPs and Novel Dynamics Aggregation Framework
Joongkyu Lee
Min-hwan Oh
25
2
0
31 Oct 2024
Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise
  Matrix Estimation
Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation
Stefan Stojanovic
Yassir Jedra
Alexandre Proutiere
23
0
0
30 Oct 2024
Primal-Dual Spectral Representation for Off-policy Evaluation
Primal-Dual Spectral Representation for Off-policy Evaluation
Yang Hu
Tianyi Chen
Na Li
Kai Wang
Bo Dai
OffRL
21
0
0
23 Oct 2024
Scalable spectral representations for multi-agent reinforcement learning
  in network MDPs
Scalable spectral representations for multi-agent reinforcement learning in network MDPs
Zhaolin Ren
Runyu
Zhang
Bo Dai
17
0
0
22 Oct 2024
Guarantees for Nonlinear Representation Learning: Non-identical
  Covariates, Dependent Data, Fewer Samples
Guarantees for Nonlinear Representation Learning: Non-identical Covariates, Dependent Data, Fewer Samples
Thomas T. Zhang
Bruce D. Lee
Ingvar M. Ziemann
George J. Pappas
Nikolai Matni
CML
OOD
31
0
0
15 Oct 2024
Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory
Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory
Alexander Levine
Peter Stone
Amy Zhang
OffRL
34
0
0
03 Oct 2024
Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low
  Interaction Rank
Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank
Wenhao Zhan
Scott Fujimoto
Zheqing Zhu
Jason D. Lee
Daniel Jiang
Yonathan Efroni
OffRL
22
0
0
01 Oct 2024
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Yen-Ru Lai
Fu-Chieh Chang
Pei-Yuan Wu
OffRL
64
1
0
22 Aug 2024
Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
Dake Zhang
Boxiang Lyu
Shuang Qiu
Mladen Kolar
Tong Zhang
OffRL
25
0
0
10 Jul 2024
Uncertainty-Aware Reward-Free Exploration with General Function
  Approximation
Uncertainty-Aware Reward-Free Exploration with General Function Approximation
Junkai Zhang
Weitong Zhang
Dongruo Zhou
Q. Gu
34
2
0
24 Jun 2024
Diffusion Spectral Representation for Reinforcement Learning
Diffusion Spectral Representation for Reinforcement Learning
Dmitry Shribak
Chen-Xiao Gao
Yitong Li
Chenjun Xiao
Bo Dai
DiffM
21
3
0
23 Jun 2024
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in
  Tabular MDP
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Subhojyoti Mukherjee
Josiah P. Hanna
Robert Nowak
OffRL
30
0
0
04 Jun 2024
Projection by Convolution: Optimal Sample Complexity for Reinforcement
  Learning in Continuous-Space MDPs
Projection by Convolution: Optimal Sample Complexity for Reinforcement Learning in Continuous-Space MDPs
Davide Maran
Alberto Maria Metelli
Matteo Papini
Marcello Restelli
24
3
0
10 May 2024
Contrastive Representation for Data Filtering in Cross-Domain Offline
  Reinforcement Learning
Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning
Xiaoyu Wen
Chenjia Bai
Kang Xu
Xudong Yu
Yang Zhang
Xuelong Li
Zhen Wang
34
2
0
10 May 2024
RLHF from Heterogeneous Feedback via Personalization and Preference
  Aggregation
RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation
Chanwoo Park
Mingyang Liu
Dingwen Kong
Kaiqing Zhang
Asuman Ozdaglar
23
28
0
30 Apr 2024
Efficient Duple Perturbation Robustness in Low-rank MDPs
Efficient Duple Perturbation Robustness in Low-rank MDPs
Yang Hu
Haitong Ma
Bo Dai
Na Li
23
0
0
11 Apr 2024
Skill Transfer and Discovery for Sim-to-Real Learning: A
  Representation-Based Viewpoint
Skill Transfer and Discovery for Sim-to-Real Learning: A Representation-Based Viewpoint
Haitong Ma
Zhaolin Ren
Bo Dai
Na Li
24
1
0
07 Apr 2024
Towards Principled Representation Learning from Videos for Reinforcement
  Learning
Towards Principled Representation Learning from Videos for Reinforcement Learning
Dipendra Kumar Misra
Akanksha Saran
Tengyang Xie
Alex Lamb
John Langford
SSL
OffRL
27
5
0
20 Mar 2024
Sample Efficient Myopic Exploration Through Multitask Reinforcement
  Learning with Diverse Tasks
Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks
Ziping Xu
Zifan Xu
Runxuan Jiang
Peter Stone
Ambuj Tewari
40
1
0
03 Mar 2024
Provable Risk-Sensitive Distributional Reinforcement Learning with
  General Function Approximation
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation
Yu Chen
Xiangcheng Zhang
Siwei Wang
Longbo Huang
29
3
0
28 Feb 2024
Offline Multi-task Transfer RL with Representational Penalization
Offline Multi-task Transfer RL with Representational Penalization
Avinandan Bose
S. S. Du
Maryam Fazel
OffRL
49
12
0
19 Feb 2024
Double Duality: Variational Primal-Dual Policy Optimization for
  Constrained Reinforcement Learning
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning
Zihao Li
Boyi Liu
Zhuoran Yang
Zhaoran Wang
Mengdi Wang
26
1
0
16 Feb 2024
Towards Robust Model-Based Reinforcement Learning Against Adversarial
  Corruption
Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption
Chen Ye
Jiafan He
Quanquan Gu
Tong Zhang
21
5
0
14 Feb 2024
More Benefits of Being Distributional: Second-Order Bounds for
  Reinforcement Learning
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
Kaiwen Wang
Owen Oertell
Alekh Agarwal
Nathan Kallus
Wen Sun
OffRL
80
12
0
11 Feb 2024
Model-Based RL for Mean-Field Games is not Statistically Harder than
  Single-Agent RL
Model-Based RL for Mean-Field Games is not Statistically Harder than Single-Agent RL
Jiawei Huang
Niao He
Andreas Krause
24
6
0
08 Feb 2024
No-Regret Reinforcement Learning in Smooth MDPs
No-Regret Reinforcement Learning in Smooth MDPs
Davide Maran
Alberto Maria Metelli
Matteo Papini
Marcello Restell
17
5
0
06 Feb 2024
Sample Complexity Characterization for Linear Contextual MDPs
Sample Complexity Characterization for Linear Contextual MDPs
Junze Deng
Yuan-Chia Cheng
Shaofeng Zou
Yingbin Liang
25
1
0
05 Feb 2024
On Sample-Efficient Offline Reinforcement Learning: Data Diversity,
  Posterior Sampling, and Beyond
On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond
Thanh Nguyen-Tang
Raman Arora
OffRL
15
3
0
06 Jan 2024
Provable Representation with Efficient Planning for Partial Observable
  Reinforcement Learning
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Tongzheng Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
40
3
0
20 Nov 2023
Provably Efficient CVaR RL in Low-rank MDPs
Provably Efficient CVaR RL in Low-rank MDPs
Yulai Zhao
Wenhao Zhan
Xiaoyan Hu
Ho-fung Leung
Farzan Farnia
Wen Sun
Jason D. Lee
22
4
0
20 Nov 2023
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees
Yifei Zhou
Ayush Sekhari
Yuda Song
Wen Sun
OffRL
OnRL
25
8
0
14 Nov 2023
Learning Adversarial Low-rank Markov Decision Processes with Unknown
  Transition and Full-information Feedback
Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback
Canzhe Zhao
Ruofeng Yang
Baoxiang Wang
Xuezhou Zhang
Shuai Li
22
2
0
14 Nov 2023
Low-Rank MDPs with Continuous Action Spaces
Low-Rank MDPs with Continuous Action Spaces
Andrew Bennett
Nathan Kallus
M. Oprescu
33
2
0
06 Nov 2023
A Doubly Robust Approach to Sparse Reinforcement Learning
A Doubly Robust Approach to Sparse Reinforcement Learning
Wonyoung Hedge Kim
Garud Iyengar
A. Zeevi
18
3
0
23 Oct 2023
Spectral Entry-wise Matrix Estimation for Low-Rank Reinforcement
  Learning
Spectral Entry-wise Matrix Estimation for Low-Rank Reinforcement Learning
Stefan Stojanovic
Yassir Jedra
Alexandre Proutière
13
5
0
10 Oct 2023
When is Agnostic Reinforcement Learning Statistically Tractable?
When is Agnostic Reinforcement Learning Statistically Tractable?
Zeyu Jia
Gene Li
Alexander Rakhlin
Ayush Sekhari
Nathan Srebro
OffRL
12
5
0
09 Oct 2023
Representation Learning in Low-rank Slate-based Recommender Systems
Representation Learning in Low-rank Slate-based Recommender Systems
Yijia Dai
Wen Sun
OffRL
11
0
0
10 Sep 2023
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs
Yuan-Chia Cheng
J. Yang
Yitao Liang
OOD
25
1
0
10 Aug 2023
The Optimal Approximation Factors in Misspecified Off-Policy Value
  Function Estimation
The Optimal Approximation Factors in Misspecified Off-Policy Value Function Estimation
P. Amortila
Nan Jiang
Csaba Szepesvári
OffRL
13
3
0
25 Jul 2023
Efficient Model-Free Exploration in Low-Rank MDPs
Efficient Model-Free Exploration in Low-Rank MDPs
Zakaria Mhammedi
Adam Block
Dylan J. Foster
Alexander Rakhlin
OffRL
14
13
0
08 Jul 2023
Provably Efficient UCB-type Algorithms For Learning Predictive State
  Representations
Provably Efficient UCB-type Algorithms For Learning Predictive State Representations
Ruiquan Huang
Yitao Liang
J. Yang
OffRL
16
5
0
01 Jul 2023
Eigensubspace of Temporal-Difference Dynamics and How It Improves Value
  Approximation in Reinforcement Learning
Eigensubspace of Temporal-Difference Dynamics and How It Improves Value Approximation in Reinforcement Learning
Qiang He
Tianyi Zhou
Meng Fang
S. Maghsudi
17
4
0
29 Jun 2023
Context-lumpable stochastic bandits
Context-lumpable stochastic bandits
Chung-Wei Lee
Qinghua Liu
Yasin Abbasi-Yadkori
Chi Jin
Tor Lattimore
Csaba Szepesvári
OffRL
82
2
0
22 Jun 2023
Achieving Sample and Computational Efficient Reinforcement Learning by
  Action Space Reduction via Grouping
Achieving Sample and Computational Efficient Reinforcement Learning by Action Space Reduction via Grouping
Yining Li
Peizhong Ju
Ness B. Shroff
14
0
0
22 Jun 2023
Provably Efficient Representation Learning with Tractable Planning in
  Low-Rank POMDP
Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP
Jiacheng Guo
Zihao Li
Huazheng Wang
Mengdi Wang
Zhuoran Yang
Xuezhou Zhang
25
5
0
21 Jun 2023
Look Beneath the Surface: Exploiting Fundamental Symmetry for
  Sample-Efficient Offline RL
Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL
Peng Cheng
Xianyuan Zhan
Zhihao Wu
Wenjia Zhang
Shoucheng Song
Han Wang
Youfang Lin
Li Jiang
OffRL
30
9
0
07 Jun 2023
123
Next