2006.10814
FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs
18 June 2020
Alekh Agarwal
Sham Kakade
A. Krishnamurthy
Wen Sun
OffRL
Papers citing
"FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs"
50 / 188 papers shown
Title
LoRe: Personalizing LLMs via Low-Rank Reward Modeling
Avinandan Bose
Zhihan Xiong
Yuejie Chi
Simon S. Du
Lin Xiao
Maryam Fazel
26
0
0
20 Apr 2025
A Classification View on Meta Learning Bandits
Mirco Mutti
Jeongyeol Kwon
Shie Mannor
Aviv Tamar
23
0
0
06 Apr 2025
Offline Action-Free Learning of Ex-BMDPs by Comparing Diverse Datasets
Alexander Levine
Peter Stone
Amy Zhang
OffRL
52
0
0
26 Mar 2025
Accelerating Multi-Task Temporal Difference Learning under Low-Rank Representation
Yitao Bai
Sihan Zeng
Justin Romberg
Thinh T. Doan
OffRL
41
0
0
03 Mar 2025
Towards Understanding the Benefit of Multitask Representation Learning in Decision Process
Rui Lu
Yang Yue
Andrew Zhao
S. Du
Gao Huang
OffRL
52
1
0
01 Mar 2025
Towards Auto-Regressive Next-Token Prediction: In-Context Learning Emerges from Generalization
Zixuan Gong
Xiaolin Hu
Huayi Tang
Yong Liu
33
0
0
24 Feb 2025
Towards General-Purpose Model-Free Reinforcement Learning
Scott Fujimoto
P. D'Oro
Amy Zhang
Yuandong Tian
Michael Rabbat
OffRL
34
3
0
28 Jan 2025
On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations
Guojun Xiong
Shufan Wang
Daniel Jiang
Jian Li
FedML
73
1
0
22 Nov 2024
Learning in Markov Games with Adaptive Adversaries: Policy Regret, Fundamental Barriers, and Efficient Algorithms
Thanh Nguyen-Tang
Raman Arora
74
1
0
01 Nov 2024
Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation
Stefan Stojanovic
Yassir Jedra
Alexandre Proutiere
31
0
0
30 Oct 2024
Primal-Dual Spectral Representation for Off-policy Evaluation
Yang Hu
Tianyi Chen
Na Li
Kai Wang
Bo Dai
OffRL
32
0
0
23 Oct 2024
Inverse Reinforcement Learning from Non-Stationary Learning Agents
Kavinayan P. Sivakumar
Yi Shen
Zachary I. Bell
Scott A. Nivison
Boyuan Chen
Michael M. Zavlanos
16
1
0
18 Oct 2024
Guarantees for Nonlinear Representation Learning: Non-identical Covariates, Dependent Data, Fewer Samples
Thomas T. Zhang
Bruce D. Lee
Ingvar M. Ziemann
George J. Pappas
Nikolai Matni
CML
OOD
36
0
0
15 Oct 2024
Accelerated Preference Optimization for Large Language Model Alignment
Jiafan He
Huizhuo Yuan
Q. Gu
26
1
0
08 Oct 2024
Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory
Alexander Levine
Peter Stone
Amy Zhang
OffRL
34
0
0
03 Oct 2024
Large Language Models as Markov Chains
Oussama Zekri
Ambroise Odonnat
Abdelhakim Benechehab
Linus Bleistein
Nicolas Boullé
I. Redko
34
9
0
03 Oct 2024
Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank
Wenhao Zhan
Scott Fujimoto
Zheqing Zhu
Jason D. Lee
Daniel Jiang
Yonathan Efroni
OffRL
27
0
0
01 Oct 2024
The Central Role of the Loss Function in Reinforcement Learning
Kaiwen Wang
Nathan Kallus
Wen Sun
OffRL
41
7
0
19 Sep 2024
High-Dimensional Sparse Data Low-rank Representation via Accelerated Asynchronous Parallel Stochastic Gradient Descent
Qicong Hu
Hao Wu
13
0
0
29 Aug 2024
Misspecified Q-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error
Ally Yalei Du
Lin F. Yang
Ruosong Wang
35
0
0
18 Jul 2024
Satisficing Exploration for Deep Reinforcement Learning
Dilip Arumugam
Saurabh Kumar
Ramki Gummadi
Benjamin Van Roy
32
1
0
16 Jul 2024
Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
Dake Zhang
Boxiang Lyu
Shuang Qiu
Mladen Kolar
Tong Zhang
OffRL
30
0
0
10 Jul 2024
Diffusion Spectral Representation for Reinforcement Learning
Dmitry Shribak
Chen-Xiao Gao
Yitong Li
Chenjun Xiao
Bo Dai
DiffM
21
3
0
23 Jun 2024
Hybrid Reinforcement Learning from Offline Observation Alone
Yuda Song
J. Andrew Bagnell
Aarti Singh
OffRL
74
2
0
11 Jun 2024
Self-Play with Adversarial Critic: Provable and Scalable Offline Alignment for Language Models
Xiang Ji
Sanjeev Kulkarni
Mengdi Wang
Tengyang Xie
OffRL
35
4
0
06 Jun 2024
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
Jeongyeol Kwon
Shie Mannor
C. Caramanis
Yonathan Efroni
OffRL
29
2
0
03 Jun 2024
From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems
Jianliang He
Siyu Chen
Fengzhuo Zhang
Zhuoran Yang
LM&Ro
LLMAG
40
2
0
30 May 2024
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
Shenao Zhang
Donghan Yu
Hiteshi Sharma
Ziyi Yang
Shuohang Wang
Hany Hassan
Zhaoran Wang
LRM
40
28
0
29 May 2024
Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff
Jian Qian
Haichen Hu
David Simchi-Levi
27
1
0
28 May 2024
Provably Efficient Off-Policy Adversarial Imitation Learning with Convergence Guarantees
Yilei Chen
Vittorio Giammarino
James Queeney
I. Paschalidis
25
0
0
26 May 2024
Projection by Convolution: Optimal Sample Complexity for Reinforcement Learning in Continuous-Space MDPs
Davide Maran
Alberto Maria Metelli
Matteo Papini
Marcello Restelli
34
3
0
10 May 2024
Learning Latent Dynamic Robust Representations for World Models
Ruixiang Sun
Hongyu Zang
Xin-hui Li
Riashat Islam
29
5
0
10 May 2024
Imitation Learning in Discounted Linear MDPs without exploration assumptions
Luca Viano
Stratis Skoulakis
V. Cevher
30
3
0
03 May 2024
RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation
Chanwoo Park
Mingyang Liu
Dingwen Kong
Kaiqing Zhang
Asuman Ozdaglar
33
28
0
30 Apr 2024
Provable Interactive Learning with Hindsight Instruction Feedback
Dipendra Kumar Misra
Aldo Pacchiano
Rob Schapire
30
1
0
14 Apr 2024
Efficient Duple Perturbation Robustness in Low-rank MDPs
Yang Hu
Haitong Ma
Bo Dai
Na Li
23
0
0
11 Apr 2024
Skill Transfer and Discovery for Sim-to-Real Learning: A Representation-Based Viewpoint
Haitong Ma
Zhaolin Ren
Bo Dai
Na Li
24
1
0
07 Apr 2024
Grid-Mapping Pseudo-Count Constraint for Offline Reinforcement Learning
Yi Shen
Hanyan Huang
Shan Xie
30
0
0
03 Apr 2024
Towards Principled Representation Learning from Videos for Reinforcement Learning
Dipendra Kumar Misra
Akanksha Saran
Tengyang Xie
Alex Lamb
John Langford
SSL
OffRL
32
5
0
20 Mar 2024
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation
Yu Chen
Xiangcheng Zhang
Siwei Wang
Longbo Huang
34
3
0
28 Feb 2024
Offline Multi-task Transfer RL with Representational Penalization
Avinandan Bose
S. S. Du
Maryam Fazel
OffRL
49
12
0
19 Feb 2024
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning
Zihao Li
Boyi Liu
Zhuoran Yang
Zhaoran Wang
Mengdi Wang
34
1
0
16 Feb 2024
Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption
Chen Ye
Jiafan He
Quanquan Gu
Tong Zhang
35
5
0
14 Feb 2024
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
Kaiwen Wang
Owen Oertell
Alekh Agarwal
Nathan Kallus
Wen Sun
OffRL
80
12
0
11 Feb 2024
No-Regret Reinforcement Learning in Smooth MDPs
Davide Maran
Alberto Maria Metelli
Matteo Papini
Marcello Restelli
22
5
0
06 Feb 2024
Sample Complexity Characterization for Linear Contextual MDPs
Junze Deng
Yuan-Chia Cheng
Shaofeng Zou
Yingbin Liang
30
1
0
05 Feb 2024
Parameterized Projected Bellman Operator
Théo Vincent
Alberto Maria Metelli
Boris Belousov
Jan Peters
Marcello Restelli
Carlo D'Eramo
23
3
0
20 Dec 2023
MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator
Xiao-Yin Liu
Xiao-Hu Zhou
Guo-Tao Li
Hao Li
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Zeng-Guang Hou
OffRL
29
4
0
07 Dec 2023
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Tongzheng Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
45
3
0
20 Nov 2023
Provably Efficient CVaR RL in Low-rank MDPs
Yulai Zhao
Wenhao Zhan
Xiaoyan Hu
Ho-fung Leung
Farzan Farnia
Wen Sun
Jason D. Lee
27
4
0
20 Nov 2023