ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.10814
  4. Cited By
FLAMBE: Structural Complexity and Representation Learning of Low Rank
  MDPs

FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs

18 June 2020
Alekh Agarwal
Sham Kakade
A. Krishnamurthy
Wen Sun
    OffRL
ArXivPDFHTML

Papers citing "FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs"

50 / 188 papers shown
Title
LoRe: Personalizing LLMs via Low-Rank Reward Modeling
LoRe: Personalizing LLMs via Low-Rank Reward Modeling
Avinandan Bose
Zhihan Xiong
Yuejie Chi
Simon S. Du
Lin Xiao
Maryam Fazel
26
0
0
20 Apr 2025
A Classification View on Meta Learning Bandits
A Classification View on Meta Learning Bandits
Mirco Mutti
Jeongyeol Kwon
Shie Mannor
Aviv Tamar
23
0
0
06 Apr 2025
Offline Action-Free Learning of Ex-BMDPs by Comparing Diverse Datasets
Offline Action-Free Learning of Ex-BMDPs by Comparing Diverse Datasets
Alexander Levine
Peter Stone
Amy Zhang
OffRL
52
0
0
26 Mar 2025
Accelerating Multi-Task Temporal Difference Learning under Low-Rank Representation
Yitao Bai
Sihan Zeng
Justin Romberg
Thinh T. Doan
OffRL
41
0
0
03 Mar 2025
Towards Understanding the Benefit of Multitask Representation Learning in Decision Process
Rui Lu
Yang Yue
Andrew Zhao
S. Du
Gao Huang
OffRL
52
1
0
01 Mar 2025
Towards Auto-Regressive Next-Token Prediction: In-Context Learning Emerges from Generalization
Towards Auto-Regressive Next-Token Prediction: In-Context Learning Emerges from Generalization
Zixuan Gong
Xiaolin Hu
Huayi Tang
Yong Liu
33
0
0
24 Feb 2025
Towards General-Purpose Model-Free Reinforcement Learning
Scott Fujimoto
P. DÓro
Amy Zhang
Yuandong Tian
Michael Rabbat
OffRL
34
3
0
28 Jan 2025
On the Linear Speedup of Personalized Federated Reinforcement Learning
  with Shared Representations
On the Linear Speedup of Personalized Federated Reinforcement Learning with Shared Representations
Guojun Xiong
Shufan Wang
Daniel Jiang
Jian Li
FedML
73
1
0
22 Nov 2024
Learning in Markov Games with Adaptive Adversaries: Policy Regret,
  Fundamental Barriers, and Efficient Algorithms
Learning in Markov Games with Adaptive Adversaries: Policy Regret, Fundamental Barriers, and Efficient Algorithms
Thanh Nguyen-Tang
Raman Arora
74
1
0
01 Nov 2024
Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise
  Matrix Estimation
Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation
Stefan Stojanovic
Yassir Jedra
Alexandre Proutiere
31
0
0
30 Oct 2024
Primal-Dual Spectral Representation for Off-policy Evaluation
Primal-Dual Spectral Representation for Off-policy Evaluation
Yang Hu
Tianyi Chen
Na Li
Kai Wang
Bo Dai
OffRL
32
0
0
23 Oct 2024
Inverse Reinforcement Learning from Non-Stationary Learning Agents
Inverse Reinforcement Learning from Non-Stationary Learning Agents
Kavinayan P. Sivakumar
Yi Shen
Zachary I. Bell
Scott A. Nivison
Boyuan Chen
Michael M. Zavlanos
16
1
0
18 Oct 2024
Guarantees for Nonlinear Representation Learning: Non-identical
  Covariates, Dependent Data, Fewer Samples
Guarantees for Nonlinear Representation Learning: Non-identical Covariates, Dependent Data, Fewer Samples
Thomas T. Zhang
Bruce D. Lee
Ingvar M. Ziemann
George J. Pappas
Nikolai Matni
CML
OOD
36
0
0
15 Oct 2024
Accelerated Preference Optimization for Large Language Model Alignment
Accelerated Preference Optimization for Large Language Model Alignment
Jiafan He
Huizhuo Yuan
Q. Gu
26
1
0
08 Oct 2024
Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory
Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory
Alexander Levine
Peter Stone
Amy Zhang
OffRL
34
0
0
03 Oct 2024
Large Language Models as Markov Chains
Large Language Models as Markov Chains
Oussama Zekri
Ambroise Odonnat
Abdelhakim Benechehab
Linus Bleistein
Nicolas Boullé
I. Redko
34
9
0
03 Oct 2024
Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low
  Interaction Rank
Exploiting Structure in Offline Multi-Agent RL: The Benefits of Low Interaction Rank
Wenhao Zhan
Scott Fujimoto
Zheqing Zhu
Jason D. Lee
Daniel Jiang
Yonathan Efroni
OffRL
27
0
0
01 Oct 2024
The Central Role of the Loss Function in Reinforcement Learning
The Central Role of the Loss Function in Reinforcement Learning
Kaiwen Wang
Nathan Kallus
Wen Sun
OffRL
41
7
0
19 Sep 2024
High-Dimensional Sparse Data Low-rank Representation via Accelerated
  Asynchronous Parallel Stochastic Gradient Descent
High-Dimensional Sparse Data Low-rank Representation via Accelerated Asynchronous Parallel Stochastic Gradient Descent
Qicong Hu
Hao Wu
13
0
0
29 Aug 2024
Misspecified $Q$-Learning with Sparse Linear Function Approximation:
  Tight Bounds on Approximation Error
Misspecified QQQ-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error
Ally Yalei Du
Lin F. Yang
Ruosong Wang
35
0
0
18 Jul 2024
Satisficing Exploration for Deep Reinforcement Learning
Satisficing Exploration for Deep Reinforcement Learning
Dilip Arumugam
Saurabh Kumar
Ramki Gummadi
Benjamin Van Roy
32
1
0
16 Jul 2024
Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
Dake Zhang
Boxiang Lyu
Shuang Qiu
Mladen Kolar
Tong Zhang
OffRL
30
0
0
10 Jul 2024
Diffusion Spectral Representation for Reinforcement Learning
Diffusion Spectral Representation for Reinforcement Learning
Dmitry Shribak
Chen-Xiao Gao
Yitong Li
Chenjun Xiao
Bo Dai
DiffM
21
3
0
23 Jun 2024
Hybrid Reinforcement Learning from Offline Observation Alone
Hybrid Reinforcement Learning from Offline Observation Alone
Yuda Song
J. Andrew Bagnell
Aarti Singh
OffRL
74
2
0
11 Jun 2024
Self-Play with Adversarial Critic: Provable and Scalable Offline
  Alignment for Language Models
Self-Play with Adversarial Critic: Provable and Scalable Offline Alignment for Language Models
Xiang Ji
Sanjeev Kulkarni
Mengdi Wang
Tengyang Xie
OffRL
35
4
0
06 Jun 2024
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy
  Evaluation
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
Jeongyeol Kwon
Shie Mannor
C. Caramanis
Yonathan Efroni
OffRL
29
2
0
03 Jun 2024
From Words to Actions: Unveiling the Theoretical Underpinnings of
  LLM-Driven Autonomous Systems
From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems
Jianliang He
Siyu Chen
Fengzhuo Zhang
Zhuoran Yang
LM&Ro
LLMAG
40
2
0
30 May 2024
Self-Exploring Language Models: Active Preference Elicitation for Online
  Alignment
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
Shenao Zhang
Donghan Yu
Hiteshi Sharma
Ziyi Yang
Shuohang Wang
Hany Hassan
Zhaoran Wang
LRM
40
28
0
29 May 2024
Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise
  Exploration-Exploitation Tradeoff
Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff
Jian Qian
Haichen Hu
David Simchi-Levi
27
1
0
28 May 2024
Provably Efficient Off-Policy Adversarial Imitation Learning with
  Convergence Guarantees
Provably Efficient Off-Policy Adversarial Imitation Learning with Convergence Guarantees
Yilei Chen
Vittorio Giammarino
James Queeney
I. Paschalidis
25
0
0
26 May 2024
Projection by Convolution: Optimal Sample Complexity for Reinforcement
  Learning in Continuous-Space MDPs
Projection by Convolution: Optimal Sample Complexity for Reinforcement Learning in Continuous-Space MDPs
Davide Maran
Alberto Maria Metelli
Matteo Papini
Marcello Restelli
34
3
0
10 May 2024
Learning Latent Dynamic Robust Representations for World Models
Learning Latent Dynamic Robust Representations for World Models
Ruixiang Sun
Hongyu Zang
Xin-hui Li
Riashat Islam
29
5
0
10 May 2024
Imitation Learning in Discounted Linear MDPs without exploration
  assumptions
Imitation Learning in Discounted Linear MDPs without exploration assumptions
Luca Viano
Stratis Skoulakis
V. Cevher
30
3
0
03 May 2024
RLHF from Heterogeneous Feedback via Personalization and Preference
  Aggregation
RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation
Chanwoo Park
Mingyang Liu
Dingwen Kong
Kaiqing Zhang
Asuman Ozdaglar
33
28
0
30 Apr 2024
Provable Interactive Learning with Hindsight Instruction Feedback
Provable Interactive Learning with Hindsight Instruction Feedback
Dipendra Kumar Misra
Aldo Pacchiano
Rob Schapire
30
1
0
14 Apr 2024
Efficient Duple Perturbation Robustness in Low-rank MDPs
Efficient Duple Perturbation Robustness in Low-rank MDPs
Yang Hu
Haitong Ma
Bo Dai
Na Li
23
0
0
11 Apr 2024
Skill Transfer and Discovery for Sim-to-Real Learning: A
  Representation-Based Viewpoint
Skill Transfer and Discovery for Sim-to-Real Learning: A Representation-Based Viewpoint
Haitong Ma
Zhaolin Ren
Bo Dai
Na Li
24
1
0
07 Apr 2024
Grid-Mapping Pseudo-Count Constraint for Offline Reinforcement Learning
Grid-Mapping Pseudo-Count Constraint for Offline Reinforcement Learning
Yi Shen
Hanyan Huang
Shan Xie
30
0
0
03 Apr 2024
Towards Principled Representation Learning from Videos for Reinforcement
  Learning
Towards Principled Representation Learning from Videos for Reinforcement Learning
Dipendra Kumar Misra
Akanksha Saran
Tengyang Xie
Alex Lamb
John Langford
SSL
OffRL
32
5
0
20 Mar 2024
Provable Risk-Sensitive Distributional Reinforcement Learning with
  General Function Approximation
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation
Yu Chen
Xiangcheng Zhang
Siwei Wang
Longbo Huang
34
3
0
28 Feb 2024
Offline Multi-task Transfer RL with Representational Penalization
Offline Multi-task Transfer RL with Representational Penalization
Avinandan Bose
S. S. Du
Maryam Fazel
OffRL
49
12
0
19 Feb 2024
Double Duality: Variational Primal-Dual Policy Optimization for
  Constrained Reinforcement Learning
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning
Zihao Li
Boyi Liu
Zhuoran Yang
Zhaoran Wang
Mengdi Wang
34
1
0
16 Feb 2024
Towards Robust Model-Based Reinforcement Learning Against Adversarial
  Corruption
Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption
Chen Ye
Jiafan He
Quanquan Gu
Tong Zhang
35
5
0
14 Feb 2024
More Benefits of Being Distributional: Second-Order Bounds for
  Reinforcement Learning
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning
Kaiwen Wang
Owen Oertell
Alekh Agarwal
Nathan Kallus
Wen Sun
OffRL
80
12
0
11 Feb 2024
No-Regret Reinforcement Learning in Smooth MDPs
No-Regret Reinforcement Learning in Smooth MDPs
Davide Maran
Alberto Maria Metelli
Matteo Papini
Marcello Restell
22
5
0
06 Feb 2024
Sample Complexity Characterization for Linear Contextual MDPs
Sample Complexity Characterization for Linear Contextual MDPs
Junze Deng
Yuan-Chia Cheng
Shaofeng Zou
Yingbin Liang
30
1
0
05 Feb 2024
Parameterized Projected Bellman Operator
Parameterized Projected Bellman Operator
Th´eo Vincent
Alberto Maria Metelli
Boris Belousov
Jan Peters
Marcello Restelli
Carlo DÉramo
23
3
0
20 Dec 2023
MICRO: Model-Based Offline Reinforcement Learning with a Conservative
  Bellman Operator
MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator
Xiao-Yin Liu
Xiao-Hu Zhou
Guo-Tao Li
Hao Li
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Zeng-Guang Hou
OffRL
29
4
0
07 Dec 2023
Provable Representation with Efficient Planning for Partial Observable
  Reinforcement Learning
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Tongzheng Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
45
3
0
20 Nov 2023
Provably Efficient CVaR RL in Low-rank MDPs
Provably Efficient CVaR RL in Low-rank MDPs
Yulai Zhao
Wenhao Zhan
Xiaoyan Hu
Ho-fung Leung
Farzan Farnia
Wen Sun
Jason D. Lee
27
4
0
20 Nov 2023
1234
Next