ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2308.14642
  4. Cited By
Rate-Optimal Policy Optimization for Linear Markov Decision Processes
v1v2v3 (latest)

Rate-Optimal Policy Optimization for Linear Markov Decision Processes

International Conference on Machine Learning (ICML), 2023
28 August 2023
Uri Sherman
Alon Cohen
Tomer Koren
Yishay Mansour
ArXiv (abs)PDFHTML

Papers citing "Rate-Optimal Policy Optimization for Linear Markov Decision Processes"

8 / 8 papers shown
Title
Greedy Sampling Is Provably Efficient for RLHF
Greedy Sampling Is Provably Efficient for RLHF
Di Wu
Chengshuai Shi
Jing Yang
Cong Shen
58
0
0
28 Oct 2025
Narrowing the Gap between Adversarial and Stochastic MDPs via Policy Optimization
Narrowing the Gap between Adversarial and Stochastic MDPs via Policy Optimization
D. Tiapkin
Evgenii Chzhen
Jean-Michel Poggi
264
1
0
08 Jul 2024
Warm-up Free Policy Optimization: Improved Regret in Linear Markov
  Decision Processes
Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes
Asaf B. Cassel
Aviv A. Rosenberg
290
2
0
03 Jul 2024
Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback
Near-Optimal Regret in Linear MDPs with Aggregate Bandit FeedbackInternational Conference on Machine Learning (ICML), 2024
Asaf B. Cassel
Haipeng Luo
Aviv A. Rosenberg
Dmitry Sotnikov
OffRL
251
5
0
13 May 2024
Imitation Learning in Discounted Linear MDPs without exploration
  assumptions
Imitation Learning in Discounted Linear MDPs without exploration assumptionsInternational Conference on Machine Learning (ICML), 2024
Luca Viano
Stratis Skoulakis
Volkan Cevher
183
7
0
03 May 2024
Refined Sample Complexity for Markov Games with Independent Linear
  Function Approximation
Refined Sample Complexity for Markov Games with Independent Linear Function ApproximationAnnual Conference Computational Learning Theory (COLT), 2024
Yan Dai
Qiwen Cui
S. S. Du
202
1
0
11 Feb 2024
Rethinking Model-based, Policy-based, and Value-based Reinforcement
  Learning via the Lens of Representation Complexity
Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation ComplexityNeural Information Processing Systems (NeurIPS), 2023
Guhao Feng
Han Zhong
OffRL
222
4
0
28 Dec 2023
Towards Optimal Regret in Adversarial Linear MDPs with Bandit Feedback
Towards Optimal Regret in Adversarial Linear MDPs with Bandit FeedbackInternational Conference on Learning Representations (ICLR), 2023
Haolin Liu
Chen-Yu Wei
Julian Zimmert
193
8
0
17 Oct 2023
1