ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.06394
  4. Cited By
Reward-Free Model-Based Reinforcement Learning with Linear Function
  Approximation
v1v2 (latest)

Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation

12 October 2021
Weitong Zhang
Dongruo Zhou
Quanquan Gu
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation"

25 / 25 papers shown
Title
Bayesian Optimization for Dynamic Pricing and Learning
Bayesian Optimization for Dynamic Pricing and Learning
Anush Anand
Pranav Agrawal
Tejas Bodas
20
0
0
14 Oct 2025
Statistical and Algorithmic Foundations of Reinforcement Learning
Statistical and Algorithmic Foundations of Reinforcement Learning
Yuejie Chi
Yuxin Chen
Yuting Wei
OffRL
97
0
0
19 Jul 2025
Linear Mixture Distributionally Robust Markov Decision Processes
Linear Mixture Distributionally Robust Markov Decision Processes
Zhishuai Liu
Pan Xu
186
3
0
23 May 2025
Efficient Reinforcement Learning in Probabilistic Reward Machines
Efficient Reinforcement Learning in Probabilistic Reward MachinesAAAI Conference on Artificial Intelligence (AAAI), 2024
Xiaofeng Lin
Xuezhou Zhang
160
1
0
19 Aug 2024
Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
Dake Zhang
Boxiang Lyu
Delin Qu
Mladen Kolar
Tong Zhang
OffRL
162
1
0
10 Jul 2024
Uncertainty-Aware Reward-Free Exploration with General Function
  Approximation
Uncertainty-Aware Reward-Free Exploration with General Function Approximation
Junkai Zhang
Weitong Zhang
Dongruo Zhou
Q. Gu
230
4
0
24 Jun 2024
Near-Optimal Reinforcement Learning with Self-Play under Adaptivity
  Constraints
Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints
Dan Qiao
Yu Wang
OffRL
158
3
0
02 Feb 2024
When is Agnostic Reinforcement Learning Statistically Tractable?
When is Agnostic Reinforcement Learning Statistically Tractable?Neural Information Processing Systems (NeurIPS), 2023
Zeyu Jia
Gene Li
Alexander Rakhlin
Ayush Sekhari
Nathan Srebro
OffRL
204
7
0
09 Oct 2023
On the Model-Misspecification in Reinforcement Learning
On the Model-Misspecification in Reinforcement LearningInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Yunfan Li
Lin F. Yang
198
6
0
19 Jun 2023
Provably Efficient Adversarial Imitation Learning with Unknown
  Transitions
Provably Efficient Adversarial Imitation Learning with Unknown TransitionsConference on Uncertainty in Artificial Intelligence (UAI), 2023
Tian Xu
Ziniu Li
Yang Yu
Zhimin Luo
109
11
0
11 Jun 2023
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid
  Reinforcement Learning
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Gen Li
Wenhao Zhan
Jason D. Lee
Yuejie Chi
Yuxin Chen
OffRLOnRL
195
14
0
17 May 2023
Provably Feedback-Efficient Reinforcement Learning via Active Reward
  Learning
Provably Feedback-Efficient Reinforcement Learning via Active Reward LearningNeural Information Processing Systems (NeurIPS), 2023
Dingwen Kong
Lin F. Yang
157
15
0
18 Apr 2023
Minimax-Optimal Reward-Agnostic Exploration in Reinforcement Learning
Minimax-Optimal Reward-Agnostic Exploration in Reinforcement LearningAnnual Conference Computational Learning Theory (COLT), 2023
Gen Li
Yuling Yan
Yuxin Chen
Jianqing Fan
OffRL
216
14
0
14 Apr 2023
Improved Sample Complexity for Reward-free Reinforcement Learning under
  Low-rank MDPs
Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPsInternational Conference on Learning Representations (ICLR), 2023
Yuan Cheng
Ruiquan Huang
J. Yang
Yitao Liang
OffRL
150
9
0
20 Mar 2023
Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs
Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPsInternational Conference on Machine Learning (ICML), 2023
Junkai Zhang
Weitong Zhang
Quanquan Gu
127
5
0
17 Mar 2023
Offline Reinforcement Learning with Differentiable Function
  Approximation is Provably Efficient
Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient
Ming Yin
Mengdi Wang
Yu Wang
OffRL
217
12
0
03 Oct 2022
Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning
  with Linear Function Approximation
Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning with Linear Function ApproximationInternational Conference on Learning Representations (ICLR), 2022
Dan Qiao
Yu Wang
OffRL
177
14
0
03 Oct 2022
Safe Exploration Incurs Nearly No Additional Sample Complexity for
  Reward-free RL
Safe Exploration Incurs Nearly No Additional Sample Complexity for Reward-free RLInternational Conference on Learning Representations (ICLR), 2022
Ruiquan Huang
J. Yang
Yingbin Liang
OffRL
176
9
0
28 Jun 2022
On the Statistical Efficiency of Reward-Free Exploration in Non-Linear
  RL
On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RLNeural Information Processing Systems (NeurIPS), 2022
Jinglin Chen
Aditya Modi
A. Krishnamurthy
Nan Jiang
Alekh Agarwal
181
27
0
21 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
BYOL-Explore: Exploration by Bootstrapped PredictionNeural Information Processing Systems (NeurIPS), 2022
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
182
84
0
16 Jun 2022
Computationally Efficient Horizon-Free Reinforcement Learning for Linear
  Mixture MDPs
Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPsNeural Information Processing Systems (NeurIPS), 2022
Dongruo Zhou
Quanquan Gu
160
52
0
23 May 2022
Human-in-the-loop: Provably Efficient Preference-based Reinforcement
  Learning with General Function Approximation
Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function ApproximationInternational Conference on Machine Learning (ICML), 2022
Xiaoyu Chen
Han Zhong
Zhuoran Yang
Zhaoran Wang
Liwei Wang
267
79
0
23 May 2022
Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov
  Decision Processes
Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision ProcessesInternational Conference on Machine Learning (ICML), 2022
Andrew Wagenmaker
Yifang Chen
Max Simchowitz
S. Du
Kevin Jamieson
155
53
0
26 Jan 2022
Adaptive Multi-Goal Exploration
Adaptive Multi-Goal Exploration
Jean Tarbouriech
O. D. Domingues
Pierre Ménard
Matteo Pirotta
Michal Valko
A. Lazaric
203
4
0
23 Nov 2021
Towards Continual Reinforcement Learning: A Review and Perspectives
Towards Continual Reinforcement Learning: A Review and PerspectivesJournal of Artificial Intelligence Research (JAIR), 2020
Khimya Khetarpal
Matthew D Riemer
Irina Rish
Doina Precup
CLLOffRL
310
360
0
25 Dec 2020
1