ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.03561
  4. Cited By
Optimizing Audio Recommendations for the Long-Term: A Reinforcement
  Learning Perspective

Optimizing Audio Recommendations for the Long-Term: A Reinforcement Learning Perspective

7 February 2023
Lucas Maystre
Daniel Russo
Yu Zhao
    OffRL
ArXivPDFHTML

Papers citing "Optimizing Audio Recommendations for the Long-Term: A Reinforcement Learning Perspective"

3 / 3 papers shown
Title
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning
Yarden As
Bhavya Sukhija
Lenart Treven
Carmelo Sferrazza
Stelian Coros
Andreas Krause
38
1
0
12 Oct 2024
Regularization and Variance-Weighted Regression Achieves Minimax
  Optimality in Linear MDPs: Theory and Practice
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
Toshinori Kitamura
Tadashi Kozuno
Yunhao Tang
Nino Vieillard
Michal Valko
...
Olivier Pietquin
M. Geist
Csaba Szepesvári
Wataru Kumagai
Yutaka Matsuo
OffRL
35
3
0
22 May 2023
Approximation Benefits of Policy Gradient Methods with Aggregated States
Approximation Benefits of Policy Gradient Methods with Aggregated States
Daniel Russo
42
7
0
22 Jul 2020
1