Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.03561
Cited By
Optimizing Audio Recommendations for the Long-Term: A Reinforcement Learning Perspective
7 February 2023
Lucas Maystre
Daniel Russo
Yu Zhao
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Optimizing Audio Recommendations for the Long-Term: A Reinforcement Learning Perspective"
3 / 3 papers shown
Title
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning
Yarden As
Bhavya Sukhija
Lenart Treven
Carmelo Sferrazza
Stelian Coros
Andreas Krause
38
1
0
12 Oct 2024
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
Toshinori Kitamura
Tadashi Kozuno
Yunhao Tang
Nino Vieillard
Michal Valko
...
Olivier Pietquin
M. Geist
Csaba Szepesvári
Wataru Kumagai
Yutaka Matsuo
OffRL
35
3
0
22 May 2023
Approximation Benefits of Policy Gradient Methods with Aggregated States
Daniel Russo
42
7
0
22 Jul 2020
1