Policy Optimization as Online Learning with Mediator Feedback


AAAI Conference on Artificial Intelligence (AAAI), 2020
15 December 2020
Alberto Maria Metelli
Matteo Papini
P. D'Oro
Marcello Restelli
    OffRL

Papers citing "Policy Optimization as Online Learning with Mediator Feedback"

5 / 5 papers shown
Information Capacity Regret Bounds for Bandits with Mediator Feedback
Khaled Eldowa
Nicolò Cesa-Bianchi
Alberto Maria Metelli
Marcello Restelli
15 Feb 2024
Pure Exploration under Mediators' Feedback
Riccardo Poiani
Alberto Maria Metelli
Marcello Restelli
29 Aug 2023
Information-Theoretic Regret Bounds for Bandits with Fixed Expert Advice
Information Theory Workshop (ITW), 2023
Khaled Eldowa
Nicolò Cesa-Bianchi
Alberto Maria Metelli
Marcello Restelli
14 Mar 2023
Reward-Free Policy Space Compression for Reinforcement Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Mirco Mutti
Stefano Del Col
Marcello Restelli
22 Feb 2022
Diversity-Preserving K-Armed Bandits, Revisited
Hédi Hadiji
Sébastien Gerchinovitz
Jean-Michel Loubes
Jean-Michel Poggi
05 Oct 2020