ResearchTrend.AI
Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning

23 May 2024
Otmane Sakhi
Imad Aouali
Pierre Alquier
Nicolas Chopin
    OffRL

Papers citing "Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning"

2 / 2 papers shown
Title
Off-Policy Evaluation for Large Action Spaces via Policy Convolution
Noveen Sachdeva
Lequn Wang
Dawen Liang
Nathan Kallus
Julian McAuley
OffRL
18
12
0
24 Oct 2023
Pac-Bayesian Supervised Classification: The Thermodynamics of Statistical Learning
O. Catoni
135
451
0
03 Dec 2007