Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It

23 April 2024
Yuta Saito, Masahiro Nomura
OffRL

Papers citing "Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It"

4 / 4 papers shown
Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits
Tatsuhiro Shimizu, Koichi Tanaka, Ren Kishimoto, Haruka Kiyohara, Masahiro Nomura, Yuta Saito
CML, OffRL
37
0
0
20 Aug 2024
Cross-Validated Off-Policy Evaluation
Matej Cief, B. Kveton, Michal Kompan
OffRL
20
1
0
24 May 2024
cmaes : A Simple yet Practical Python Library for CMA-ES
Masahiro Nomura, Masashi Shibata
45
20
0
02 Feb 2024
Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions
James McInerney, B. Brost, Praveen Chandar, Rishabh Mehrotra, Ben Carterette
BDL, CML, OffRL
112
55
0
25 Jul 2020