Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.02646
Cited By
Prompt Optimization with Logged Bandit Data
3 April 2025
Haruka Kiyohara
Daniel Yiming Cao
Yuta Saito
Thorsten Joachims
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Prompt Optimization with Logged Bandit Data"
1 / 1 papers shown
Title
Offline RL for Natural Language Generation with Implicit Language Q Learning
Charles Burton Snell
Ilya Kostrikov
Yi Su
Mengjiao Yang
Sergey Levine
OffRL
119
101
0
05 Jun 2022
1