Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2406.16745
Cited By
Bandits with Preference Feedback: A Stackelberg Game Perspective
24 June 2024
Barna Pásztor
Parnian Kassraie
Andreas Krause
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Bandits with Preference Feedback: A Stackelberg Game Perspective"
2 / 2 papers shown
Title
Bayesian Optimization from Human Feedback: Near-Optimal Regret Bounds
Aya Kayal
Sattar Vakili
Laura Toni
Da-shan Shiu
A. Bernacchia
149
0
0
29 May 2025
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
International Conference on Learning Representations (ICLR), 2025
Hyungkyu Kang
Min-hwan Oh
OffRL
249
2
0
07 Mar 2025
1