Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2510.07429
Cited By
Learning to Route LLMs from Bandit Feedback: One Policy, Many Trade-offs
8 October 2025
Wang Wei
Tiankai Yang
Hongjie Chen
Yue Zhao
Franck Dernoncourt
Ryan Rossi
Hoda Eldardiry
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (3 upvotes)
Papers citing
"Learning to Route LLMs from Bandit Feedback: One Policy, Many Trade-offs"
0 / 0 papers shown
Title
No papers found