2501.06248
Cited By
Utility-inspired Reward Transformations Improve Reinforcement Learning Training of Language Models
8 January 2025
Roberto-Rafael Maura-Rivero
Chirag Nagpal
Roma Patel
Francesco Visin
Papers citing
"Utility-inspired Reward Transformations Improve Reinforcement Learning Training of Language Models"
Robust Multi-Objective Controlled Decoding of Large Language Models
Seongho Son
William Bankes
Sangwoong Yoon
Shyam Sundhar Ramesh
Xiaohang Tang
Ilija Bogunovic
11 Mar 2025