Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.09724
Cited By
Taming Overconfidence in LLMs: Reward Calibration in RLHF
13 October 2024
Jixuan Leng
Chengsong Huang
Banghua Zhu
Jiaxin Huang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Taming Overconfidence in LLMs: Reward Calibration in RLHF"
Title
No papers