2509.22638
Language Models Can Learn from Verbal Feedback Without Scalar Rewards
26 September 2025
Renjie Luo
Zichen Liu
Xiangyan Liu
Chao Du
Min Lin
Wenhu Chen
Wei Lu
Tianyu Pang
OffRL
HuggingFace (62 upvotes)
Github (2045★)
Papers citing
"Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
1 / 1 papers shown
Moloch's Bargain: Emergent Misalignment When LLMs Compete for Audiences
Batu El
J. Zou
07 Oct 2025