Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.19557
Cited By
Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones?
26 February 2025
Yudi Zhang
Lu Wang
Meng Fang
Yali Du
Chenghua Huang
Jun Wang
Qingwei Lin
Mykola Pechenizkiy
Dongmei Zhang
Saravan Rajmohan
Qi Zhang
ALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones?"
Title
No papers