Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2409.06903
Cited By
Semi-Supervised Reward Modeling via Iterative Self-Training
10 September 2024
Yifei He
Haoxiang Wang
Ziyan Jiang
Alexandros Papangelis
Han Zhao
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Semi-Supervised Reward Modeling via Iterative Self-Training"
1 / 1 papers shown
Title
Revisiting Self-Training for Neural Sequence Generation
Junxian He
Jiatao Gu
Jiajun Shen
MarcÁurelio Ranzato
SSL
LRM
236
252
0
30 Sep 2019
1