2407.08400
Self-training Language Models for Arithmetic Reasoning
11 July 2024
Marek Kadlčík
Michal Štefánik
KELM
ReLM
OffRL
LRM
Papers citing "Self-training Language Models for Arithmetic Reasoning"
1 / 1 papers shown
Title
KTO: Model Alignment as Prospect Theoretic Optimization
Kawin Ethayarajh
Winnie Xu
Niklas Muennighoff
Dan Jurafsky
Douwe Kiela
150
437
0
02 Feb 2024