Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.12854
Cited By
Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation
17 March 2025
Songjun Tu
Jiahao Lin
Xiangyu Tian
Qichao Zhang
Linjing Li
Y. Fu
Nan Xu
Wei He
Xiangyuan Lan
D. Jiang
Dongbin Zhao
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation"
1 / 1 papers shown
Title
A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility
Andreas Hochlehnert
Hardik Bhatnagar
Vishaal Udandarao
Samuel Albanie
Ameya Prabhu
Matthias Bethge
ReLM
ALM
LRM
58
4
0
09 Apr 2025
1