ResearchTrend.AI
arXiv: 2503.12854
Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation


17 March 2025
Songjun Tu, Jiahao Lin, Xiangyu Tian, Qichao Zhang, Linjing Li, Y. Fu, Nan Xu, Wei He, Xiangyuan Lan, D. Jiang, Dongbin Zhao
Tags: LRM

Papers citing "Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation"

1 / 1 papers shown
A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility
Andreas Hochlehnert, Hardik Bhatnagar, Vishaal Udandarao, Samuel Albanie, Ameya Prabhu, Matthias Bethge
Tags: ReLM, ALM, LRM
58
4
0
09 Apr 2025