Building Coding Agents via Entropy-Enhanced Multi-Turn Preference Optimization

15 September 2025

Jiahao Yu

Zelei Cheng

Xian Wu

Xinyu Xing

ArXiv (abs)PDF HTML Github (9★)

Main:9 Pages

5 Figures

Bibliography:3 Pages

5 Tables

Appendix:8 Pages

Abstract

Software engineering presents complex, multi-step challenges for Large Language Models (LLMs), requiring reasoning over large codebases and coordinated tool use. The difficulty of these tasks is exemplified by benchmarks like SWE-bench, where current LLMs still struggle to resolve real-world issues.

View on arXiv

Comments on this paper