Building Coding Agents via Entropy-Enhanced Multi-Turn Preference Optimization

Main: 9 pages; Bibliography: 3 pages; Appendix: 8 pages; 5 figures; 5 tables

Abstract

Software engineering presents complex, multi-step challenges for Large Language Models (LLMs), requiring reasoning over large codebases and coordinated tool use. The difficulty of these tasks is exemplified by benchmarks like SWE-bench, where current LLMs still struggle to resolve real-world issues.
