ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2509.16941
  4. Cited By
SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?
v1v2 (latest)

SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?

21 September 2025
Xiang Deng
Jeff Da
Edwin Pan
Yannis Yiming He
Charles Ide
Kanak Garg
Niklas Lauffer
Andrew Park
Nitin Pasari
Chetan Rane
Karmini Sampath
Maya Krishnan
Srivatsa Kundurthy
Sean Hendryx
Zifan Wang
Vijay Bharadwaj
Jeff Holm
Bing Liu
Chen Bo Calvin Zhang
Noah Jacobson
Bing Liu
Brad Kenstler
ArXiv (abs)PDFHTMLHuggingFace (19 upvotes)Github (213★)

Papers citing "SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?"

3 / 3 papers shown
Title
Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly?
Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly?
Chunqiu Steven Xia
Zhe Wang
Yan Yang
Yuxiang Wei
Lingming Zhang
LLMAG
20
0
0
17 Nov 2025
ReCode: Updating Code API Knowledge with Reinforcement Learning
ReCode: Updating Code API Knowledge with Reinforcement Learning
Haoze Wu
Yunzhi Yao
Wenhao Yu
Ningyu Zhang
SyDa
63
3
0
25 Jun 2025
RFCAudit: An LLM Agent for Functional Bug Detection in Network Protocols
RFCAudit: An LLM Agent for Functional Bug Detection in Network Protocols
Mingwei Zheng
Chengpeng Wang
Xuwei Liu
Jinyao Guo
Shiwei Feng
Xiangyu Zhang
137
5
0
31 May 2025
1