ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.18525
26
0

Programming with Pixels: Computer-Use Meets Software Engineering

24 February 2025
Pranjal Aggarwal
Sean Welleck
ArXivPDFHTML
Abstract

Recent advancements in software engineering (SWE) agents have largely followed a tool-based paradigm\textit{tool-based paradigm}tool-based paradigm, where agents interact with hand-engineered tool APIs to perform specific tasks. While effective for specialized tasks, these methods fundamentally lack generalization, as they require predefined tools for each task and do not scale across programming languages and domains. We introduce Programming with Pixels\texttt{Programming with Pixels}Programming with Pixels (PwP), an agent environment that unifies software development tasks by enabling computer-use agents\textit{computer-use agents}computer-use agents-agents that operate directly within an IDE through visual perception, typing, and clicking, rather than relying on predefined tool APIs. To systematically evaluate these agents, we propose PwP-Bench\texttt{PwP-Bench}PwP-Bench, a benchmark that unifies existing SWE benchmarks spanning tasks across multiple programming languages, modalities, and domains under a task-agnostic state and action space. Our experiments demonstrate that general-purpose computer-use agents can approach or even surpass specialized tool-based agents on a variety of SWE tasks without the need for hand-engineered tools. However, our analysis shows that current models suffer from limited visual grounding and fail to exploit many IDE tools that could simplify their tasks. When agents can directly access IDE tools, without visual interaction, they show significant performance improvements, highlighting the untapped potential of leveraging built-in IDE capabilities. Our results establish PwP as a scalable testbed for building and evaluating the next wave of software engineering agents. We release code and data atthis https URL

View on arXiv
@article{aggarwal2025_2502.18525,
  title={ Programming with Pixels: Computer-Use Meets Software Engineering },
  author={ Pranjal Aggarwal and Sean Welleck },
  journal={arXiv preprint arXiv:2502.18525},
  year={ 2025 }
}
Comments on this paper