30
6

Natural Language Outlines for Code: Literate Programming in the LLM Era

Abstract

We propose using natural language outlines as a novel modality and interaction surface for providing AI assistance to developers throughout the software development process. An NL outline for a code function comprises multiple statements written in concise prose, which partition the code and summarize its main ideas in the style of literate programming. Crucially, we find that modern LLMs can generate accurate and high-quality NL outlines in practice. Moreover, NL outlines enable a bidirectional sync between code and NL, where a developer can change either code or NL and have the LLM automatically update the other. We discuss many use cases for NL outlines: they can accelerate understanding and navigation of code and diffs, simplify code maintenance, augment code search, steer code generation, and more. We then propose and compare multiple LLM prompting techniques for generating outlines and ask professional developers to judge outline quality. Finally, we present two case studies applying NL outlines toward code review and malware detection.

View on arXiv
@article{shi2025_2408.04820,
  title={ Natural Language Outlines for Code: Literate Programming in the LLM Era },
  author={ Kensen Shi and Deniz Altınbüken and Saswat Anand and Mihai Christodorescu and Katja Grünwedel and Alexa Koenings and Sai Naidu and Anurag Pathak and Marc Rasi and Fredde Ribeiro and Brandon Ruffin and Siddhant Sanyam and Maxim Tabachnyk and Sara Toth and Roy Tu and Tobias Welp and Pengcheng Yin and Manzil Zaheer and Satish Chandra and Charles Sutton },
  journal={arXiv preprint arXiv:2408.04820},
  year={ 2025 }
}
Comments on this paper