TOSSS: a CVE-based Software Security Benchmark for Large Language Models
Marc Damie
Murat Bilgehan Ertan
Domenico Essoussi
Angela Makhanu
Gaëtan Peter
Roos Wensveen
- ELM
Main:6 Pages
5 Figures
Bibliography:1 Pages
2 Tables
Abstract
With their increasing capabilities, Large Language Models (LLMs) are now used across many industries. They have become useful tools for software engineers and support a wide range of development tasks. As LLMs are increasingly used in software development workflows, a critical question arises: are LLMs good at software security? At the same time, organizations worldwide invest heavily in cybersecurity to reduce exposure to disruptive attacks. The integration of LLMs into software engineering workflows may introduce new vulnerabilities and weaken existing security efforts.
View on arXivComments on this paper
