Scoring the Unscorables: Cyber Risk Assessment Beyond Internet Scans

Main: 20 Pages
15 Figures
Bibliography: 3 Pages
2 Tables
Abstract

In this paper we present a study on using novel data types to perform cyber risk quantification by estimating the likelihood of a data breach. We demonstrate that it is feasible to build a highly accurate cyber risk assessment model using public, readily available technology signatures obtained by crawling an organization's website. This approach overcomes the limitations of previous, similar approaches that relied on large-scale, IP-address-based scanning data, which suffers from incomplete or missing IP address mappings and is unavailable for large numbers of small and medium-sized organizations (SMEs). In comparison to scan data, technology signature data is more readily available for millions of SMEs. Our study shows that there is a strong relationship between these technology signatures and an organization's cybersecurity posture. In cross-validating our model on different cyber incident datasets, we also highlight the key differences between ransomware attack victims and the larger population of cyber incident and data breach victims.
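
To make the idea of scoring an organization from its website technology signatures concrete, the following is a minimal sketch, not the authors' model. It assumes each site is described by a set of detected technologies and fits a plain logistic regression to estimate breach likelihood. The technology names (`tech_a` to `tech_f`), the labels, the function names, and the hyperparameters are all hypothetical and chosen only for illustration; the abstract does not state which features or learning algorithm the paper uses.

```python
import math

# Hypothetical toy data: (technology signatures found on a website, breach label).
# Names and labels are made up and carry no real-world meaning.
SITES = [
    ({"tech_a", "tech_b", "tech_c"}, 1),
    ({"tech_a", "tech_b"}, 1),
    ({"tech_b", "tech_c"}, 1),
    ({"tech_d", "tech_e", "tech_f"}, 0),
    ({"tech_e", "tech_f"}, 0),
    ({"tech_d", "tech_f", "tech_c"}, 0),
]


def build_vocab(sites):
    """Collect every signature seen in the data, in a stable order."""
    return sorted(set().union(*(techs for techs, _ in sites)))


def encode(techs, vocab):
    """Binary presence vector; signatures outside the vocabulary are ignored."""
    return [1.0 if name in techs else 0.0 for name in vocab]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train(sites, vocab, lr=0.5, epochs=500):
    """Fit logistic regression with full-batch gradient descent."""
    rows = [(encode(techs, vocab), label) for techs, label in sites]
    weights = [0.0] * len(vocab)
    bias = 0.0
    n = len(rows)
    for _ in range(epochs):
        grad_w = [0.0] * len(vocab)
        grad_b = 0.0
        for x, y in rows:
            pred = sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))
            err = pred - y
            grad_b += err
            for j, xj in enumerate(x):
                grad_w[j] += err * xj
        bias -= lr * grad_b / n
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
    return weights, bias


def breach_likelihood(techs, vocab, weights, bias):
    """Estimated probability of a breach for one site's signature set."""
    x = encode(techs, vocab)
    return sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))


VOCAB = build_vocab(SITES)
W, B = train(SITES, VOCAB)
```

Calling `breach_likelihood({"tech_a", "tech_b"}, VOCAB, W, B)` returns a score in (0, 1) that can be ranked against other sites. In practice such a model would need far more data, held-out evaluation, and calibration before its output could be read as a likelihood.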

@article{sarabi2025_2506.06604,
  title={Scoring the Unscorables: Cyber Risk Assessment Beyond Internet Scans},
  author={Armin Sarabi and Manish Karir and Mingyan Liu},
  journal={arXiv preprint arXiv:2506.06604},
  year={2025}
}