The Needle is a Thread: Finding Planted Paths in Noisy Process Trees
Maya Le
Paweł Prałat
Aaron Smith
François Théberge
Main:10 Pages
11 Figures
Bibliography:2 Pages
Appendix:3 Pages
Abstract
Motivated by applications in cybersecurity such as finding meaningful sequences of malware-related events buried inside large amounts of computer log data, we introduce the "planted path" problem and propose an algorithm to find fuzzy matchings between two trees. This algorithm can be used as a "building block" for more complicated workflows. We demonstrate usefulness of a few of such workflows in mining synthetically generated data as well as real-world ACME cybersecurity datasets.
View on arXivComments on this paper
