What is Normal? A Big Data Observational Science Model of Anonymized Internet Traffic
Jeremy Kepner
Hayden Jananthan
Michael Jeffrey Jones
William Arcand
David Bestor
William Bergeron
Daniel Burrill
A. Buluç
Chansup Byun
Timothy Davis
V. Gadepally
Daniel Grant
Michael Houle
Matthew Hubbell
Piotr Luszczek
Lauren Milechin
Chasen Milner
Guillermo Morales
Andrew Morris
J. Mullen
Ritesh Patel
Alex Pentland
Sandeep Pisharody
Andrew Prout
Albert Reuther
Antonio Rosa
Gabriel Wachman
Charles Yee
Peter Michaleas

Abstract
Understanding what is normal is a key aspect of protecting a domain. Other domains invest heavily in observational science to develop models of normal behavior to better detect anomalies. Recent advances in high performance graph libraries, such as the GraphBLAS, coupled with supercomputers enables processing of the trillions of observations required. We leverage this approach to synthesize low-parameter observational models of anonymized Internet traffic with a high regard for privacy.
View on arXivComments on this paper