Anomaly Detection in Large Scale Networks with Latent Space Models

Wesley Lee; Tyler H. McCormick; Joshua Neil; Cole Sodja; Yanran Cui

Anomaly Detection in Large Scale Networks with Latent Space Models

Wesley Lee ,
Tyler H. McCormick ,
Joshua Neil ,
Cole Sodja ,
Yanran Cui

Technometrics | July 2021 , pp. 1-23

Download BibTex

We develop a real-time anomaly detection algorithm for directed activity on large, sparse networks. We model the propensity for future activity using a dynamic logistic model with interaction terms for sender- and receiver-specific latent factors in addition to sender- and receiver-specific popularity scores; deviations from this underlying model constitute potential anomalies. Latent nodal attributes are estimated via a variational Bayesian approach and may change over time, representing natural shifts in network activity. Estimation is augmented with a case-control approximation to take advantage of the sparsity of the network and reduces computational complexity from $O(N^2)$ to $O(E)$, where $N$ is the number of nodes and $E$ is the number of observed edges. We run our algorithm on network event records collected from an enterprise network of over 25,000 computers and are able to identify a red team attack with half the detection rate required of the model without latent interaction terms.