High-dimensional and large-scale anomaly detection using a linear one-class SVM with deep learning


Autoria(s): Erfani, Sarah M.; Rajasegarar, Sutharshan; Karunasekera, Shanika; Leckie, Christopher
Data(s)

01/10/2016

Resumo

High-dimensional problem domains pose significant challenges for anomaly detection. The presence of irrelevant features can conceal the presence of anomalies. This problem, known as the '. curse of dimensionality', is an obstacle for many anomaly detection techniques. Building a robust anomaly detection model for use in high-dimensional spaces requires the combination of an unsupervised feature extractor and an anomaly detector. While one-class support vector machines are effective at producing decision surfaces from well-behaved feature vectors, they can be inefficient at modelling the variation in large, high-dimensional datasets. Architectures such as deep belief networks (DBNs) are a promising technique for learning robust features. We present a hybrid model where an unsupervised DBN is trained to extract generic underlying features, and a one-class SVM is trained from the features learned by the DBN. Since a linear kernel can be substituted for nonlinear ones in our hybrid model without loss of accuracy, our model is scalable and computationally efficient. The experimental results show that our proposed model yields comparable anomaly detection performance with a deep autoencoder, while reducing its training and testing time by a factor of 3 and 1000, respectively.

Identificador

http://hdl.handle.net/10536/DRO/DU:30083533

Idioma(s)

eng

Publicador

Elsevier

Relação

LP120100529

LE120100129

http://dro.deakin.edu.au/eserv/DU:30083533/rajasegarar-highdimensional-2016.pdf

http://www.dx.doi.org/10.1016/j.patcog.2016.03.028

Direitos

2016, Elsevier

Palavras-Chave #anomaly detection #outlier detection #high-dimensional data #deep belief net #deep learning #one-class SVM #feature extraction
Tipo

Journal Article