Application of rank correlation, clustering and classification in information security


Autoria(s): Beliakov, Gleb; Yearwood, John; Kelarev, Andrei
Data(s)

01/06/2012

Resumo

This article is devoted to experimental investigation of a novel application of a clustering technique introduced by the authors recently in order to use robust and stable consensus functions in information security, where it is often necessary to process large data sets and monitor outcomes in real time, as it is required, for example, for intrusion detection. Here we concentrate on a particular case of application to profiling of phishing websites. First, we apply several independent clustering algorithms to a randomized sample of data to obtain independent initial clusterings. Silhouette index is used to determine the number of clusters. Second, rank correlation is used to select a subset of features for dimensionality reduction. We investigate the effectiveness of the Pearson Linear Correlation Coefficient, the Spearman Rank Correlation Coefficient and the Goodman--Kruskal Correlation Coefficient in this application. Third, we use a consensus function to combine independent initial clusterings into one consensus clustering. Fourth, we train fast supervised classification algorithms on the resulting consensus clustering in order to enable them to process the whole large data set as well as new data. The precision and recall of classifiers at the final stage of this scheme are critical for the effectiveness of the whole procedure. We investigated various combinations of several correlation coefficients, consensus functions, and a variety of supervised classification algorithms.<br />

Identificador

http://hdl.handle.net/10536/DRO/DU:30046944

Idioma(s)

eng

Publicador

Academy Publisher

Relação

http://dro.deakin.edu.au/eserv/DU:30046944/beliakov-applicationofrank-2012.pdf

http://dx.doi.org/10.4304/jnw.7.6.935-945

Direitos

2012, The Author

Palavras-Chave #classification #clustering #consensus functions #phishing websites
Tipo

Journal Article