Algorithm for the detection of outliers based on the theory of rough sets


Author(s): Maciá Pérez, Francisco; Berna-Martinez, Jose Vicente; Fernández Oliva, Alberto; Abreu Ortega, Miguel Alfonso
Contributor(s)

Universidad de Alicante. Departamento de Tecnología Informática y Computación

GrupoM. Redes y Middleware

Date(s)

26/05/2015

26/05/2015

01/07/2015

Abstract

Outliers are objects that show abnormal behavior with respect to their context or that have unexpected values in some of their parameters. In decision-making processes, information quality is of the utmost importance. In specific applications, an outlying data element may represent an important deviation in a production process or a damaged sensor. Therefore, the ability to detect these elements could make the difference between making a correct and an incorrect decision. This task is complicated by the large sizes of typical databases. Due to their importance in search processes in large volumes of data, researchers pay special attention to the development of efficient outlier detection techniques. This article presents a computationally efficient algorithm for the detection of outliers in large volumes of information. This proposal is based on an extension of the mathematical framework upon which the basic theory of detection of outliers, founded on Rough Set Theory, has been constructed. From this starting point, current problems are analyzed; a detection method is proposed, along with a computational algorithm that allows the performance of outlier detection tasks with an almost-linear complexity. To illustrate its viability, the results of the application of the outlier-detection algorithm to the concrete example of a large database are presented.

This work was performed as part of the Smart University Project (SmartUniversity2014) financed by the University of Alicante.

Identifier

Decision Support Systems. 2015, 75: 63-75. doi:10.1016/j.dss.2015.05.002

0167-9236 (Print)

1873-5797 (Online)

http://hdl.handle.net/10045/47027

10.1016/j.dss.2015.05.002

Language(s)

eng

Publisher

Elsevier

Relation

http://dx.doi.org/10.1016/j.dss.2015.05.002

Rights

© 2015 Elsevier B.V.

info:eu-repo/semantics/embargoedAccess

Keywords #Knowledge discovery #Detection of outliers #Rough set theory #Arquitectura y Tecnología de Computadores
Type

info:eu-repo/semantics/article