Towards a reverse engineering approach for guiding user in applying data mining


Autoria(s): Espinosa, Roberto; Mazón López, José Norberto; Zubcoff, Jose
Contribuinte(s)

Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos

Universidad de Alicante. Departamento de Ciencias del Mar y Biología Aplicada

Web and Knowledge (WaKe)

Data(s)

16/11/2012

16/11/2012

01/09/2011

Resumo

Comunicación presentada en las XVI Jornadas de Ingeniería del Software y Bases de Datos, JISBD 2011, A Coruña, 5-7 septiembre 2011.

Data mining is at the core of the knowledge discovery process. However, an initial preprocessing step is crucial for assuring reliable results within this process. Preprocessing of data is a time-consuming and non-trivial task since data quality issues should be considered. This is even worst when dealing with complex data, not only because of the different kind of complex data types (XML, multimedia, and so on), but also because of the high dimensionality of complex data. Therefore, to overcome this situation, in this position paper we propose using mechanisms based on data reverse engineering for automatically measuring some data quality criteria on the data sources. These measures will guide user in selecting the most adequate data mining algorithm in the early stages of the knowledge discovery process. Finally, it is worth noting that this work is a first step towards considering, in a systematic and structured manner, data quality criteria for supporting data miners in applying those algorithms that obtain the most reliable knowledge from the available data sources.

Identificador

http://hdl.handle.net/10045/25166

Idioma(s)

eng

Direitos

info:eu-repo/semantics/openAccess

Palavras-Chave #Data mining #Data reverse engineering #Data quality #Knowledge discovery process #Lenguajes y Sistemas Informáticos #Estadística e Investigación Operativa
Tipo

info:eu-repo/semantics/conferenceObject