Ontologies for reusing data cleaning knowledge
Data(s) |
14/05/2013
14/05/2013
2012
|
---|---|
Resumo |
The emergence of new business models, namely, the establishment of partnerships between organizations, the chance that companies have of adding existing data on the web, especially in the semantic web, to their information, led to the emphasis on some problems existing in databases, particularly related to data quality. Poor data can result in loss of competitiveness of the organizations holding these data, and may even lead to their disappearance, since many of their decision-making processes are based on these data. For this reason, data cleaning is essential. Current approaches to solve these problems are closely linked to database schemas and specific domains. In order that data cleaning can be used in different repositories, it is necessary for computer systems to understand these data, i.e., an associated semantic is needed. The solution presented in this paper includes the use of ontologies: (i) for the specification of data cleaning operations and, (ii) as a way of solving the semantic heterogeneity problems of data stored in different sources. With data cleaning operations defined at a conceptual level and existing mappings between domain ontologies and an ontology that results from a database, they may be instantiated and proposed to the expert/specialist to be executed over that database, thus enabling their interoperability. |
Identificador |
DOI 10.1109/ICSC.2012.19 978-1-4673-4433-3 |
Idioma(s) |
eng |
Publicador |
IEEE |
Relação |
http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6337110 |
Direitos |
closedAccess |
Palavras-Chave | #Data cleaning #Ontologies #OWL #Data quality |
Tipo |
conferenceObject |