Noise Filtering of Process Execution Logs based on Outliers Detection


Autoria(s): Conforti, Raffaele; La Rosa, Marcello; ter Hofstede, Arthur H.M.
Data(s)

2015

Resumo

This paper presents a technique for the automated removal of noise from process execution logs. Noise is the result of data quality issues such as logging errors and manifests itself in the form of infrequent process behavior. The proposed technique generates an abstract representation of an event log as an automaton capturing the direct follows relations between event labels. This automaton is then pruned from arcs with low relative frequency and used to remove from the log those events not fitting the automaton, which are identified as outliers. The technique has been extensively evaluated on top of various auto- mated process discovery algorithms using both artificial logs with different levels of noise, as well as a variety of real-life logs. The results show that the technique significantly improves the quality of the discovered process model along fitness, appropriateness and simplicity, without negative effects on generalization. Further, the technique scales well to large and complex logs.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/82901/

Relação

http://eprints.qut.edu.au/82901/1/LogFiltering.pdf

Conforti, Raffaele, La Rosa, Marcello, & ter Hofstede, Arthur H.M. (2015) Noise Filtering of Process Execution Logs based on Outliers Detection.

Direitos

Copyright 2015 [please consult the authors]

Fonte

Institute for Future Environments; School of Information Systems; Science & Engineering Faculty

Palavras-Chave #080000 INFORMATION AND COMPUTING SCIENCES #080300 COMPUTER SOFTWARE #080600 INFORMATION SYSTEMS #080609 Information Systems Management #Business Process Management #Process Mining #Noise Filtering #Outlier Detection
Tipo

Report