Controlled automated discovery of collections of business process models


Autoria(s): Garcia-Banuelos, Luciano; Dumas, Marlon; La Rosa, Marcello; De Weerdt, Jochen; Ekanayake, Chathura C.
Data(s)

01/04/2014

Resumo

Automated process discovery techniques aim at extracting process models from information system logs. Existing techniques in this space are effective when applied to relatively small or regular logs, but generate spaghetti-like and sometimes inaccurate models when confronted to logs with high variability. In previous work, trace clustering has been applied in an attempt to reduce the size and complexity of automatically discovered process models. The idea is to split the log into clusters and to discover one model per cluster. This leads to a collection of process models – each one representing a variant of the business process – as opposed to an all-encompassing model. Still, models produced in this way may exhibit unacceptably high complexity and low fitness. In this setting, this paper presents a two-way divide-and-conquer process discovery technique, wherein the discovered process models are split on the one hand by variants and on the other hand hierarchically using subprocess extraction. Splitting is performed in a controlled manner in order to achieve user-defined complexity or fitness thresholds. Experiments on real-life logs show that the technique produces collections of models substantially smaller than those extracted by applying existing trace clustering techniques, while allowing the user to control the fitness of the resulting models.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/70567/

Publicador

Elsevier Ltd

Relação

http://eprints.qut.edu.au/70567/1/Submitted.pdf

DOI:10.1016/j.is.2014.04.006

Garcia-Banuelos, Luciano, Dumas, Marlon, La Rosa, Marcello, De Weerdt, Jochen, & Ekanayake, Chathura C. (2014) Controlled automated discovery of collections of business process models. Information Systems, 46, pp. 85-101.

Direitos

Copyright 2014 Elsevier Ltd

NOTICE: this is the author’s version of a work that was accepted for publication in Information Systems. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Information Systems, [VOL#, ISSUE#, (DATE)] DOI#ÂÂÂÂÂÂÂÂÂÂÂÂÂ

Fonte

School of Information Systems; Science & Engineering Faculty

Palavras-Chave #080600 INFORMATION SYSTEMS #process mining #process discovery #trace clustering #clone detection #process model collection
Tipo

Journal Article