Mining taxonomies of process models

被引:33
作者
Greco, Gianluigi [2 ]
Guzzo, Antonella [1 ]
Pontieri, Luigi [3 ]
机构
[1] Univ Calabria, Dept DEIS, I-87036 Arcavacata Di Rende, Italy
[2] Univ Calabria, Dept Math, I-87036 Arcavacata Di Rende, Italy
[3] CNR, Inst ICAR, I-87036 Arcavacata Di Rende, Italy
关键词
process mining; abstraction; knowledge discovery; workflow management;
D O I
10.1016/j.datak.2008.06.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Process mining techniques have been receiving great attention in the literature for their ability to automatically support process (re)design. Typically, these techniques discover a concrete workflow schema modelling all possible execution patterns registered in a given log, which can be exploited subsequently to support further-coming enactments. In this paper, an approach to process mining is introduced that extends classical discovery mechanisms by means of an abstraction method aimed at producing a taxonomy of workflow models. The taxonomy is built to capture the process behavior at different levels of detail. Indeed, the most-detailed mined models, i.e., the leafs of the taxonomy, are meant to support the design of concrete workflows, as it happens with existing techniques in the literature. The other models, i.e., non-leaf nodes of the taxonomy, represent instead abstract views over the process behavior that can be used to support advanced monitoring and analysis tasks. All the techniques discussed in the paper have been implemented, tested, and made available as a plugin for a popular process mining framework (ProM). A series of tests, performed on different synthesized and real datasets, evidenced the capability of the approach to characterize the behavior encoded in input logs in a precise and complete way, achieving compelling conformance results even in the presence of complex behavior and noisy data. Moreover, encouraging results have been obtained in a real-life application scenario, where it is shown how the taxonomical view of the process can effectively support an explorative ex-post analysis, hinged on the different kinds of process execution discovered from the logs. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:74 / 102
页数:29
相关论文
共 52 条
[1]   Clustering documents into a web directory for bootstrapping a supervised classification [J].
Adami, G ;
Avesani, P ;
Sona, D .
DATA & KNOWLEDGE ENGINEERING, 2005, 54 (03) :301-325
[2]  
Agrawal R, 1998, LECT NOTES COMPUT SC, V1377, P469
[3]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[4]  
Agrawal R., 1994, Proceedings of the 20th International Conference on Very Large Data Bases. VLDB'94, P487
[5]  
[Anonymous], PROCEEDINGS OF THE I
[6]  
[Anonymous], 1999, P 5 ACM SIGKDD INT C, DOI DOI 10.1145/312129.312275
[7]   Inheritance of behavior [J].
Basten, T ;
van der Aalst, WMP .
JOURNAL OF LOGIC AND ALGEBRAIC PROGRAMMING, 2001, 47 (02) :47-145
[8]  
Castellanos M, 2005, PROC INT CONF DATA, P1084
[9]  
Chen KCW, 2003, LECT NOTES COMPUT SC, V2690, P887
[10]  
COOK JE, 1995, PROC INT CONF SOFTW, P73, DOI 10.1145/225014.225021