A rule-based approach for process discovery: Dealing with noise and imbalance in process logs

Cited by: 57
Authors
Maruster, Laura
Weijters, A. J. M. M.
Van der Aalst, Wil M. P.
Van den Bosch, Antal
Affiliations
[1] Univ Groningen, NL-9700 AV Groningen, Netherlands
[2] Eindhoven Univ Technol, NL-5600 MB Eindhoven, Netherlands
[3] Tilburg Univ, NL-5000 LE Tilburg, Netherlands
Keywords
rule induction; process mining; knowledge discovery; Petri nets
DOI
10.1007/s10618-005-0029-z
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Effective information systems require explicit process models. A completely specified process design needs to be developed in order to enact a given business process. This development is time-consuming and often subjective and incomplete. We propose a method that constructs the process model from process log data by determining the relations between process tasks. To predict these relations, we employ a machine learning technique to induce rule sets. These rule sets are induced from simulated process log data generated by varying process characteristics such as noise and log size. Tests reveal that the induced rule sets have a high predictive accuracy on new data. The effects of noise and of imbalance in execution priorities on the discovery of the relations between process tasks are also discussed. Once the causal, exclusive, and parallel relations are known, a process model expressed in the Petri net formalism can be built. We illustrate our approach with real-world data in a case study.
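As a rough illustration of what "determining the relations between process tasks" from a log can look like, the sketch below labels task pairs as causal, parallel, or exclusive from direct-succession counts. It is not the method of the paper: the abstract describes rule sets induced by machine learning, whereas this sketch swaps in a fixed threshold on raw counts (similar in spirit to the alpha algorithm). The function names, the `min_count` parameter, and the toy log are all hypothetical. Note that this crude rule also labels tasks that never occur next to each other (such as A and E below) as exclusive.

```python
from collections import Counter
from itertools import combinations


def direct_succession_counts(traces):
    """Count how often task a is directly followed by task b over all traces."""
    counts = Counter()
    for trace in traces:
        for a, b in zip(trace, trace[1:]):
            counts[(a, b)] += 1
    return counts


def classify_relations(traces, min_count=1):
    """Label each task pair using a fixed threshold (an assumption, not induced rules)."""
    counts = direct_succession_counts(traces)
    tasks = sorted({task for trace in traces for task in trace})
    relations = {}
    for a, b in combinations(tasks, 2):
        a_then_b = counts[(a, b)] >= min_count
        b_then_a = counts[(b, a)] >= min_count
        if a_then_b and b_then_a:
            relations[(a, b)] = "parallel"   # seen in both orders
        elif a_then_b:
            relations[(a, b)] = "causal"     # a directly precedes b only
        elif b_then_a:
            relations[(b, a)] = "causal"     # b directly precedes a only
        else:
            relations[(a, b)] = "exclusive"  # never directly adjacent
    return relations


# Hypothetical toy log: after A, either B and C run in any order, or D runs alone.
log = [
    ["A", "B", "C", "E"],
    ["A", "C", "B", "E"],
    ["A", "D", "E"],
]
relations = classify_relations(log)
for pair, label in sorted(relations.items()):
    print(pair, label)
```

Raising `min_count` is one simple way to make such a threshold rule less sensitive to noisy traces, at the cost of missing rarely executed paths; the abstract's point is that induced rule sets are meant to handle this trade-off better than a hand-picked cutoff.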
Pages: 67-87
Number of pages: 21