Process discovery in event logs: An application in the telecom industry

被引:42
作者
Goedertier, Stijn [1 ]
De Weerdt, Jochen [1 ]
Martens, David [1 ,2 ]
Vanthienen, Jan [1 ]
Baesens, Bart [1 ,3 ]
机构
[1] Katholieke Univ Leuven, Dept Decis Sci & Informat Management, B-3000 Louvain, Belgium
[2] Univ Ghent, Hogesch Gent, Dept Business Adm & Publ Management, B-9000 Ghent, Belgium
[3] Univ Southampton, Sch Management, Highfield Southampton SO17 1BJ, Hants, England
关键词
Process discovery; AGNEs; HeuristicsMiner; Event logs; Genetic Miner; Data mining; Workflow management systems (WfMS); PROCESS MODELS; PETRI NETS; SUPPORT; IMPLEMENTATION; FRAMEWORK; PATTERNS; SYSTEMS;
D O I
10.1016/j.asoc.2010.04.025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The abundant availability of data is typical for information-intensive organizations. Usually, discerning knowledge from vast amounts of data is a challenge. Similarly, discovering business process models from information system event logs is definitely non-trivial. Within the analysis of event logs, process discovery, which can be defined as the automated construction of structured process models from such event logs, is an important learning task. However, the discovery of these processes poses many challenges. First of all, human-centric processes are likely to contain a lot of noise as people deviate from standard procedures. Other challenges are the discovery of so-called non-local, non-free choice constructs, duplicate activities, incomplete event logs and the inclusion of prior knowledge. In this paper, we present an empirical evaluation of three state-of-the-art process discovery techniques: Genetic Miner, AGNEs and HeuristicsMiner. Although the detailed empirical evaluation is the main contribution of this paper to the literature, an in-depth discussion of a number of different evaluation metrics for process discovery techniques and a thorough discussion of the validity issue are key contributions as well. (C) 2010 Elsevier B. V. All rights reserved.
引用
收藏
页码:1697 / 1710
页数:14
相关论文
共 47 条
  • [1] ALONSO G, 2007, LECT NOTES COMPUTER, V4714
  • [2] A REALLY TEMPORAL LOGIC
    ALUR, R
    HENZINGER, TA
    [J]. JOURNAL OF THE ACM, 1994, 41 (01) : 181 - 204
  • [3] [Anonymous], BPM0630
  • [4] [Anonymous], 2004, BETA WORKING PAPER S
  • [5] ATHENA P, 2008, BPM PROTOS PROCESS M
  • [6] Top-down induction of first-order logical decision trees
    Blockeel, H
    De Raedt, L
    [J]. ARTIFICIAL INTELLIGENCE, 1998, 101 (1-2) : 285 - 297
  • [7] Model-based clustering and visualization of navigation patterns on a web site
    Cadez, I
    Heckerman, D
    Meek, C
    Smyth, P
    White, S
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2003, 7 (04) : 399 - 424
  • [8] Deriving Petri Nets from finite transition systems
    Cortadella, J
    Kishinevsky, M
    Lavagno, L
    Yakovlev, A
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1998, 47 (08) : 859 - 882
  • [9] Genetic process mining: an experimental evaluation
    de Medeiros, A. K. A.
    Weijters, A. J. M. M.
    van der Aalst, W. M. P.
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2007, 14 (02) : 245 - 304
  • [10] Dumas M, 2005, PROCESS-AWARE INFORMATION SYSTEMS: BRIDGING PEOPLE AND SOFTWARE THROUGH PROCESS TECHNOLOGY, P1, DOI 10.1002/0471741442