Mining software repositories for comprehensible software fault prediction models

Cited by: 78
Authors
Vandecruys, Olivier [1]
Martens, David [1]
Baesens, Bart [1, 2]
Mues, Christophe [2]
De Backer, Manu [1]
Haesen, Raf [1]
Affiliations
[1] Dept Decis Sci & Informat Management, B-3000 Louvain, Belgium
[2] Univ Southampton, Sch Management, Southampton SO17 1BJ, Hants, England
Keywords
classification; software mining; fault prediction; comprehensibility; Ant Colony Optimization
DOI
10.1016/j.jss.2007.07.034
Chinese Library Classification number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Software managers are routinely confronted with software projects that contain errors or inconsistencies and exceed budget and time limits. By mining software repositories with comprehensible data mining techniques, predictive models can be induced that offer software managers the insights they need to tackle these quality and budgeting problems in an efficient way. This paper deals with the role that the Ant Colony Optimization (ACO)-based classification technique AntMiner+ can play as a comprehensible data mining technique to predict erroneous software modules. In an empirical comparison on three real-world public datasets, the rule-based models produced by AntMiner+ are shown to achieve a predictive accuracy that is competitive with that of the models induced by several other included classification techniques, such as C4.5, logistic regression and support vector machines. In addition, we will argue that the intuitiveness and comprehensibility of the AntMiner+ models can be considered superior to the latter models. © 2007 Elsevier Inc. All rights reserved.
Pages: 823-839
Page count: 17