Classification-tree models of software-quality over multiple releases

Cited by: 45
Authors
Khoshgoftaar, TM [1]
Allen, EB
Jones, WD
Hudepohl, JP
Affiliations
[1] Florida Atlantic Univ, Dept Comp Sci & Engn, Boca Raton, FL 33431 USA
[2] EMERALD, Res Triangle Pk, NC 27709 USA
Keywords
CART; classification trees; fault-prone modules; software metrics; software reliability
DOI
10.1109/24.855532
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents an empirical study that evaluates software-quality models over several releases, to address the question, "How long will a model yield useful predictions?" The Classification And Regression Trees (CART) algorithm is introduced. CART can achieve a preferred balance between the two types of misclassification rates. This is desirable because misclassification of fault-prone modules often has much more severe consequences than misclassification of those that are not fault-prone. The case study developed 2 classification-tree models based on 4 consecutive releases of a very large legacy telecommunication system. Forty-two software product, process, and execution metrics were candidate predictors. Model #1 used measurements of the first release as the training data set; this model had 11 important predictors. Model #2 used measurements of the second release as the training data set; this model had 15 important predictors. Measurements of subsequent releases were evaluation data sets. Analysis of the models' predictors yielded insights into various software development practices. Both models had accuracy that would be useful to developers. One might suppose that software-quality models lose their value very quickly over successive releases due to evolution of the product and the underlying development processes. We found the models remained useful over all the releases studied.
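The "two types of misclassification rates" mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the CART algorithm and not the authors' procedure. It assumes, as a labeled convention that the abstract does not spell out, that a Type I error flags a not fault-prone module as fault-prone and a Type II error misses a fault-prone module. It also assumes that "balance" means picking the candidate threshold where the two rates are closest to equal, which is only one possible preference. The function names, risk scores, and thresholds are hypothetical.

```python
def misclassification_rates(actual, predicted):
    """Return (type_i, type_ii) for boolean labels (True = fault-prone).

    Assumed convention (not stated in the abstract):
      Type I  = not fault-prone module predicted as fault-prone
      Type II = fault-prone module predicted as not fault-prone
    """
    nfp_preds = [p for a, p in zip(actual, predicted) if not a]
    fp_preds = [p for a, p in zip(actual, predicted) if a]
    type_i = sum(1 for p in nfp_preds if p) / len(nfp_preds)
    type_ii = sum(1 for p in fp_preds if not p) / len(fp_preds)
    return type_i, type_ii


def balanced_threshold(actual, scores, thresholds):
    """Pick the threshold whose Type I and Type II rates are closest.

    A module is predicted fault-prone when its score >= threshold.
    Returns (threshold, type_i, type_ii).
    """
    best = None
    for t in thresholds:
        predicted = [s >= t for s in scores]
        type_i, type_ii = misclassification_rates(actual, predicted)
        gap = abs(type_i - type_ii)
        if best is None or gap < best[0]:
            best = (gap, t, type_i, type_ii)
    return best[1], best[2], best[3]


# Hypothetical data: 6 not fault-prone modules, then 4 fault-prone ones.
actual = [False] * 6 + [True] * 4
scores = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.35, 0.65, 0.8, 0.9]

threshold, type_i, type_ii = balanced_threshold(actual, scores, [0.25, 0.5, 0.75])
print(threshold, round(type_i, 3), round(type_ii, 3))
```

Lowering the threshold reduces Type II errors at the cost of more Type I errors, which is the trade-off the abstract describes as important when missing a fault-prone module is the costlier mistake.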
Pages: 4-11
Page count: 8