Evaluating predictive quality models derived from software measures: Lessons learned

被引:29
作者
Lanubile, F [1 ]
Visaggio, G [1 ]
机构
[1] UNIV BARI, DIPARTIMENTO INFORMAT, BARI, ITALY
基金
美国国家航空航天局;
关键词
D O I
10.1016/S0164-1212(96)00153-7
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper describes an empirical comparison of several modeling techniques for predicting the quality of software components early in the software life cycle. Using software product measures, we built models that classify components as high-risk, i.e., likely to contain faults, or low-risk, i.e., likely to be free of faults. The modeling techniques evaluated in this study include principal component analysis, discriminant analysis, logistic regression, logical classification models, layered neural networks, and holographic networks. These techniques provide a good coverage of the main problem-solving paradigms: statistical analysis, machine learning, and neural networks. Using the results of independent testing, we determined the absolute worth of the predictive models and compare their performance in terms of misclassification errors, achieved quality, and verification cost. Data came from 27 software systems, developed and tested during three years of project-intensive academic courses. A surprising result is that no model was able to effectively discriminate between components with faults and components without faults. (C) 1997 Elsevier Science Inc.
引用
收藏
页码:225 / 234
页数:10
相关论文
共 33 条
[1]  
Agresti A., 1990, Analysis of categorical data
[2]  
[Anonymous], 1989, SAS STAT US GUID VER
[3]  
[Anonymous], 1998, Practical nonparametric statistics
[4]  
[Anonymous], 1995, ISERN9503
[5]  
BOETTICHER G, 1993, P 5 INT C SOFTW ENG, P271
[6]  
BRIAND LC, 1993, PROC INT CONF SOFTW, P55, DOI 10.1109/ICSE.1993.346056
[7]   DEVELOPING INTERPRETABLE MODELS WITH OPTIMIZED SET REDUCTION FOR IDENTIFYING HIGH-RISK SOFTWARE COMPONENTS [J].
BRIAND, LC ;
BASILI, VR ;
HETMANSKI, CJ .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1993, 19 (11) :1028-1044
[8]  
Dillon W.R., 1984, MULTIVARIATE ANAL ME
[9]   IDENTIFYING REUSABLE SOFTWARE COMPONENTS BY INDUCTION [J].
ESTEVA, JC ;
REYNOLDS, RG .
INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 1991, 1 (03) :271-292
[10]   SOFTWARE MEASUREMENT - A NECESSARY SCIENTIFIC BASIS [J].
FENTON, N .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1994, 20 (03) :199-206