Software metrics data analysis - exploring the relative performance of some commonly used modeling techniques

被引:27
作者
Gray A.R. [1 ,2 ,3 ]
Macdonell S.G. [1 ,2 ,3 ,4 ]
机构
[1] Software Metrics Research Laboratory, Department of Information Science, University of Otago, Dunedin
[2] Department of Information Science, University of Otago, Dunedin
[3] NZCS, Dunedin
[4] NAFIPS, Dunedin
关键词
Software metrics; analysis; statistical methods; connectionist methods;
D O I
10.1023/A:1009849100780
中图分类号
学科分类号
摘要
Whilst some software measurement research has been unquestionably successful, other research has struggled to enable expected advances in project and process management. Contributing to this lack of advancement has been the incidence of inappropriate or non-optimal application of various model-building procedures. This obviously raises questions over the validity and reliability of any results obtained as well as the conclusions that may have been drawn regarding the appropriateness of the techniques in question. In this paper we investigate the influence of various data set characteristics and the purpose of analysis on the effectiveness of four model-building techniques - three statistical methods and one neural network method. In order to illustrate the impact of data set characteristics, three separate data sets, drawn from the literature, are used in this analysis. In terms of predictive accuracy, it is shown that no one modeling method is best in every case. Some consideration of the characteristics of data sets should therefore occur before analysis begins, so that the most appropriate modeling method is then used. Moreover, issues other than predictive accuracy may have a significant influence on the selection of model-building methods. These issues are also addressed here and a series of guidelines for selecting among and implementing these and other modeling techniques is discussed.
引用
收藏
页码:297 / 316
页数:19
相关论文
共 21 条
[1]  
Albrecht, A.J., Gaffney Jr., J.E., Software function, source lines of code, and development effort prediction: A software science validation (1983) IEEE Transactions on Software Engineering, 9 (6), pp. 639-648
[2]  
Boehm, B.W., (1981) Software Engineering Economics, , Englewood Cliffs, NJ: Prentice-Hall
[3]  
Dolado, J.J., A study of the relationships among Albrecht and Mark II function points, lines of code 4GL and effort (1997) Journal of Systems and Software, 37, pp. 161-173
[4]  
Ebrahimi, N.B., How to improve the calibration of cost models (1999) IEEE Transactions on Software Engineering, 25 (1), pp. 136-140
[5]  
Finnie, G.R., Wittig, G.E., Desharnais, J.-M., A comparision of software effort estimation techniques: Using function points with neural networks, case-based reasoning and regression models (1997) Journal of Systems and Software, 39, pp. 281-289
[6]  
Gray, A.R., MacDonell, S.G., A comparison of techniques for developing predictive models of software metrics (1997) Information and Software Technology, 39, pp. 425-437
[7]  
Hakkarainen, J., Laamanen, P., Rask, R., Neural networks in specification level software size estimation (1993) Proceedings of the 26th Hawaii International Conference on System Sciences, pp. 626-634. , Hawaii, USA, IEEE Computer Society Press
[8]  
Heiat, A., Heiat, N., A model for estimating efforts required for developing small-scale business applications (1997) Journal of Systems and Software, 39, pp. 7-14
[9]  
Hertz, J., Krogh, A., Palmer, R.G., (1991) Introduction to the Theory of Neural Computation, , Redwood City, CA: Addison-Wesley
[10]  
Hornik, K., Stinchcombe, M., White, H., Multilayer feedforward networks are universal approximators (1989) Neural Networks, 2, pp. 359-366