SHOTGUN CORRELATIONS IN SOFTWARE MEASURES

被引:24
作者
COURTNEY, RE
GUSTAFSON, DA
机构
[1] TULANE UNIV,DEPT COMP SCI,NEW ORLEANS,LA 70118
[2] KANSAS STATE UNIV AGR & APPL SCI,DEPT COMP & INFORMAT SCI,MANHATTAN,KS 66506
来源
SOFTWARE ENGINEERING JOURNAL | 1993年 / 8卷 / 01期
关键词
D O I
10.1049/sej.1993.0002
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many software measures have been forwarded on the simple basis of a high linear correlation coefficient with some measurable quantities. The linear correlation coefficient is an unreliable statistic for deciding whether an observed correlation indicates significant association. Several published software measure experiments collected more than 20 different measurements, or have 14 or fewer observations. With considerable data from small samples, the probability of 'discovering' a 'significant' correlation is high. We present a computer simulation experiment where the correlation between sets of randomly generated numbers is calculated. We also look at randomly generated numbers in the ranges that would be expected in Halstead's Software Science [1] measures. Our results show that the average maximum linear correlation for randomly generated numbers is 0.70 or higher if the sample size is low compared to the number of variables. Alternative statistical approaches to obtaining meaningful significant results are presented.
引用
收藏
页码:5 / 13
页数:9
相关论文
共 17 条
[1]   SOFTWARE FUNCTION, SOURCE LINES OF CODE, AND DEVELOPMENT EFFORT PREDICTION - A SOFTWARE SCIENCE VALIDATION [J].
ALBRECHT, AJ ;
GAFFNEY, JE .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1983, 9 (06) :639-648
[2]  
BAKER AL, 1987, 20TH P HAW INT C SYS, P23
[3]   METRIC ANALYSIS AND DATA VALIDATION ACROSS FORTRAN PROJECTS [J].
BASILI, VR ;
SELBY, RW ;
PHILLIPS, TY .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1983, 9 (06) :652-663
[4]   MEASURING SOFTWARE-DESIGN COMPLEXITY [J].
CARD, DN ;
AGRESTI, WW .
JOURNAL OF SYSTEMS AND SOFTWARE, 1988, 8 (03) :185-197
[5]  
CHAPIN N, 1979, P NCC, P995
[6]   SOME BASIC DETERMINANTS OF COMPUTER PROGRAMMING PRODUCTIVITY [J].
CHRYSLER, E .
COMMUNICATIONS OF THE ACM, 1978, 21 (06) :472-483
[7]   AN ANALYSIS OF STATIC METRICS AND FAULTS IN C-SOFTWARE [J].
CRAWFORD, SG ;
MCINTOSH, AA ;
PREGIBON, D .
JOURNAL OF SYSTEMS AND SOFTWARE, 1985, 5 (01) :37-48
[8]  
EDWARDS WR, 1988, 11TH P MINN WORKSH S
[9]  
Halstead M, 1977, ELEMENTS SOFTWARE SC
[10]  
Hamer P. G., 1982, Sixth International Conference on Software Engineering, P197