Data mining static code attributes to learn defect predictors

Cited by: 917

Authors:

Menzies, Tim [1]

Greenwald, Jeremy

Frank, Art

Affiliations:

[1] W Virginia Univ, Lane Dept Comp Sci & Elect Engn, Morgantown, WV 26506 USA

[2] Portland State Univ, Dept Comp Sci, Portland, OR 97207 USA

Source:

IEEE TRANSACTIONS ON SOFTWARE ENGINEERING | 2007 / Vol. 33 / No. 1

Keywords:

data mining; defect prediction; McCabe; Halstead; artificial intelligence; empirical; naive Bayes

DOI:

10.1109/TSE.2007.256941

Chinese Library Classification (CLC) number:

TP31 [Computer Software]

Subject classification codes:

081202; 0835

Abstract:

The value of using static code attributes to learn defect predictors has been widely debated. Prior work has explored issues like the merits of "McCabes versus Halstead versus lines of code counts" for generating defect predictors. We show here that such debates are irrelevant since how the attributes are used to build predictors is much more important than which particular attributes are used. Also, contrary to prior pessimism, we show that such defect predictors are demonstrably useful and, on the data studied here, yield predictors with a mean probability of detection of 71 percent and a mean false alarm rate of 25 percent. These predictors would be useful for prioritizing a resource-bound exploration of code that has yet to be inspected.
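
The two figures quoted in the abstract, probability of detection and false alarm rate, are conventionally derived from a confusion matrix: detection is the fraction of defective modules that the predictor flags, and false alarm is the fraction of defect-free modules that it flags anyway. The following is a minimal sketch assuming those standard definitions. The function name `detection_rates` and the counts are hypothetical, chosen only so that the arithmetic lands on the quoted percentages; they are not data from the paper.

```python
def detection_rates(tp, fn, fp, tn):
    """Return (probability of detection, false alarm rate).

    tp: defective modules flagged as defective
    fn: defective modules missed by the predictor
    fp: defect-free modules flagged as defective
    tn: defect-free modules correctly left alone
    """
    if tp + fn == 0 or fp + tn == 0:
        raise ValueError("need at least one defective and one defect-free module")
    prob_detect = tp / (tp + fn)   # share of real defects that were caught
    false_alarm = fp / (fp + tn)   # share of clean modules wrongly flagged
    return prob_detect, false_alarm


# Hypothetical counts for illustration only (not taken from the paper).
prob_detect, false_alarm = detection_rates(tp=71, fn=29, fp=100, tn=300)
print(f"pd={prob_detect:.2f} pf={false_alarm:.2f}")  # pd=0.71 pf=0.25
```

Under these assumed counts, a team inspecting only the flagged modules would look at 171 of 500 modules and still reach 71 of the 100 defective ones, which is the kind of resource-bound prioritization the abstract describes.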


Pages: 2-13

Page count: 12
