An application of zero-inflated Poisson regression for software fault prediction

被引:38
作者
Khoshgoftaar, TM [1 ]
Gao, KH [1 ]
Szabo, RM [1 ]
机构
[1] Florida Atlantic Univ, Empir Software Engn Lab, Dept Comp Sci & Engn, Boca Raton, FL 33431 USA
来源
12TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING, PROCEEDINGS | 2001年
关键词
software quality modeling; Poisson regression model; zero-inflated Poisson regression model; nested models; Vuong hypothesis test; program module;
D O I
10.1109/ISSRE.2001.989459
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Poisson regression model is widely used in software quality modeling. When the response variable of a data set includes a large number of zeros, Poisson regression model will underestimate the probability of zeros. A zero-inflated model changes the mean structure of the pure Poisson model. The predictive quality is therefore improved. In this paper, we examine a full-scale industrial software system and develop two models, Poisson regression and zero-inflated Poisson regression. To our knowledge, this is the,first study that introduces the zero-inflated Poisson regression model in software reliability. Comparing the predictive qualities of the two competing models, we conclude that for this system, the zero-inflated Poisson regression model is more appropriate in theory and practice.
引用
收藏
页码:66 / 73
页数:8
相关论文
共 18 条
[11]  
Khoshgoftaar T., 1999, INT J RELIABILITY QU, V6, P303, DOI DOI 10.1142/S0218539399000292
[12]   PREDICTIVE MODELING TECHNIQUES OF SOFTWARE QUALITY FROM SOFTWARE MEASURES [J].
KHOSHGOFTAAR, TM ;
MUNSON, JC ;
BHATTACHARYA, BB ;
RICHARDSON, GD .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1992, 18 (11) :979-987
[13]   Modeling software quality: The software measurement analysis and reliability toolkit [J].
Khoshgoftaar, TM ;
Allen, EB ;
Busboom, JC .
12TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2000, :54-61
[14]   ZERO-INFLATED POISSON REGRESSION, WITH AN APPLICATION TO DEFECTS IN MANUFACTURING [J].
LAMBERT, D .
TECHNOMETRICS, 1992, 34 (01) :1-14
[15]   THE RELATIONSHIP BETWEEN TRUCK ACCIDENTS AND GEOMETRIC DESIGN OF ROAD SECTIONS - POISSON VERSUS NEGATIVE BINOMIAL REGRESSIONS [J].
MIAOU, SP .
ACCIDENT ANALYSIS AND PREVENTION, 1994, 26 (04) :471-482
[16]   SPECIFICATION AND TESTING OF SOME MODIFIED COUNT DATA MODELS [J].
MULLAHY, J .
JOURNAL OF ECONOMETRICS, 1986, 33 (03) :341-365
[17]  
SZABO RM, 2000, TRCSE0056 FLOR ATL U
[18]   LIKELIHOOD RATIO TESTS FOR MODEL SELECTION AND NON-NESTED HYPOTHESES [J].
VUONG, QH .
ECONOMETRICA, 1989, 57 (02) :307-333