An upper bound on software testing effectiveness

被引:80
作者
Chen, Tsong Yueh [1 ]
Merkel, Robert [1 ]
机构
[1] Swinburne Univ Technol, Hawthorn, Vic 3122, Australia
关键词
verification; reliability; theory; measurement; software testing; failure-causing inputs; failure patterns; testing effectiveness metrics; random testing; adaptive random testing;
D O I
10.1145/1363102.1363107
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Failure patterns describe typical ways in which inputs revealing program failure are distributed across the input domain-in many cases, clustered together in contiguous regions. Based on these observations several debug testing methods have been developed. We examine the upper bound of debug testing effectiveness improvements possible through making assumptions about the shape, size and orientation of failure patterns. We consider the bounds for testing strategies with respect to minimizing the F-measure, maximizing the P-measure, and maximizing the E-measure. Surprisingly, we find that the empirically measured effectiveness of some existing methods that are not based on these assumptions is close to the theoretical upper bound of these strategies. The assumptions made to obtain the upper bound, and its further implications, are also examined.
引用
收藏
页数:27
相关论文
共 18 条
[1]   DATA DIVERSITY - AN APPROACH TO SOFTWARE FAULT TOLERANCE [J].
AMMANN, PE ;
KNIGHT, JC .
IEEE TRANSACTIONS ON COMPUTERS, 1988, 37 (04) :418-425
[2]  
BISHOP PG, 1993, FTSC, V23, P98
[3]   Proportional sampling strategy: Guidelines for software testing practitioners [J].
Chan, FT ;
Chen, TY ;
Mak, IK ;
Yu, YT .
INFORMATION AND SOFTWARE TECHNOLOGY, 1996, 38 (12) :775-782
[4]   On the statistical properties of testing effectiveness measures [J].
Chen, Tsong Yueh ;
Kuo, Fei-Ching ;
Merkel, Robert .
JOURNAL OF SYSTEMS AND SOFTWARE, 2006, 79 (05) :591-601
[5]  
Chen TY, 2004, LECT NOTES COMPUT SC, V3321, P320
[6]   Mirror adaptive random testing [J].
Chen, TY ;
Kuo, FC ;
Merkel, RG ;
Ng, SP .
INFORMATION AND SOFTWARE TECHNOLOGY, 2004, 46 (15) :1001-1010
[7]   On the expected number of failures detected by subdomain testing and random testing [J].
Chen, TY ;
Yu, YT .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1996, 22 (02) :109-119
[8]  
CIUPA I, 2006, P 1 INT WORKSH RAND, P55
[9]  
DICKINSON W, 2001, ESEC FSE, V9, P246
[10]   AN EVALUATION OF RANDOM TESTING [J].
DURAN, JW ;
NTAFOS, SC .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1984, 10 (04) :438-444