Feature selection and classifier performance in computer-aided diagnosis: The effect of finite sample size

被引:102
作者
Sahiner, B [1 ]
Chan, HP [1 ]
Petrick, N [1 ]
Wagner, RF [1 ]
Hadjiiski, L [1 ]
机构
[1] Univ Michigan, Dept Radiol, Ann Arbor, MI 48109 USA
关键词
feature selection; linear discriminant analysis; effects of finite sample size; computer-aided diagnosis;
D O I
10.1118/1.599017
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
In computer-aided diagnosis (CAD), a frequently used approach for distinguishing normal and abnormal cases is first to extract potentially useful features for the classification task. Effective features are then selected from this entire pool of available features. Finally, a classifier is designed using the selected features. In this study, we investigated the effect of finite sample size on classification accuracy when classifier design involves stepwise feature selection in linear discriminant analysis, which is the most commonly used feature selection algorithm for linear classifiers. The feature selection and the classifier coefficient estimation steps were considered to be cascading stages in the classifier design process. We compared the performance of the classifier when feature selection was performed on the design samples alone and on the entire set of available samples, which consisted of design and test samples. The area A(z) under the receiver operating characteristic curve was used as our performance measure. After linear classifier coefficient estimation using the design samples, we studied the hold-out and resubstitution performance estimates. The two classes were assumed to have multidimensional Gaussian distributions, with a large number of features available for feature selection. We investigated the dependence of feature selection performance on the covariance matrices and means for the two classes, and examined the effects of sample size, number of available features, and parameters of stepwise feature selection on classifier bias. Our results indicated that the resubstitution estimate was always optimistically biased, except in cases where the parameters of stepwise feature selection were chosen such that too few features were selected by the stepwise procedure. When feature selection was performed using only the design samples, the hold-out estimate was always pessimistically biased. When feature selection was performed using the entire finite sample space, the hold-out estimates could be pessimistically or optimistically biased, depending on the number of features available for selection, the number of available samples, and their statistical distribution. For our simulation conditions, these estimates were always pessimistically (conservatively) biased if the ratio of the total number of available samples per class to the number of available features was greater than five. (C) 2000 American Association of Physicists in Medicine. [S0094-2405(00)01607-2].
引用
收藏
页码:1509 / 1522
页数:14
相关论文
共 37 条
[1]  
[Anonymous], 1998, Applied regression analysis, DOI 10.1002/9781118625590
[2]  
[Anonymous], 1975, Discriminant Analysis
[3]   Computerized analysis of mammographic microcalcifications in morphological and texture feature spaces [J].
Chan, HP ;
Sahiner, B ;
Lam, KL ;
Petrick, N ;
Helvie, MA ;
Goodsitt, MM ;
Adler, DD .
MEDICAL PHYSICS, 1998, 25 (10) :2007-2019
[4]   COMPUTER-AIDED CLASSIFICATION OF MAMMOGRAPHIC MASSES AND NORMAL TISSUE - LINEAR DISCRIMINANT-ANALYSIS IN TEXTURE FEATURE SPACE [J].
CHAN, HP ;
WEI, DT ;
HELVIE, MA ;
SAHINER, B ;
ADLER, DD ;
GOODSITT, MM ;
PETRICK, N .
PHYSICS IN MEDICINE AND BIOLOGY, 1995, 40 (05) :857-876
[5]   Effects of sample size on classifier design: Quadratic and neural network classifiers [J].
Chan, HP ;
Sahiner, B ;
Wagner, RF ;
Petrick, N ;
Mossoba, J .
IMAGE PROCESSING - MEDICAL IMAGING 1997, PTS 1 AND 2, 1997, 3034 :1102-1113
[6]   Effects of sample size on classifier design for computer-aided diagnosis [J].
Chan, HP ;
Sahiner, B ;
Wagner, RF ;
Petrick, N .
MEDICAL IMAGING 1998: IMAGE PROCESSING, PTS 1 AND 2, 1998, 3338 :845-858
[7]   Classifier design for computer-aided diagnosis: Effects of finite sample size on the mean performance of classical and neural network classifiers [J].
Chan, HP ;
Sahiner, B ;
Wagner, RF ;
Petrick, N .
MEDICAL PHYSICS, 1999, 26 (12) :2654-2668
[8]   MAXIMUM-LIKELIHOOD ESTIMATION OF PARAMETERS OF SIGNAL-DETECTION THEORY AND DETERMINATION OF CONFIDENCE INTERVALS - RATING-METHOD DATA [J].
DORFMAN, DD ;
ALF, E .
JOURNAL OF MATHEMATICAL PSYCHOLOGY, 1969, 6 (03) :487-&
[9]  
Efron B., 1982, SOC IND APPL MATH CB, V38, DOI [10.1137/1.9781611970319, DOI 10.1137/1.9781611970319]
[10]   MR image texture analysis applied to the diagnosis and tracking of Alzheimer's disease [J].
Freeborough, PA ;
Fox, NC .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 1998, 17 (03) :475-479