Application of the mutual information criterion for feature selection in computer-aided diagnosis

Cited by: 160
Authors
Tourassi, GD [1]
Frederick, ED
Markey, MK
Floyd, CE
Affiliations
[1] Duke Univ, Med Ctr, Dept Radiol, Durham, NC 27710 USA
[2] ChemCodes Inc, Durham, NC 27713 USA
[3] Duke Univ, Med Ctr, Dept Biomed Engn, Durham, NC 27710 USA
[4] Duke Univ, Med Ctr, Dept Radiol, Durham, NC 27710 USA
[5] Duke Univ, Dept Biomed Engn, Durham, NC 27710 USA
Keywords
mutual information; feature selection; computer-assisted diagnosis; acute pulmonary embolism
DOI
10.1118/1.1418724
Chinese Library Classification (CLC) number
R8 [Special medicine]; R445 [Diagnostic imaging]
Discipline classification code
1002; 100207; 1009
Abstract
The purpose of this study was to investigate an information theoretic approach to feature selection for computer-aided diagnosis (CAD). The approach is based on the mutual information (MI) concept. MI measures the general dependence of random variables without making any assumptions about the nature of their underlying relationships. Consequently, MI can potentially offer some advantages over feature selection techniques that focus only on the linear relationships of variables. This study was based on a database of statistical texture features extracted from perfusion lung scans. The ultimate goal was to select the optimal subset of features for the computer-aided diagnosis of acute pulmonary embolism (PE). Initially, the study addressed issues regarding the approximation of MI in a limited dataset, as is often the case in CAD applications. The MI-selected features were compared with those selected using stepwise linear discriminant analysis and genetic algorithms for the same PE database. Linear and nonlinear decision models were implemented to merge the selected features into a final diagnosis. Results showed that MI is an effective feature selection criterion for nonlinear CAD models, overcoming some of the well-known limitations and computational complexities of other popular feature selection techniques in the field. © 2001 American Association of Physicists in Medicine.
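As an illustration of the idea in the abstract, the following is a minimal sketch of ranking features by their estimated MI with a class label. It is not the authors' implementation: the abstract does not state which MI estimator was used, so an equal-width histogram (plug-in) estimator is assumed here, and the names `mutual_information`, `rank_features`, and the `bins` parameter are hypothetical.

```python
import numpy as np


def mutual_information(x, y, bins=8):
    """Plug-in estimate (in bits) of MI between a continuous feature x
    and discrete class labels y, using equal-width histogram bins.

    Assumption: this simple estimator stands in for whatever
    approximation the study actually used.
    """
    x = np.asarray(x, dtype=float).ravel()
    y = np.asarray(y).ravel()

    # Discretize the feature: the interior edges map values to 0..bins-1.
    edges = np.histogram_bin_edges(x, bins=bins)
    x_idx = np.digitize(x, edges[1:-1])

    # Map class labels to 0..n_classes-1.
    classes, y_idx = np.unique(y, return_inverse=True)

    # Joint probability table from co-occurrence counts.
    joint = np.zeros((bins, classes.size))
    np.add.at(joint, (x_idx, y_idx), 1.0)
    joint /= joint.sum()

    # Marginals and the product distribution expected under independence.
    px = joint.sum(axis=1, keepdims=True)
    py = joint.sum(axis=0, keepdims=True)
    indep = px @ py

    # Sum only over occupied cells to avoid log(0).
    nz = joint > 0
    return float(np.sum(joint[nz] * np.log2(joint[nz] / indep[nz])))


def rank_features(X, y, bins=8):
    """Return feature indices sorted by decreasing estimated MI, plus scores."""
    X = np.asarray(X, dtype=float)
    scores = np.array(
        [mutual_information(X[:, j], y, bins) for j in range(X.shape[1])]
    )
    order = np.argsort(scores)[::-1]
    return order, scores


if __name__ == "__main__":
    # Synthetic demo: feature 0 determines the label through |x| > 1,
    # a dependence with almost no linear correlation; feature 1 is noise.
    rng = np.random.default_rng(0)
    informative = rng.uniform(-2.0, 2.0, size=500)
    noise = rng.normal(size=500)
    labels = (np.abs(informative) > 1.0).astype(int)
    order, scores = rank_features(np.column_stack([informative, noise]), labels)
    print("ranking:", order, "scores:", scores)
```

In the demo, the label depends on the first feature only through its magnitude, which is the kind of nonlinear relationship a purely linear criterion can miss. With limited data the plug-in estimate is biased upward and sensitive to the bin count, which relates to the small-sample approximation issue the abstract mentions.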
Pages: 2394-2402
Page count: 9