Analysis and minimization of overtraining effect in rule-based classifiers for computer-aided diagnosis

被引:32
作者
Li, Q [1 ]
Doi, K [1 ]
机构
[1] Univ Chicago, Dept Radiol, Chicago, IL 60637 USA
关键词
D O I
10.1118/1.1999126
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Computer-aided diagnostic (CAD) schemes have been developed to assist radiologists detect various lesions in medical images. In CAD schemes, classifiers play a key role in achieving a high lesion detection rate and a low false-positive rate. Although many popular classifiers such as linear discriminant analysis and artificial neural networks have been employed in CAD schemes for reduction of false positives, a rule-based classifier has probably been the simplest and most frequently used one since the early days of development of various CAD schemes. However, with existing rule-based classifiers, there are major disadvantages that significantly reduce their practicality and credibility. The disadvantages include manual design, poor reproducibility, poor evaluation methods such as resubstitution, and a large overtraining effect. An automated rule-based classifier with a minimized overtraining effect can overcome or significantly reduce the extent of the above-mentioned disadvantages. In this study, we developed an "optimal" method for the selection of cutoff thresholds and a fully automated rule-based classifier. Experimental results performed with Monte Carlo simulation and a real lung nodule CT data set demonstrated that the automated threshold selection method can completely eliminate overtraining effect in the procedure of cutoff threshold selection, and thus can minimize overall overtraining effect in the constructed rule-based classifier. We believe that this threshold selection method is very useful in the construction of automated rule-based classifiers with minimized overtraining effect. (c) 2006 American Association of Physicists in Medicine.
引用
收藏
页码:320 / 328
页数:9
相关论文
共 29 条
[1]   Computerized detection of pulmonary nodules on CT scans [J].
Armato, SG ;
Giger, ML ;
Moran, CJ ;
Blackburn, JT ;
Doi, K ;
MacMahon, H .
RADIOGRAPHICS, 1999, 19 (05) :1303-1311
[2]  
BLOM G, 1958, STAT ESTIMATES TRANS, P1
[3]   Classifier design for computer-aided diagnosis: Effects of finite sample size on the mean performance of classical and neural network classifiers [J].
Chan, HP ;
Sahiner, B ;
Wagner, RF ;
Petrick, N .
MEDICAL PHYSICS, 1999, 26 (12) :2654-2668
[4]   Knowledge-based computer-aided detection of masses on digitized mammograms: A preliminary assessment [J].
Chang, YH ;
Hardesty, LA ;
Hakim, CM ;
Chang, TS ;
Zheng, B ;
Good, WF ;
Gur, D .
MEDICAL PHYSICS, 2001, 28 (04) :455-461
[5]   Identification of clustered microcalcifications on digitized mammograms using morphology and topography-based computer-aided detection schemes - A preliminary experiment [J].
Chang, YH ;
Zheng, B ;
Good, WF ;
Gur, D .
INVESTIGATIVE RADIOLOGY, 1998, 33 (10) :746-751
[6]  
DOI K, 2000, BREAST CANC, V4, P228
[7]   EFFECTS OF SAMPLE-SIZE IN CLASSIFIER DESIGN [J].
FUKUNAGA, K ;
HAYES, RR .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1989, 11 (08) :873-885
[8]   COMPUTERIZED DETECTION OF PULMONARY NODULES IN COMPUTED-TOMOGRAPHY IMAGES [J].
GIGER, ML ;
BAE, KT ;
MACMAHON, H .
INVESTIGATIVE RADIOLOGY, 1994, 29 (04) :459-465
[9]   Lung nodule detection on thoracic computed tomography images: Preliminary evaluation of a computer-aided diagnosis system [J].
Gurcan, MN ;
Sahiner, B ;
Petrick, N ;
Chan, HP ;
Kazerooni, EA ;
Cascade, PN ;
Hadjiiski, L .
MEDICAL PHYSICS, 2002, 29 (11) :2552-2558
[10]   Acute myocardial infarction detected in the 12-lead ECG by artificial neural networks [J].
Heden, B ;
Ohlin, H ;
Rittner, R ;
Edenbrandt, L .
CIRCULATION, 1997, 96 (06) :1798-1802