Tolerance to missing data using a likelihood ratio based classifier for computer-aided classification of breast cancer

被引:6
作者
Bilska-Wolak, AO
Floyd, CE
机构
[1] Duke Univ, Dept Biomed Engn, Durham, NC 27708 USA
[2] Duke Univ, Dept Radiol, Durham, NC 27708 USA
关键词
D O I
10.1088/0031-9155/49/18/003
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
While mammography is a highly sensitive method for detecting breast tumours, its ability to differentiate between malignant and benign lesions is low, which may result in as many as 70% of unnecessary biopsies. The purpose of this study was to develop a highly specific computer-aided diagnosis algorithm to improve classification of mammographic masses. A classifier based on the likelihood ratio was developed to accommodate cases with missing data. Data for development included 671 biopsy cases (245 malignant), with biopsy-proved outcome. Sixteen features based on the BI-RADS(TM) lexicon and patient history had been recorded for the cases, with 1.3+/-1.1 missing feature values per case. Classifier evaluation methods included receiver operating characteristic and leave-one-out bootstrap sampling. The classifier achieved 32% specificity at 100% sensitivity on the 671 cases with 16 features that had missing values. Utilizing just the seven features present for all cases resulted in decreased performance at 100% sensitivity with average 19% specificity. No cases and no feature data were omitted during classifier development, showing that it is more beneficial to utilize cases with missing values than to discard incomplete cases that cannot be handled by many algorithms. Classification of mammographic masses was commendable at high sensitivity levels, indicating that benign cases could be potentially spared from biopsy.
引用
收藏
页码:4219 / 4237
页数:19
相关论文
共 44 条
[1]  
[Anonymous], 1992, MULTIVARIATE DENSITY
[2]   Some unlikely properties of the likelihood ratio and its logarithm [J].
Barrett, HH ;
Abbey, CK ;
Clarkson, E .
IMAGE PERCEPTION: MEDICAL IMAGING 1998, 1998, 3340 :65-77
[3]  
BEALE EML, 1975, J ROY STAT SOC B MET, V37, P129
[4]   Development and evaluation of a case-based reasoning classifier for prediction of breast biopsy outcome with BI-RADS™ lexicon [J].
Bilska-Wolak, AO ;
Floyd, CE .
MEDICAL PHYSICS, 2002, 29 (09) :2090-2100
[5]  
BILSKAWOLAK AO, 2002, SPIE MED IMAGING 200, P661
[6]  
*BIRADS, 1998, AM COLL RAD BREAST I
[7]   Knowledge-based computer-aided detection of masses on digitized mammograms: A preliminary assessment [J].
Chang, YH ;
Hardesty, LA ;
Hakim, CM ;
Chang, TS ;
Zheng, B ;
Good, WF ;
Gur, D .
MEDICAL PHYSICS, 2001, 28 (04) :455-461
[8]  
DIXON JM, 1992, LANCET, V339, P128
[9]   Computer-aided diagnosis in radiology: potential and pitfalls [J].
Doi, K ;
MacMahon, H ;
Katsuragawa, S ;
Nishikawa, RM ;
Jiang, YL .
EUROPEAN JOURNAL OF RADIOLOGY, 1999, 31 (02) :97-109
[10]   Improvements on cross-validation: The .632+ bootstrap method [J].
Efron, B ;
Tibshirani, R .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1997, 92 (438) :548-560