Tolerance to missing data using a likelihood ratio based classifier for computer-aided classification of breast cancer

被引:6
作者
Bilska-Wolak, AO
Floyd, CE
机构
[1] Duke Univ, Dept Biomed Engn, Durham, NC 27708 USA
[2] Duke Univ, Dept Radiol, Durham, NC 27708 USA
关键词
D O I
10.1088/0031-9155/49/18/003
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
While mammography is a highly sensitive method for detecting breast tumours, its ability to differentiate between malignant and benign lesions is low, which may result in as many as 70% of unnecessary biopsies. The purpose of this study was to develop a highly specific computer-aided diagnosis algorithm to improve classification of mammographic masses. A classifier based on the likelihood ratio was developed to accommodate cases with missing data. Data for development included 671 biopsy cases (245 malignant), with biopsy-proved outcome. Sixteen features based on the BI-RADS(TM) lexicon and patient history had been recorded for the cases, with 1.3+/-1.1 missing feature values per case. Classifier evaluation methods included receiver operating characteristic and leave-one-out bootstrap sampling. The classifier achieved 32% specificity at 100% sensitivity on the 671 cases with 16 features that had missing values. Utilizing just the seven features present for all cases resulted in decreased performance at 100% sensitivity with average 19% specificity. No cases and no feature data were omitted during classifier development, showing that it is more beneficial to utilize cases with missing values than to discard incomplete cases that cannot be handled by many algorithms. Classification of mammographic masses was commendable at high sensitivity levels, indicating that benign cases could be potentially spared from biopsy.
引用
收藏
页码:4219 / 4237
页数:19
相关论文
共 44 条
[31]   The pattern of breast cancer screening utilization and its consequences [J].
Michaelson, J ;
Satija, S ;
Moore, R ;
Weber, G ;
Halpern, E ;
Garland, A ;
Puri, D ;
Kopans, DB .
CANCER, 2002, 94 (01) :37-43
[32]  
MILLER AB, 1992, CAN MED ASSOC J, V147, P1459
[33]  
Morimoto T, 2000, ANTICANCER RES, V20, P3689
[34]   OPTIMAL AND DATA-BASED HISTOGRAMS [J].
SCOTT, DW .
BIOMETRIKA, 1979, 66 (03) :605-610
[35]   Observer variation and the performance accuracy gained by averaging ratings of abnormality [J].
Swensson, RG ;
King, JL ;
Good, WF ;
Gur, D .
MEDICAL PHYSICS, 2000, 27 (08) :1920-1933
[36]  
Tabár L, 2001, CANCER-AM CANCER SOC, V91, P1724, DOI 10.1002/1097-0142(20010501)91:9<1724::AID-CNCR1190>3.0.CO
[37]  
2-V
[38]   A neural network approach to breast cancer diagnosis as a constraint satisfaction problem [J].
Tourassi, GD ;
Markey, MK ;
Lo, JY ;
Floyd, CE .
MEDICAL PHYSICS, 2001, 28 (05) :804-811
[39]  
*US CENS BUR POP D, 2001, POP AG SEX RAC HISP
[40]  
Van Trees H.L., 2013, DETECTION ESTIMATION