Three validation metrics for automated probabilistic image segmentation of brain tumours

被引:74
作者
Zou, KH
Wells, WM
Kikinis, R
Warfield, SK
机构
[1] Harvard Univ, Sch Med, Dept Hlth Care Policy, Boston, MA 02115 USA
[2] Brigham & Womens Hosp, Dept Radiol, Boston, MA 02115 USA
[3] MIT, Artificial Intelligence Lab, Cambridge, MA 02139 USA
关键词
sensitivity; specificity; receiver operating characteristic (ROC) curve; dice similarity coefficient (DSC); mutual information; expectation maximization (EM) algorithm;
D O I
10.1002/sim.1723
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The validity of brain tumour segmentation is an important issue in image processing because it has a direct impact on surgical planning-We examined the segmentation accuracy based on three two-sample validation metrics against the estimated composite latent gold standard, which was derived from several experts' manual segmentations by an EM algorithm. The distribution functions of the tumour and control pixel data were parametrically assumed to be a mixture of two beta distributions with different shape parameters. We estimated the corresponding receiver operating characteristic curve, Dice similarity coefficient, and mutual information, over all possible decision thresholds. Based on each validation metric, an optimal threshold was then computed via maximization. We illustrated these methods on MR imaging data from nine brain tumour cases of three different tumour types, each consisting of a large number of pixels. The automated segmentation yielded satisfactory accuracy with varied optimal thresholds. The performances of these validation metrics were also investigated via Monte Carlo simulation. Extensions of incorporating spatial correlation structures using a Markov random field model were considered. Copyright (C) 2004 John Wiley Sons, Ltd.
引用
收藏
页码:1259 / 1282
页数:24
相关论文
共 58 条
[1]  
Brown D, 1993, MODELS BIOL MATH STA
[2]   ADVANCES IN STATISTICAL METHODOLOGY FOR THE EVALUATION OF DIAGNOSTIC AND LABORATORY TESTS [J].
CAMPBELL, G .
STATISTICS IN MEDICINE, 1994, 13 (5-7) :499-508
[3]   Detection of prostate cancer by integration of line-scan diffusion, T2-mapping and T2-weighted magnetic resonance imaging; a multichannel statistical classifier [J].
Chan, I ;
Wells, W ;
Mulkern, RV ;
Haker, S ;
Zhang, JQ ;
Zou, KH ;
Maier, SE ;
Tempany, CMC .
MEDICAL PHYSICS, 2003, 30 (09) :2390-2398
[4]   3-DIMENSIONAL SEGMENTATION OF MR IMAGES OF THE HEAD USING PROBABILITY AND CONNECTIVITY [J].
CLINE, HE ;
LORENSEN, WE ;
KIKINIS, R ;
JOLESZ, F .
JOURNAL OF COMPUTER ASSISTED TOMOGRAPHY, 1990, 14 (06) :1037-1045
[5]  
Cover T. M., 2005, ELEM INF THEORY, DOI 10.1002/047174882X
[6]  
DEBONET JS, 1998, P SPIE, V3370
[7]  
DEBONNET JS, 1998, P COMPUTER VISION PA
[8]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[9]   MEASURES OF THE AMOUNT OF ECOLOGIC ASSOCIATION BETWEEN SPECIES [J].
DICE, LR .
ECOLOGY, 1945, 26 (03) :297-302
[10]   AN EXPONENTIAL MODEL USED FOR OPTIMAL THRESHOLD SELECTION ON ROC CURVES [J].
ENGLAND, WL .
MEDICAL DECISION MAKING, 1988, 8 (02) :120-131