ROC analysis with multiple classes and multiple tests: methodology and its application in microarray studies

被引:108
作者
Li, Jialiang [2 ]
Fine, Jason P. [1 ]
机构
[1] Univ Wisconsin, Dept Stat, Madison, WI 53706 USA
[2] Natl Univ Singapore, Dept Stat & Appl Probabil, Singapore 117546, Singapore
关键词
class prevalence; diagnostic accuracy; maximum likelihood estimation; multicategory classification; multinomial logistic regression;
D O I
10.1093/biostatistics/kxm050
中图分类号
Q [生物科学];
学科分类号
07 [理学]; 0710 [生物学]; 09 [农学];
摘要
The accuracy of a single diagnostic test for binary outcome can be summarized by the area under the receiver operating characteristic (ROC) curve. Volume under the surface and hypervolume under the manifold have been proposed as extensions for multiple class diagnosis (Scurfield, 1996, 1998). However, the lack of simple inferential procedures for such measures has limited their practical utility. Part of the difficulty is that calculating such quantities may not be straightforward, even with a single test. The decision rule used to generate the ROC surface requires class probability assessments, which are not provided by the tests. We develop a method based on estimating the probabilities via some procedure, for example, multinomial logistic regression. Bootstrap inferences are proposed to account for variability in estimating the probabilities and perform well in simulations. The ROC measures are compared to the correct classification rate, which depends heavily on class prevalences. An example of tumor classification with microarray data demonstrates that this property may lead to substantially different analyses. The ROC-based analysis yields notable decreases in model complexity over previous analyses.
引用
收藏
页码:566 / 576
页数:11
相关论文
共 20 条
[1]
Brieman L., 1984, WADSWORTH INC
[2]
CHEN Y, 1997, BIOMED OPTICS, V2, P364
[3]
Comparing three-class diagnostic tests by three-way ROC analysis [J].
Dreiseitl, S ;
Ohno-Machado, L ;
Binder, M .
MEDICAL DECISION MAKING, 2000, 20 (03) :323-331
[4]
Comparison of discrimination methods for the classification of tumors using gene expression data [J].
Dudoit, S ;
Fridlyand, J ;
Speed, TP .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) :77-87
[5]
Efron B., 1993, INTRO BOOTSTRAP MONO, DOI DOI 10.1201/9780429246593
[6]
The use of multiple measurements in taxonomic problems [J].
Fisher, RA .
ANNALS OF EUGENICS, 1936, 7 :179-188
[7]
Fix E., 1951, JOSEPH
[8]
A CLASS OF STATISTICS WITH ASYMPTOTICALLY NORMAL DISTRIBUTION [J].
HOEFFDING, W .
ANNALS OF MATHEMATICAL STATISTICS, 1948, 19 (03) :293-325
[9]
Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks [J].
Khan, J ;
Wei, JS ;
Ringnér, M ;
Saal, LH ;
Ladanyi, M ;
Westermann, F ;
Berthold, F ;
Schwab, M ;
Antonescu, CR ;
Peterson, C ;
Meltzer, PS .
NATURE MEDICINE, 2001, 7 (06) :673-679
[10]
Multicategory support vector machines: Theory and application to the classification of microarray data and satellite radiance data [J].
Lee, YK ;
Lin, Y ;
Wahba, G .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2004, 99 (465) :67-81