Small-sample precision of ROC-related estimates

被引:238
作者
Hanczar, Blaise [2 ]
Hua, Jianping [1 ]
Sima, Chao [1 ]
Weinstein, John [3 ]
Bittner, Michael [1 ]
Dougherty, Edward R. [1 ,3 ,4 ]
机构
[1] Translat Genom Res Inst, Computat Biol Div, Phoenix, AZ USA
[2] Univ Paris 05, LIPADE, Paris, France
[3] Univ Texas MD Anderson Canc Ctr, Dept Bioinformat & Computat Biol, Houston, TX 77030 USA
[4] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX USA
基金
美国国家科学基金会;
关键词
GENE-EXPRESSION SIGNATURE; CLASSIFICATION; PERFORMANCE;
D O I
10.1093/bioinformatics/btq037
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The receiver operator characteristic (ROC) curves are commonly used in biomedical applications to judge the performance of a discriminant across varying decision thresholds. The estimated ROC curve depends on the true positive rate (TPR) and false positive rate (FPR), with the key metric being the area under the curve (AUC). With small samples these rates need to be estimated from the training data, so a natural question arises: How well do the estimates of the AUC, TPR and FPR compare with the true metrics? Results: Through a simulation study using data models and analysis of real microarray data, we show that (i) for small samples the root mean square differences of the estimated and true metrics are considerable; (ii) even for large samples, there is only weak correlation between the true and estimated metrics; and (iii) generally, there is weak regression of the true metric on the estimated metric. For classification rules, we consider linear discriminant analysis, linear support vector machine (SVM) and radial basis function SVM. For error estimation, we consider resubstitution, three kinds of crossvalidation and bootstrap. Using resampling, we show the unreliability of some published ROC results.
引用
收藏
页码:822 / 830
页数:9
相关论文
共 17 条
[1]   Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses [J].
Bhattacharjee, A ;
Richards, WG ;
Staunton, J ;
Li, C ;
Monti, S ;
Vasa, P ;
Ladd, C ;
Beheshti, J ;
Bueno, R ;
Gillette, M ;
Loda, M ;
Weber, G ;
Mark, EJ ;
Lander, ES ;
Wong, W ;
Johnson, BE ;
Golub, TR ;
Sugarbaker, DJ ;
Meyerson, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (24) :13790-13795
[2]  
BRAGA UMD, 2005, EURASIP BOOK SERIES
[3]   Exact performance of error estimators for discrete classifiers [J].
Braga-Neto, U ;
Dougherty, E .
PATTERN RECOGNITION, 2005, 38 (11) :1799-1814
[4]   Is cross-validation valid for small-sample microarray classification? [J].
Braga-Neto, UM ;
Dougherty, ER .
BIOINFORMATICS, 2004, 20 (03) :374-380
[5]   ADDITIVE ESTIMATORS FOR PROBABILITIES OF CORRECT CLASSIFICATION [J].
GLICK, N .
PATTERN RECOGNITION, 1978, 10 (03) :211-222
[6]  
Gyorfi L., 2013, A Probabilistic Theory of Pattern Recognition, V31
[7]   Decorrelation of the True and Estimated Classifier Errors in High-Dimensional Settings [J].
Hanczar, Blaise K ;
Hua, Jianping ;
Dougherty, Edward R. .
EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2007, (01)
[8]   A simple generalisation of the area under the ROC curve for multiple class classification problems [J].
Hand, DJ ;
Till, RJ .
MACHINE LEARNING, 2001, 45 (02) :171-186
[9]   Relation between permutation-test P values and classifier error estimates [J].
Hsing, T ;
Attoor, S ;
Dougherty, E .
MACHINE LEARNING, 2003, 52 (1-2) :11-30
[10]   ON A TEST OF WHETHER ONE OF 2 RANDOM VARIABLES IS STOCHASTICALLY LARGER THAN THE OTHER [J].
MANN, HB ;
WHITNEY, DR .
ANNALS OF MATHEMATICAL STATISTICS, 1947, 18 (01) :50-60