Performance of reclassification statistics in comparing risk prediction models

被引:98
作者
Cook, Nancy R. [1 ]
Paynter, Nina P. [1 ]
机构
[1] Brigham & Womens Hosp, Div Prevent Med, Boston, MA 02115 USA
关键词
Calibration; Discrimination; Model accuracy; Prediction; Reclassification; PENCINA; M.J; ET-AL; OF-FIT TEST; LOGISTIC-REGRESSION; ROC CURVE; CARDIOVASCULAR RISK; AREA; MARKER; ABILITY; IMPROVEMENT; R-2;
D O I
10.1002/bimj.201000078
中图分类号
Q [生物科学];
学科分类号
090105 [作物生产系统与生态工程];
摘要
Concerns have been raised about the use of traditional measures of model fit in evaluating risk prediction models for clinical use, and reclassification tables have been suggested as an alternative means of assessing the clinical utility of a model. Several measures based on the table have been proposed, including the reclassification calibration (RC) statistic, the net reclassification improvement (NRI), and the integrated discrimination improvement (IDI), but the performance of these in practical settings has not been fully examined. We used simulations to estimate the type I error and power for these statistics in a number of scenarios, as well as the impact of the number and type of categories, when adding a new marker to an established or reference model. The type I error was found to be reasonable in most settings, and power was highest for the IDI, which was similar to the test of association. The relative power of the RC statistic, a test of calibration, and the NRI, a test of discrimination, varied depending on the model assumptions. These tools provide unique but complementary information.
引用
收藏
页码:237 / 258
页数:22
相关论文
共 43 条
[1]
Ash A, 1999, STAT MED, V18, P375, DOI 10.1002/(SICI)1097-0258(19990228)18:4<375::AID-SIM20>3.3.CO
[2]
2-A
[3]
Using relative utility curves to evaluate risk prediction [J].
Baker, Stuart G. ;
Cook, Nancy R. ;
Vickers, Andrew ;
Kramer, Barnett S. .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2009, 172 :729-748
[4]
Executive summary of the Third Report of the National Cholesterol Education Program (NCEP) expert panel on detection, evaluation, and treatment of high blood cholesterol in adults (Adult Treatment Panel III) [J].
Cleeman, JI ;
Grundy, SM ;
Becker, D ;
Clark, LT ;
Cooper, RS ;
Denke, MA ;
Howard, WJ ;
Hunninghake, DB ;
Illingworth, DR ;
Luepker, RV ;
McBride, P ;
McKenney, JM ;
Pasternak, RC ;
Stone, NJ ;
Van Horn, L ;
Brewer, HB ;
Ernst, ND ;
Gordon, D ;
Levy, D ;
Rifkind, B ;
Rossouw, JE ;
Savage, P ;
Haffner, SM ;
Orloff, DG ;
Proschan, MA ;
Schwartz, JS ;
Sempos, CT ;
Shero, ST ;
Murray, EZ .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2001, 285 (19) :2486-2497
[5]
Cook NR, 2008, STAT MED, V27, P191, DOI 10.1002/sim.2987
[6]
Use and misuse of the receiver operating characteristic curve in risk prediction [J].
Cook, Nancy R. .
CIRCULATION, 2007, 115 (07) :928-935
[7]
The effect of including C-reactive protein in cardiovascular risk prediction models for women [J].
Cook, Nancy R. ;
Buring, Julie E. ;
Ridker, Paul M. .
ANNALS OF INTERNAL MEDICINE, 2006, 145 (01) :21-29
[8]
Comment: Measures to Summarize and Compare the Predictive Capacity of Markers [J].
Cook, Nancy R. .
INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2010, 6 (01)
[9]
Advances in Measuring the Effect of Individual Predictors of Cardiovascular Risk: The Role of Reclassification Measures [J].
Cook, Nancy R. ;
Ridker, Paul M. .
ANNALS OF INTERNAL MEDICINE, 2009, 150 (11) :795-W143
[10]
D'Agostino R.B., 1997, Proceedings of the Section on Biometrics, P253