Statistical evaluation of prognostic versus diagnostic models: Beyond the ROC curve

被引:545
作者
Cook, Nancy R. [1 ,2 ]
机构
[1] Harvard Univ, Sch Med, Brigham & Womens Hosp, Dept Med,Div Prevent Med, Boston, MA 02115 USA
[2] Harvard Univ, Sch Publ Hlth, Dept Epidemiol, Boston, MA 02115 USA
关键词
D O I
10.1373/clinchem.2007.096529
中图分类号
R446 [实验室诊断]; R-33 [实验医学、医学实验];
学科分类号
1001 ;
摘要
BACKGROUND: - Diagnostic and prognostic or predictive models serve different purposes. Whereas diagnostic models are usually used for classification, prognostic models incorporate the dimension of time, adding a stochastic element. CONTENT: The ROC curve is typically used to evaluate clinical utility for both diagnostic and prognostic models. This curve assesses how well a test or model discriminates, or separates individuals into two classes, such as diseased and nondiseased. A strong risk predictor, such as lipids for cardiovascular disease, may have limited impact on the area under the curve, called the AUC or c-statistic, even if it alters predicted values. Calibration, measuring whether predicted probabilities agree with observed proportions, is another component of model accuracy important to assess. Reclassification can directly compare the clinical impact of two models by determining how many individuals would be reclassified into clinically relevant risk strata. For example, adding high-sensitivity C-reactive protein and family history to prediction models for cardiovascular disease using traditional risk factors moves approximately 30% of those at intermediate risk levels, such as 5%-10% or 10%-20% 10-year risk, into higher or lower risk categories, despite little change in the c-statistic. A calibration statistic can asses how well the new predicted values agree with those observed in the cross -classified data. SUMMARY: Although it is useful for classification, evaluation of prognostic models should not rely solely on the ROC curve, but should assess both discrimination and calibration. Risk reclassification can aid in comparing the clinical impact of two models on risk for the individual, as well as the population. (c) 2007 American Association for Clinical Chemistry.
引用
收藏
页码:17 / 23
页数:7
相关论文
共 25 条
  • [1] The central role of receiver operating characteristic (ROC) curves in evaluating tests for the early detection of cancer
    Baker, SG
    [J]. JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2003, 95 (07) : 511 - 515
  • [2] BIGGER JT, 1978, AM J CARDIOL, V42, P202, DOI 10.1016/0002-9149(78)90901-3
  • [3] Executive summary of the Third Report of the National Cholesterol Education Program (NCEP) expert panel on detection, evaluation, and treatment of high blood cholesterol in adults (Adult Treatment Panel III)
    Cleeman, JI
    Grundy, SM
    Becker, D
    Clark, LT
    Cooper, RS
    Denke, MA
    Howard, WJ
    Hunninghake, DB
    Illingworth, DR
    Luepker, RV
    McBride, P
    McKenney, JM
    Pasternak, RC
    Stone, NJ
    Van Horn, L
    Brewer, HB
    Ernst, ND
    Gordon, D
    Levy, D
    Rifkind, B
    Rossouw, JE
    Savage, P
    Haffner, SM
    Orloff, DG
    Proschan, MA
    Schwartz, JS
    Sempos, CT
    Shero, ST
    Murray, EZ
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2001, 285 (19): : 2486 - 2497
  • [4] Time and chance: the stochastic nature of disease causation
    Coggon, DIW
    Martyn, CN
    [J]. LANCET, 2005, 365 (9468) : 1434 - 1437
  • [5] Use and misuse of the receiver operating characteristic curve in risk prediction
    Cook, Nancy R.
    [J]. CIRCULATION, 2007, 115 (07) : 928 - 935
  • [6] The effect of including C-reactive protein in cardiovascular risk prediction models for women
    Cook, Nancy R.
    Buring, Julie E.
    Ridker, Paul M.
    [J]. ANNALS OF INTERNAL MEDICINE, 2006, 145 (01) : 21 - 29
  • [7] COOK NR, 2007, STAT MED
  • [8] Improving coronary heart disease risk assessment in asymptomatic people - Role of traditional risk factors and noninvasive cardiovascular tests
    Greenland, P
    Smith, SC
    Grundy, SM
    [J]. CIRCULATION, 2001, 104 (15) : 1863 - 1867
  • [9] THE MEANING AND USE OF THE AREA UNDER A RECEIVER OPERATING CHARACTERISTIC (ROC) CURVE
    HANLEY, JA
    MCNEIL, BJ
    [J]. RADIOLOGY, 1982, 143 (01) : 29 - 36
  • [10] Harrell FE., 2001, REGRESSION MODELING, DOI DOI 10.1007/978-3-319-19425-7