External validity of predictive models: A comparison of logistic regression, classification trees, and neural networks

被引:104
作者
Terrin, N
Schmid, CH
Griffith, JL
D'Agostino, RB
Selker, HP
机构
[1] Tufts Univ, New England Med Ctr, Dept Med, Div Clin Care Res, Boston, MA 02111 USA
[2] Tufts Univ, Sch Med, Boston, MA 02111 USA
[3] Boston Univ, Dept Math & Stat, Boston, MA 02215 USA
关键词
transportability; reproducibility; validation; nonlinearity; prognostic model; diagnostic model;
D O I
10.1016/S0895-4356(03)00120-3
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background and Objective: The utility of predictive models depends on their external validity, that is, their ability to maintain accuracy when applied to patients and settings different from those on which the models were developed. We report a simulation study that compared the external validity of standard logistic regression (LR1), logistic regression with piecewise-linear and quadratic terms (LR2), classification trees, and neural networks (NNETs). Methods: We developed predictive models on data simulated from a specified population and on data from perturbed forms of the population not representative of the original distribution. All models were tested on new data generated from the population. Results: The performance of LR2 was superior to that of the other model types when the models were developed on data sampled from the population (mean receiver operating characteristic [ROC] areas 0.769, 0.741, 0.724, and 0.682, for LR2, LR1, NNETs, and trees, respectively) and when they were developed on nonrepresentative data (mean ROC areas 0.734, 0.713, 0.703, and 0.667). However, when the models developed using nonrepresentative data were compared with models developed from data sampled from the population, LR2 had the greatest loss in performance. Conclusion: Our results highlight the necessity of external validation to test the transportability of predictive models. (C) 2003 Elsevier Inc. All rights reserved.
引用
收藏
页码:721 / 729
页数:9
相关论文
共 46 条
[1]   Why models predicting bacteremia in general medical patients do not work [J].
Allison, JJ ;
Centor, RM .
JOURNAL OF GENERAL INTERNAL MEDICINE, 1996, 11 (02) :132-134
[2]  
Altman DG, 2000, STAT MED, V19, P453, DOI 10.1002/(SICI)1097-0258(20000229)19:4<453::AID-SIM350>3.3.CO
[3]  
2-X
[4]   Validation procedures in radiologic diagnostic models -: Neural network and logistic regression [J].
Arana, E ;
Delicado, P ;
Martí-Bonmatí, L .
INVESTIGATIVE RADIOLOGY, 1999, 34 (10) :636-642
[5]   WHY PREDICTIVE INDEXES PERFORM LESS WELL IN VALIDATION STUDIES - IS IT MAGIC OR METHODS [J].
CHARLSON, ME ;
ALES, KL ;
SIMON, R ;
MACKENZIE, CR .
ARCHIVES OF INTERNAL MEDICINE, 1987, 147 (12) :2155-2161
[6]  
Clark L.A., 1993, Statistical Models in S, P377
[7]  
COPAS JB, 1988, J R STAT SOC B, V50, P225
[8]  
COPAS JB, 1983, J R STAT SOC C-APPL, V32, P25
[9]   Validation of the Framingham Coronary Heart Disease prediction scores - Results of a multiple ethnic groups investigation [J].
D'Agostino, RB ;
Grundy, S ;
Sullivan, LM ;
Wilson, P .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2001, 286 (02) :180-187
[10]   PREDICTING CARDIAC COMPLICATIONS IN PATIENTS UNDERGOING NONCARDIAC SURGERY [J].
DETSKY, AS ;
ABRAMS, HB ;
MCLAUGHLIN, JR ;
DRUCKER, DJ ;
SASSON, Z ;
JOHNSTON, N ;
SCOTT, JG ;
FORBATH, N ;
HILLIARD, JR .
JOURNAL OF GENERAL INTERNAL MEDICINE, 1986, 1 (04) :211-219