External validity of predictive models: A comparison of logistic regression, classification trees, and neural networks

Cited by: 104

Authors:

Terrin, N

Schmid, CH

Griffith, JL

D'Agostino, RB

Selker, HP

Affiliations:

[1] Tufts Univ, New England Med Ctr, Dept Med, Div Clin Care Res, Boston, MA 02111 USA

[2] Tufts Univ, Sch Med, Boston, MA 02111 USA

[3] Boston Univ, Dept Math & Stat, Boston, MA 02215 USA

Source:

JOURNAL OF CLINICAL EPIDEMIOLOGY | 2003 / Vol. 56 / Issue 08

Keywords:

transportability; reproducibility; validation; nonlinearity; prognostic model; diagnostic model;

DOI:

10.1016/S0895-4356(03)00120-3

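Chinese Library Classification (CLC) number:

R19 [Health care organization and services (health services administration)];

Discipline classification code:

Abstract: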

Background and Objective: The utility of predictive models depends on their external validity, that is, their ability to maintain accuracy when applied to patients and settings different from those on which the models were developed. We report a simulation study that compared the external validity of standard logistic regression (LR1), logistic regression with piecewise-linear and quadratic terms (LR2), classification trees, and neural networks (NNETs). Methods: We developed predictive models on data simulated from a specified population and on data from perturbed forms of the population not representative of the original distribution. All models were tested on new data generated from the population. Results: The performance of LR2 was superior to that of the other model types when the models were developed on data sampled from the population (mean receiver operating characteristic [ROC] areas 0.769, 0.741, 0.724, and 0.682, for LR2, LR1, NNETs, and trees, respectively) and when they were developed on nonrepresentative data (mean ROC areas 0.734, 0.713, 0.703, and 0.667). However, when the models developed using nonrepresentative data were compared with models developed from data sampled from the population, LR2 had the greatest loss in performance. Conclusion: Our results highlight the necessity of external validation to test the transportability of predictive models. (C) 2003 Elsevier Inc. All rights reserved.
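The claim that LR2 lost the most performance can be checked directly from the mean ROC areas quoted in the abstract. The sketch below is only illustrative arithmetic on those eight reported means. The variable and function names are hypothetical, and the difference of reported means is an assumption about how "loss in performance" is measured, not necessarily the authors' own computation.

```python
# Mean ROC areas as reported in the abstract.
# Keys are labels chosen here for illustration only.
developed_on_population = {"LR2": 0.769, "LR1": 0.741, "NNET": 0.724, "Tree": 0.682}
developed_on_perturbed = {"LR2": 0.734, "LR1": 0.713, "NNET": 0.703, "Tree": 0.667}


def roc_loss(representative, nonrepresentative):
    """Drop in mean ROC area per model type (assumed measure: difference of means)."""
    return {
        model: round(representative[model] - nonrepresentative[model], 3)
        for model in representative
    }


losses = roc_loss(developed_on_population, developed_on_perturbed)
largest_loss_model = max(losses, key=losses.get)

# Print model types from largest to smallest loss.
for model, loss in sorted(losses.items(), key=lambda item: -item[1]):
    print(f"{model}: {loss:.3f}")
print("Largest loss:", largest_loss_model)
```

Under this reading, the drops are 0.035 (LR2), 0.028 (LR1), 0.021 (NNETs), and 0.015 (trees), which matches the abstract's statement that LR2 remained the most accurate model yet lost the most when developed on nonrepresentative data.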

Pages: 721-729

Page count: 9

References (46 in total):

[1] Why models predicting bacteremia in general medical patients do not work [J].