Internal validation of risk models in lung resection surgery: Bootstrap versus training-and-test sampling

被引:62
作者
Brunelli, Alessandro
Rocco, Gaetano
机构
[1] Umberto I Reg Hosp, Unit Thorac Surg, Ancona, Italy
[2] Sheffield Teaching Hosp, Div Thorac Surg, Sheffield, S Yorkshire, England
关键词
D O I
10.1016/j.jtcvs.2006.02.002
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objective: The objective of the present analysis was to compare the performance of a lung resection mortality model developed by means of logistic regression and bootstrap analysis with that of multiple mortality models developed by using the traditional training-and-test method from the same dataset. Methods: Eleven mortality models ( 1 developed by means of logistic regression and bootstrap validation and the other 10 developed by means of the traditional training-and-test random splitting of the dataset) were generated by the data of unit A ( 571 patients submitted to major lung resection). The performances of each of the 11 mortality models were then evaluated by assessing the distribution of the respective c-statistics in 1000 bootstrap samples derived from unit B ( 224 patients). Results: The first model ( logistic regression and bootstrap analysis) had good discrimination among the 1000 bootstrap external samples (c-statistics > 0.7 in 80% of samples and > 0.8 in 38% of samples). Among the 10 training-and-test models, only one model had a similar performance, whereas the others had a poorer discrimination. Conclusions: The traditional training-and-test method for risk model building proved to be unreliable across multiple external populations and was generally inferior to bootstrap analysis for variable selection in regression analysis. Therefore the use of bootstrap analysis must be recommended for every future model-building process.
引用
收藏
页码:1243 / 1247
页数:5
相关论文
共 16 条
[1]   The European Thoracic Surgery Database project: modelling the risk of in-hospital death following lung resection [J].
Berrisford, R ;
Brunelli, A ;
Rocco, G ;
Treasure, T ;
Utley, M .
EUROPEAN JOURNAL OF CARDIO-THORACIC SURGERY, 2005, 28 (02) :306-311
[2]   Breaking down barriers: Helpful breakthrough statistical methods you need to understand better [J].
Blackstone, EH .
JOURNAL OF THORACIC AND CARDIOVASCULAR SURGERY, 2001, 122 (03) :430-439
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]   COMPUTER-INTENSIVE METHODS IN STATISTICS [J].
DIACONIS, P ;
EFRON, B .
SCIENTIFIC AMERICAN, 1983, 248 (05) :116-&
[5]   The bootstrap and modern statistics [J].
Efron, B .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2000, 95 (452) :1293-1296
[6]   A LEISURELY LOOK AT THE BOOTSTRAP, THE JACKKNIFE, AND CROSS-VALIDATION [J].
EFRON, B ;
GONG, G .
AMERICAN STATISTICIAN, 1983, 37 (01) :36-48
[7]  
Efron B., 1993, INTRO BOOTSTRAP, DOI 10.1007/978-1-4899-4541-9
[8]   Bootstrap resampling methods: Something for nothing? [J].
Grunkemeier, GL ;
Wu, YX .
ANNALS OF THORACIC SURGERY, 2004, 77 (04) :1142-1144
[9]   Receiver operating characteristic curve analysis of clinical risk models [J].
Grunkemeier, GL ;
Jin, RY .
ANNALS OF THORACIC SURGERY, 2001, 72 (02) :323-326
[10]  
Harrel F., 2001, Regression Modelling Strategies