Survival ensembles

被引:561
作者
Hothorn, Torsten
Buehlmann, Peter
Dudoit, Sandrine
Molinaro, Annette
Van der Laan, Mark J.
机构
[1] Univ Erlangen Nurnberg, Inst Med Informat Biometrie & Epidemiol, D-91054 Erlangen, Germany
[2] ETH, Seminar Stat, CH-8032 Zurich, Switzerland
[3] Univ Calif Berkeley, Div Biostat, Berkeley, CA 94720 USA
[4] Yale Univ, Sch Med, Div Biostat Epidemiol & Publ Hlth, New Haven, CT 06520 USA
关键词
censoring; cross-validation; ensemble methods; IPC weights; loss function; prediction; prognostic factors; survival analysis;
D O I
10.1093/biostatistics/kxj011
中图分类号
Q [生物科学];
学科分类号
07 [理学]; 0710 [生物学]; 09 [农学];
摘要
We propose a unified and flexible framework for ensemble learning in the presence of censoring. For right-censored data, we introduce a random forest algorithm and a generic gradient boosting algorithm for the construction of prognostic and diagnostic models. The methodology is utilized for predicting the survival time of patients suffering from acute myeloid leukemia based on clinical and genetic covariates. Furthermore, we compare the diagnostic capabilities of the proposed censored data random forest and boosting methods, applied to the recurrence-free survival time of node-positive breast cancer patients, with previously published findings.
引用
收藏
页码:355 / 373
页数:19
相关论文
共 56 条
[1]
Altman DG, 2000, STAT MED, V19, P453, DOI 10.1002/(SICI)1097-0258(20000229)19:4<453::AID-SIM350>3.3.CO
[2]
2-X
[3]
Shape quantization and recognition with randomized trees [J].
Amit, Y ;
Geman, D .
NEURAL COMPUTATION, 1997, 9 (07) :1545-1588
[4]
[Anonymous], 1997, INTRO RECURSIVE PART, DOI DOI 10.3390/SU9081457
[5]
Semi-supervised methods to predict patient survival from gene expression data [J].
Bair, E ;
Tibshirani, R .
PLOS BIOLOGY, 2004, 2 (04) :511-522
[6]
Predictive influence in the accelerated failure time model [J].
Bedrick, EJ ;
Exuzides, A ;
Johnson, WO ;
Thurmond, MC .
BIOSTATISTICS, 2002, 3 (03) :331-346
[7]
BENNER A, 2002, P COMP STAT COMPSTAT
[8]
Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]
Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]
Statistical modeling: The two cultures [J].
Breiman, L .
STATISTICAL SCIENCE, 2001, 16 (03) :199-215