Multiple additive regression trees with application in epidemiology

被引:614
作者
Friedman, JH [1 ]
Meulman, JJ
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[2] Leiden Univ, Data Theory Grp, Leiden, Netherlands
关键词
predictive learning; regression trees; boosting; data mining; MART; cervical cancer;
D O I
10.1002/sim.1501
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Predicting future outcomes based on knowledge obtained from past observational data is a common application in a wide variety of areas of scientific research. In the present paper, prediction will be focused on various grades of cervical preneoplasia and neoplasia. Statistical tools used for prediction should of course possess predictive accuracy, and preferably meet secondary requirements such as speed, ease of use, and interpretability of the resulting predictive model. A new automated procedure based on an extension (called 'boosting') of regression and classification tree (CART) models is described. The resulting tool is a fast 'off-the-shelf procedure for classification and regression that is competitive in accuracy with more customized approaches, while being fairly automatic to use (little tuning), and highly robust especially when applied to less than clean data. Additional tools are presented for interpreting and visualizing the results of such multiple additive regression tree (MART) models. Copyright (C) 2003 John Wiley Sons, Ltd.
引用
收藏
页码:1365 / 1381
页数:17
相关论文
共 7 条
[1]  
BOON ME, 1990, ACTA CYTOL, V35, P57
[2]  
Breiman L., 1984, BIOMETRICS, DOI DOI 10.2307/2530946
[3]  
Freund Y., 1996, Machine Learning. Proceedings of the Thirteenth International Conference (ICML '96), P148
[4]   Additive logistic regression: A statistical view of boosting - Rejoinder [J].
Friedman, J ;
Hastie, T ;
Tibshirani, R .
ANNALS OF STATISTICS, 2000, 28 (02) :400-407
[5]   Greedy function approximation: A gradient boosting machine [J].
Friedman, JH .
ANNALS OF STATISTICS, 2001, 29 (05) :1189-1232
[6]  
MEULMAN JJ, 1992, ANAL QUANT CYTOL, V14, P60
[7]  
Steinberg D., 1997, CART CLASSIFICATION