Classification and regression tree analysis in public health: Methodological review and comparison with logistic regression

Cited by: 634
Authors
Lemon, SC
Roy, J
Clark, MA
Friedmann, PD
Rakowski, W
Affiliations
[1] Brown Univ, Sch Med, Providence, RI 02912 USA
[2] Rhode Isl Hosp, Providence, RI USA
DOI
10.1207/S15324796ABM2603_02
Chinese Library Classification number
B84 [Psychology];
Discipline classification code
04 ; 0402 ;
Abstract
Background: Audience segmentation strategies are of increasing interest to public health professionals who wish to identify easily defined, mutually exclusive population subgroups whose members share similar characteristics that help determine participation in a health-related behavior, as a basis for targeted interventions. Classification and regression tree (C&RT) analysis is a nonparametric decision tree methodology that can efficiently segment populations into meaningful subgroups. However, it is not commonly used in public health. Purpose: This study provides a methodological overview of C&RT analysis for persons unfamiliar with the procedure. Methods and Results: An example of a C&RT analysis is provided, and interpretation of the results is discussed. Results are validated against those obtained from a logistic regression model that was created to replicate the C&RT findings. Results obtained from the example C&RT analysis are also compared with those obtained from a common approach to logistic regression, the stepwise selection procedure. Issues to consider when deciding whether to use C&RT are discussed, and situations in which C&RT may and may not be beneficial are described. Conclusions: C&RT is a promising research tool for the identification of at-risk populations in public health research and outreach.
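Illustration (not part of the original record): the abstract describes C&RT as segmenting a population into mutually exclusive subgroups. A minimal sketch of the core step, one binary split chosen by Gini impurity, might look like the following. The function names, the toy data, and the use of Gini impurity are assumptions made for illustration; the record does not say which splitting criterion or software the authors used.

```python
from collections import Counter


def gini(labels):
    """Gini impurity of a list of class labels (0.0 means the group is pure)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def best_split(rows, labels):
    """Find the single binary split with the lowest weighted Gini impurity.

    rows   : list of numeric feature tuples
    labels : list of class labels, one per row
    Rule   : rows with feature value <= threshold go left, the rest go right.
    Returns (feature_index, threshold, weighted_impurity), or None if no
    split separates the data into two non-empty groups.
    """
    best = None
    n = len(rows)
    for j in range(len(rows[0])):
        for t in sorted({r[j] for r in rows}):
            left = [y for r, y in zip(rows, labels) if r[j] <= t]
            right = [y for r, y in zip(rows, labels) if r[j] > t]
            if not left or not right:
                continue  # a split must produce two non-empty subgroups
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if best is None or score < best[2]:
                best = (j, t, score)
    return best


# Hypothetical toy data: (age, smoker flag) -> took part in a screening (1) or not (0).
rows = [(35, 0), (42, 1), (51, 0), (58, 1), (63, 0), (70, 1)]
labels = [0, 0, 0, 1, 1, 1]
split = best_split(rows, labels)
print(split)
```

A full tree would apply `best_split` recursively to each resulting subgroup and then prune; this sketch stops at the first split to show how each subgroup boundary is chosen.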
Pages: 172-181
Number of pages: 10