Using classification trees to assess low birth weight outcomes

Cited by: 27
Authors
Kitsantas, Panagiota
Hollander, Myles
Li, Lei
Affiliations
[1] George Mason Univ, Coll Hlth & Human Serv, Dept Hlth Policy & Adm, Fairfax, VA 22030 USA
[2] Florida State Univ, Dept Stat, Tallahassee, FL 32306 USA
[3] Univ So Calif, Dept Biol & Math, Los Angeles, CA 90089 USA
Keywords
low birth weight; classification trees; logistic regression; geographical regions
DOI
10.1016/j.artmed.2006.03.008
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Objective: Low birth weight (LBW) is a major public health problem. Compared with normal birth weight, LBW is positively associated with infant mortality and negatively associated with normative childhood cognitive and physical development. Over the past two decades, research has identified important risk factors for LBW. In this study, we used classification trees to study the interactive nature of these factors. In particular, we (1) identified subgroups of women at high risk of an LBW outcome in seven geographical regions of Florida, and (2) studied the predictive performance of classification trees by comparing the tree-based results with those obtained using logistic regression.
Methods: The data, 181,690 singleton births, were derived from Florida birth certificates recorded in 1998. Classification trees and logistic regression models were built for each of the seven geographical regions. The outcome variable consisted of two classes, LBW (<2500 g) and normal birth weight (>=2500 g), and a large number of known risk factors were examined. Tree and logistic regression models were compared using receiver operating characteristic (ROC) curves and sensitivity and specificity analyses.
Results: The classification trees revealed a number of high-risk subgroups. For instance, White, Hispanic or Other non-white mothers who were healthy, smoked, and gained less than 20 lbs had a higher risk of an LBW birth than those with the same characteristics who gained more than 20 lbs. Factors such as parity and marital status were important predictors of pregnancy outcomes among nonsmoking White, Hispanic or Other non-white mothers. Furthermore, Black mothers were directly classified as a high-risk subgroup in the Panhandle, Northeast, and North Central regions, while in the Southern regions a series of other characteristics further defined the high-risk subgroup of Black mothers. Overall, the differences in predictive performance between tree models and logistic regression were minimal.
Conclusion: The present study demonstrated that classification trees can be used to identify subgroups of mothers who are at high risk of LBW outcomes. Although these exploratory tree analyses revealed a number of distinctive variable interactions for each geographical area, the variable selection was similar across all seven regions. The study also demonstrated that neither classification trees nor logistic regression models outperformed the other; both approaches provided useful analyses of the data. (C) 2006 Elsevier B.V. All rights reserved.
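The comparison criteria named in the abstract (sensitivity, specificity, and the area under the ROC curve) can be illustrated with a minimal Python sketch. Only the 2500 g cut-off comes from the abstract. The birth weights, the two sets of predicted risks, the 0.5 decision threshold, and all function names are hypothetical values chosen for illustration; they are not data or results from the study.

```python
from typing import Sequence, Tuple

LBW_THRESHOLD_G = 2500  # from the abstract: LBW is defined as < 2500 g


def is_lbw(birth_weight_g: float) -> int:
    """Return 1 for low birth weight (< 2500 g), otherwise 0."""
    return 1 if birth_weight_g < LBW_THRESHOLD_G else 0


def sensitivity_specificity(y_true: Sequence[int],
                            y_pred: Sequence[int]) -> Tuple[float, float]:
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in y_true")
    return tp / (tp + fn), tn / (tn + fp)


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve, computed as the probability that a random
    positive case receives a higher score than a random negative case
    (ties count as one half)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present in y_true")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data, not taken from the study.
weights_g = [2100, 3400, 2450, 3000, 2800, 3600]
y_true = [is_lbw(w) for w in weights_g]

# Hypothetical predicted LBW risks from two models.
tree_scores = [0.8, 0.1, 0.4, 0.3, 0.6, 0.2]
logit_scores = [0.7, 0.2, 0.55, 0.35, 0.5, 0.1]

CUTOFF = 0.5  # assumed decision threshold
tree_pred = [1 if s >= CUTOFF else 0 for s in tree_scores]
logit_pred = [1 if s >= CUTOFF else 0 for s in logit_scores]

tree_sens, tree_spec = sensitivity_specificity(y_true, tree_pred)
logit_sens, logit_spec = sensitivity_specificity(y_true, logit_pred)
tree_auc = roc_auc(y_true, tree_scores)
logit_auc = roc_auc(y_true, logit_scores)
```

Sensitivity and specificity depend on the chosen cut-off, whereas the AUC summarizes ranking quality across all cut-offs, which is one reason to report both kinds of measure when comparing two classifiers.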
Pages: 275-289
Page count: 15
Related papers
32 in total
[1]   Predicting postconcussion syndrome after minor traumatic brain injury [J].
Bazarian, JJ ;
Atabaki, S .
ACADEMIC EMERGENCY MEDICINE, 2001, 8 (08) :788-795
[2]  
Breiman L., 1998, CLASSIFICATION REGRE
[3]  
*CART, 2000, TREE STRUCT NONP DAT
[4]  
Colombet I, 2000, J AM MED INFORM ASSN, P156
[5]   SOCIAL-FACTORS AND INFANT-MORTALITY - IDENTIFYING HIGH-RISK GROUPS AND PROXIMATE CAUSES [J].
CRAMER, JC .
DEMOGRAPHY, 1987, 24 (03) :299-322
[6]   Differing birth weight among infants of US-born blacks, African-born blacks, and US-born whites [J].
David, RJ ;
Collins, JW .
NEW ENGLAND JOURNAL OF MEDICINE, 1997, 337 (17) :1209-1214
[7]  
Department of Health and Human Services (USA), 2000, HLTH PEOPL 2010 UND
[8]  
Friedman J., 2001, The elements of statistical learning, V1, DOI DOI 10.1007/978-0-387-21606-5
[9]   Compromised birth outcomes and infant mortality among racial and ethnic groups [J].
Frisbie, WP ;
Forbes, D ;
Pullum, SG .
DEMOGRAPHY, 1996, 33 (04) :469-481
[10]   The health status of southern children: A neglected regional disparity [J].
Goldhagen, J ;
Remo, R ;
Bryant, T ;
Wludyka, P ;
Dailey, A ;
Wood, D ;
Watts, G ;
Livingood, W .
PEDIATRICS, 2005, 116 (06) :E746-E753