Using Classification and Regression Trees (CART) and Random Forests to Analyze Attrition: Results From Two Simulations

被引:109
作者
Hayes, Timothy [1 ]
Usami, Satoshi [2 ]
Jacobucci, Ross [1 ]
McArdle, John J. [1 ]
机构
[1] Univ So Calif, Dept Psychol, Los Angeles, CA 90089 USA
[2] Univ Tsukuba, Dept Psychol, Tsukuba, Ibaraki 305, Japan
关键词
missing data analysis; attrition; machine learning; classification and regression trees (CART); longitudinal data analysis; MAXIMUM-LIKELIHOOD-ESTIMATION; MISSING-DATA; WEIGHTS; LONELINESS; INFERENCE;
D O I
10.1037/pag0000046
中图分类号
R4 [临床医学]; R592 [老年病学];
学科分类号
100201 [内科学]; 100218 [急诊医学];
摘要
In this article, we describe a recent development in the analysis of attrition: using classification and regression trees (CART) and random forest methods to generate inverse sampling weights. These flexible machine learning techniques have the potential to capture complex nonlinear, interactive selection models, yet to our knowledge, their performance in the missing data analysis context has never been evaluated. To assess the potential benefits of these methods, we compare their performance with commonly employed multiple imputation and complete case techniques in 2 simulations. These initial results suggest that weights computed from pruned CART analyses performed well in terms of both bias and efficiency when compared with other methods. We discuss the implications of these findings for applied researchers.
引用
收藏
页码:911 / 929
页数:19
相关论文
共 41 条
[1]
Aiken L. S., 1991, Multiple regression: Testing and interpreting interactions
[3]
Arbuckle JL., 1996, Advanced structural equation modeling: Issues and techniques, V243, P277, DOI DOI 10.4324/9781315827414
[4]
Sampling weights in latent variable modeling [J].
Asparouhov, T .
STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2005, 12 (03) :411-434
[5]
Berk R.A., 2009, Statistical learning from a regression perspective
[6]
Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]
Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]
Breiman L., 1984, Classification and Regression Trees. The Wadsworth statisticsprobability series, V19
[9]
MULTIVARIATE DECISION TREES [J].
BRODLEY, CE ;
UTGOFF, PE .
MACHINE LEARNING, 1995, 19 (01) :45-77
[10]
COHEN S, 1988, CLAR SYMP, P31