Modeling Clustered Data with Very Few Clusters

被引:295
作者
McNeish, Daniel [1 ,2 ]
Stapleton, Laura M. [1 ]
机构
[1] Univ Maryland, College Pk, MD 20742 USA
[2] Univ Utrecht, NL-3508 TC Utrecht, Netherlands
关键词
Bayesian; cluster randomized trial; fixed effect model; GEE; HLM; multilevel model; small sample; LONGITUDINAL DATA-ANALYSIS; SMALL SAMPLE INFERENCE; GENERALIZED ESTIMATING EQUATIONS; MULTILEVEL MODELS; BAYESIAN METHODS; VARIANCE; ESTIMATOR; TESTS; GEE; APPROXIMATIONS;
D O I
10.1080/00273171.2016.1167008
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Small-sample inference with clustered data has received increased attention recently in the methodological literature, with several simulation studies being presented on the small-sample behavior of many methods. However, nearly all previous studies focus on a single class of methods (e.g., only multilevel models, only corrections to sandwich estimators), and the differential performance of various methods that can be implemented to accommodate clustered data with very few clusters is largely unknown, potentially due to the rigid disciplinary preferences. Furthermore, a majority of these studies focus on scenarios with 15 or more clusters and feature unrealistically simple data-generation models with very few predictors. This article, motivated by an applied educational psychology cluster randomized trial, presents a simulation study that simultaneously addresses the extreme small sample and differential performance (estimation bias, Type I error rates, and relative power) of 12 methods to account for clustered data with a model that features a more realistic number of predictors. The motivating data are then modeled with each method, and results are compared. Results show that generalized estimating equations perform poorly; the choice of Bayesian prior distributions affects performance; and fixed effect models perform quite well. Limitations and implications for applications are also discussed.
引用
收藏
页码:495 / 518
页数:24
相关论文
共 82 条
[1]  
Allison Paul., 2005, FIXED EFFECTS REGRES
[2]  
Angrist JD, 2009, MOSTLY HARMLESS ECONOMETRICS: AN EMPIRICISTS COMPANION, P1
[3]  
[Anonymous], ED PSYCHOL REV
[4]  
[Anonymous], 2004, Applied Longitudinal Analysis
[5]  
[Anonymous], COMMUNICATIONS STAT
[6]  
[Anonymous], 1996, ARE MULTILEVEL TECHN
[7]  
[Anonymous], PSYCHOL METHODS
[8]   Bayesian Methods for the Analysis of Small Sample Multilevel Data With a Complex Variance Structure [J].
Baldwin, Scott A. ;
Fellingham, Gilbert W. .
PSYCHOLOGICAL METHODS, 2013, 18 (02) :151-164
[9]   Using generalized estimating equations for longitudinal data analysis [J].
Ballinger, GA .
ORGANIZATIONAL RESEARCH METHODS, 2004, 7 (02) :127-150
[10]   Fitting Multilevel Models With Ordinal Outcomes: Performance of Alternative Specifications and Methods of Estimation [J].
Bauer, Daniel J. ;
Sterba, Sonya K. .
PSYCHOLOGICAL METHODS, 2011, 16 (04) :373-390