Avoiding 'data snooping' in multilevel and mixed effects models

被引:14
作者
Afshartous, David
Wolf, Michael [1 ]
机构
[1] Univ Zurich, Inst Empir Res Econ, CH-8006 Zurich, Switzerland
[2] Univ Miami, Coral Gables, FL 33124 USA
关键词
data snooping; hierarchical linear models; hypothesis testing; pairwise comparisons; random effects; rankings;
D O I
10.1111/j.1467-985X.2007.00494.x
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Multilevel or mixed effects models are commonly applied to hierarchical data. The level 2 residuals, which are otherwise known as random effects, are often of both substantive and diagnostic interest. Substantively, they are frequently used for institutional comparisons or rankings. Diagnostically, they are used to assess the model assumptions at the group level. Inference on the level 2 residuals, however, typically does not account for 'data snooping', i.e. for the harmful effects of carrying out a multitude of hypothesis tests at the same time. We provide a very general framework that encompasses both of the following inference problems: inference on the 'absolute' level 2 residuals to determine which are significantly different from 0, and inference on any prespecified number of pairwise comparisons. Thus, the user has the choice of testing the comparisons of interest. As our methods are flexible with respect to the estimation method that is invoked, the user may choose the desired estimation method accordingly. We demonstrate the methods with the London education authority data, the wafer data and the National Educational Longitudinal Study data.
引用
收藏
页码:1035 / 1059
页数:25
相关论文
共 33 条
[1]  
Afshartous D., 2004, BEHAVIORMETRIKA, V31, P43
[2]  
[Anonymous], 2003, MCMC ESTIMATION MLWI
[3]  
Benjamini Y, 2001, ANN STAT, V29, P1165
[4]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[5]  
Bryk A.S., 1992, Hierarchical Models: Applications and Data Analysis Methods
[6]   A novel bootstrap procedure for assessing the relationship between class size and achievement [J].
Carpenter, JR ;
Goldstein, H ;
Rasbash, J .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2003, 52 :431-443
[7]   RANDOM COEFFICIENT MODELS FOR MULTILEVEL ANALYSIS [J].
DELEEUW, J ;
KREFT, I .
JOURNAL OF EDUCATIONAL STATISTICS, 1986, 11 (01) :57-85
[8]   QUESTIONING MULTILEVEL MODELS [J].
DELEEUW, J ;
KREFT, IGG .
JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 1995, 20 (02) :171-189
[9]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[10]  
Efron B., 1993, INTRO BOOTSTRAP MONO, DOI DOI 10.1201/9780429246593