Robustness issues in multilevel regression analysis

被引：581

作者：

Maas, CJM ^{[1
]}

Hox, JJ ^{[1
]}

机构：

[1] Univ Utrecht, Fac Social Sci, Dept Methodol & Stat, NL-3508 TC Utrecht, Netherlands

来源：

STATISTICA NEERLANDICA | 2004年 / 58卷 / 02期

关键词：

multilevel modeling; sample size; cluster sampling; maximum likelihood; (robust) standard errors; sandwich estimate; Huber/White correction;

D O I：

10.1046/j.0039-0402.2003.00252.x

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

A multilevel problem concerns a population with a hierarchical structure. A sample from such a population can be described as a multistage sample. First, a sample of higher level units is drawn (e.g. schools or organizations), and next a sample of the sub-units from the available units (e.g. pupils in schools or employees in organizations). In such samples, the individual observations are in general not completely independent. Multilevel analysis software accounts for this dependence and in recent years these programs have been widely accepted. Two problems that occur in the practice of multilevel modeling will be discussed. The first problem is the choice of the sample sizes at the different levels. What are sufficient sample sizes for accurate estimation? The second problem is the normality assumption of the level-2 error distribution. When one wants to conduct tests of significance, the errors need to be normally distributed. What happens when this is not the case? In this paper, simulation studies are used to answer both questions. With respect to the first question, the results show that a small sample size at level two (meaning a sample of 50 or less) leads to biased estimates of the second-level standard errors. The answer to the second question is that only the standard errors for the random effects at the second level are highly inaccurate if the distributional assumptions concerning the level-2 errors are not fulfilled. Robust standard errors turn out to be more reliable than the asymptotic standard errors based on maximum likelihood.

引用

页码：127 / 137

页数：11

共 23 条

[1]

AFSHARTOUS D, 1995, ANN M AM ED RES ASS

[2]

[Anonymous], 2005, HLM 603 HIERARCHICAL

[3]

Browne, 1998, APPL MCMC METHODS MU

[4] Implementation and performance issues in the Bayesian and likelihood fitting of multilevel models [J].

Browne, WJ ;

Draper, D .

COMPUTATIONAL STATISTICS, 2000, 15 (03) :391-420

[5]

Bryk A.S., 1992, Hierarchical Models: Applications and Data Analysis Methods

[6]

BUSING F, 1993, UNPUB DISTRIBUTION C

[7]

Cohen J., 1988, STAT POWER ANAL BEHA

[8] RANDOM COEFFICIENT MODELS FOR MULTILEVEL ANALYSIS [J].

DELEEUW, J ;

KREFT, I .

JOURNAL OF EDUCATIONAL STATISTICS, 1986, 11 (01) :57-85

[9]

ELLIASON SR, 1993, MAXIMUM LIKELIHOOD E

[10]

Goldstein H., 2011, MULTILEVEL STAT MODE

← 1 2 3 →