Random effects structure for confirmatory hypothesis testing: Keep it maximal

被引：7421

作者：

Barr, Dale J. ^{[1
]}

Levy, Roger ^{[2
]}

Scheepers, Christoph ^{[1
]}

Tily, Harry J. ^{[3
]}

机构：

[1] Univ Glasgow, Inst Neurosci & Psychol, Glasgow G12 8QB, Lanark, Scotland

[2] Univ Calif San Diego, Dept Linguist, La Jolla, CA 92093 USA

[3] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02139 USA

来源：

JOURNAL OF MEMORY AND LANGUAGE | 2013年 / 68卷 / 03期

基金：

美国国家科学基金会;

关键词：

Linear mixed-effects models; Generalization; Statistics; Monte Carlo simulation; FIXED-EFFECT FALLACY; LINEAR MIXED MODELS; LANGUAGE;

D O I：

10.1016/j.jml.2012.11.001

中图分类号：

H0 [语言学];

学科分类号：

030303 ; 0501 ; 050102 ;

摘要：

Linear mixed-effects models (LMEMs) have become increasingly prominent in psycholinguistics and related areas. However, many researchers do not seem to appreciate how random effects structures affect the generalizability of an analysis. Here, we argue that researchers using LMEMs for confirmatory hypothesis testing should minimally adhere to the standards that have been in place for many decades. Through theoretical arguments and Monte Carlo simulation, we show that LMEMs generalize best when they include the maximal random effects structure justified by the design. The generalization performance of LMEMs including data-driven random effects structures strongly depends upon modeling criteria and sample size, yielding reasonable results on moderately-sized samples when conservative criteria are used, but with little or no power advantage over maximal models. Finally, random-intercepts-only LMEMs used on within-subjects and/or within-items data from populations where subjects and/or items vary in their sensitivity to experimental manipulations always generalize worse than separate F-1 and F-2 tests, and in many cases, even worse than F-1 alone. Maximal LMEMs should be the 'gold standard' for confirmatory hypothesis testing in psycholinguistics and beyond. (C) 2012 Elsevier Inc. All rights reserved.

引用

页码：255 / 278

页数：24

共 45 条

[1]

[Anonymous], 2003, P 3 INT WORKSH DISTR, DOI DOI 10.1.1.13.3406

[2]

[Anonymous], 1999, The analysis of variance

[3]

[Anonymous], 1971, Statistical Principles in Experimental Design

[4] Mixed-effects modeling with crossed random effects for subjects and items [J].

Baayen, R. H. ;

Davidson, D. J. ;

Bates, D. M. .

JOURNAL OF MEMORY AND LANGUAGE, 2008, 59 (04) :390-412

[5]

Baayen R. H., 2008, ANAL LINGUISTIC DATA, DOI [10.1017/CBO9780511801686, DOI 10.1017/CBO9780511801686]

[6]

Baayen R.H., 2011, LANGUAGER DATA SETS

[7]

Baayen R.H., 2004, Mental Lexicon Working Papers, V1, P1

[8] Analyzing 'visual world' eyetracking data using multilevel logistic regression [J].

Barr, Dale J. .

JOURNAL OF MEMORY AND LANGUAGE, 2008, 59 (04) :457-474

[9]

Bates D., 2011, lme4: Linear mixed-effects models using S4 classes

[10] Generalized linear mixed models: a practical guide for ecology and evolution [J].

Bolker, Benjamin M. ;

Brooks, Mollie E. ;

Clark, Connie J. ;

Geange, Shane W. ;

Poulsen, John R. ;

Stevens, M. Henry H. ;

White, Jada-Simone S. .

TRENDS IN ECOLOGY & EVOLUTION, 2009, 24 (03) :127-135

← 1 2 3 4 5 →