Confidence intervals for correlations when data are not normal

被引:101
作者
Bishara, Anthony J. [1 ]
Hittner, James B. [1 ]
机构
[1] Coll Charleston, Dept Psychol, 66 George St, Charleston, SC 29424 USA
关键词
Correlation; Confidence interval; Normal; Robust; Fisher z; Fisher z'; r to z; MOMENT CORRELATION-COEFFICIENT; RANK CORRELATION; MULTIVARIATE SKEWNESS; SAMPLE SKEWNESS; NONNORMAL DATA; BOOTSTRAP; KURTOSIS; PEARSON; ERROR; TESTS;
D O I
10.3758/s13428-016-0702-8
中图分类号
B841 [心理学研究方法];
学科分类号
040201 [基础心理学];
摘要
With nonnormal data, the typical confidence interval of the correlation (Fisher z') may be inaccurate. The literature has been unclear as to which of several alternative methods should be used instead, and how extreme a violation of normality is needed to justify an alternative. Through Monte Carlo simulation, 11 confidence interval methods were compared, including Fisher z', two Spearman rank-order methods, the Box-Cox transformation, rank-based inverse normal (RIN) transformation, and various bootstrap methods. Nonnormality often distorted the Fisher z' confidence interval- for example, leading to a 95 % confidence interval that had actual coverage as low as 68 %. Increasing the sample size sometimes worsened this problem. Inaccurate Fisher z' intervals could be predicted by a sample kurtosis of at least 2, an absolute sample skewness of at least 1, or significant violations of normality hypothesis tests. Only the Spearman rankorder and RIN transformation methods were universally robust to nonnormality. Among the bootstrap methods, an observed imposed bootstrap came closest to accurate coverage, though it often resulted in an overly long interval. The results suggest that sample nonnormality can justify avoidance of the Fisher z' interval in favor of a more robust alternative. R code for the relevant methods is provided in supplementary materials.
引用
收藏
页码:294 / 309
页数:16
相关论文
共 77 条
[1]
[Anonymous], R LANG ENV STAT COMP
[2]
[Anonymous], JACKKNIFE BOOTSTRAP, DOI DOI 10.1137/1.9781611970319
[3]
[Anonymous], 1993, An introduction to the bootstrap
[4]
[Anonymous], 1994, Statistics
[5]
[Anonymous], 2010, Publication Manual of the American Psychological Association, V6th
[6]
[Anonymous], 1980, Multivariate Analysis
[7]
A Cautionary Note on the Use of the Vale and Maurelli Method to Generate Multivariate, Nonnormal Data for Simulation Purposes [J].
Astivia, Oscar L. Olvera ;
Zumbo, Bruno D. .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2015, 75 (04) :541-567
[8]
Rank-Based Inverse Normal Transformations are Increasingly Used, But are They Merited? [J].
Beasley, T. Mark ;
Erickson, Stephen ;
Allison, David B. .
BEHAVIOR GENETICS, 2009, 39 (05) :580-595
[9]
Beasley W., 2012, APA handbook of research methods in Psychology, P407, DOI [10.1037/13620-022, DOI 10.1037/13620-022]
[10]
Bootstrapping to test for nonzero population correlation coefficients using univariate sampling [J].
Beasley, William Howard ;
DeShea, Lise ;
Toothaker, Larry E. ;
Mendoza, Jorge L. ;
Bard, David E. ;
Rodgers, Joseph Lee .
PSYCHOLOGICAL METHODS, 2007, 12 (04) :414-433