Replication and p Intervals p Values Predict the Future Only Vaguely, but Confidence Intervals Do Much Better

被引:293
作者
Cumming, Geoff [1 ]
机构
[1] La Trobe Univ, Sch Psychol Sci, Melbourne, Vic 3086, Australia
基金
澳大利亚研究理事会;
关键词
D O I
10.1111/j.1745-6924.2008.00079.x
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Replication is fundamental to science, so statistical analysis should give information about replication. Because p values dominate statistical analysis in psychology, it is important to ask what p says about replication. The answer to this question is "Surprisingly little." In one simulation of 25 repetitions of a typical experiment, p varied from <.001 to.76, thus illustrating that p is a very unreliable measure. This article shows that, if an initial experiment results in two-tailed p = 5.05, there is an 80% chance the one-tailed p value from a replication will fall in the interval (.00008,.44), a 10% chance that p < .00008, and fully a 10% chance that p > .44. Remarkably, the interval-termed a p interval-is this wide however large the sample size. p is so unreliable and gives such dramatically vague information that it is a poor basis for inference. Confidence intervals, however, give much better information about replication. Researchers should minimize the role of p by using confidence intervals and modelfitting techniques and by adopting meta-analytic thinking.
引用
收藏
页码:286 / 300
页数:15
相关论文
共 50 条
[1]  
Abelson R.P., 1995, Statistics As Principled Argument, DOI 10.4324/9781410601155
[2]  
[Anonymous], 2002, Methods of Psychological Research, DOI DOI 10.1119/1.2343497
[3]  
APA, 2001, Publication manual of the American Psychological Association, V5th, DOI DOI 10.1037/0000165-000
[4]  
BERGER JO, 1987, J AM STAT ASSOC, V82, P112, DOI 10.2307/2289131
[5]  
COHEN J, 1994, AM PSYCHOL, V49, P997, DOI 10.1037/0003-066X.50.12.1103
[6]   Understanding the average probability of replication - Comment on Killeen (2005) [J].
Cumming, G .
PSYCHOLOGICAL SCIENCE, 2005, 16 (12) :1002-1004
[7]   A primer on the understanding, use, and calculation of confidence intervals that are based on central and noncentral distributions [J].
Cumming, G ;
Finch, S .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2001, 61 (04) :532-574
[8]   Inference by eye - Confidence intervals and how to read pictures of data [J].
Cumming, G ;
Finch, S .
AMERICAN PSYCHOLOGIST, 2005, 60 (02) :170-180
[9]  
Cumming G., 2004, Understanding Statistics, V3, P299, DOI [10.1207/s15328031us0304_5, DOI 10.1207/S15328031US0304_5]
[10]   Statistical reform in psychology - Is anything changing? [J].
Cumming, Geoff ;
Fidler, Fiona ;
Leonard, Martine ;
Kalinowski, Pavel ;
Christiansen, Ashton ;
Kleinig, Anita ;
Lo, Jessica ;
McMenamin, Natalie ;
Wilson, Sarah .
PSYCHOLOGICAL SCIENCE, 2007, 18 (03) :230-232