Replication, statistical consistency, and publication bias

被引:82
作者
Francis, Gregory [1 ]
机构
[1] Purdue Univ, Dept Psychol Sci, W Lafayette, IN 47907 USA
关键词
Hypothesis testing; Statistics; Publication bias; Scientific publishing; POST HOC POWER; EXPLORATORY TEST; SOCIAL-CLASS; PREVALENCE; PSYCHOLOGY; EXCESS; FUTURE; TESTS; FILL; TRIM;
D O I
10.1016/j.jmp.2013.02.003
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Scientific methods of investigation offer systematic ways to gather information about the world; and in the field of psychology application of such methods should lead to a better understanding of human behavior. Instead, recent reports in psychological science have used apparently scientific methods to report strong evidence for unbelievable claims such as precognition. To try to resolve the apparent conflict between unbelievable claims and the scientific method many researchers turn to empirical replication to reveal the truth. Such an approach relies on the belief that true phenomena can be successfully demonstrated in well-designed experiments, and the ability to reliably reproduce an experimental outcome is widely considered the gold standard of scientific investigations. Unfortunately, this view is incorrect; and misunderstandings about replication contribute to the conflicts in psychological science. Because experimental effects in psychology are measured by statistics, there should almost always be some variability in the reported outcomes. An absence of such variability actually indicates that experimental replications are invalid, perhaps because of a bias to suppress contrary findings or because the experiments were run improperly. Recent investigations have demonstrated how to identify evidence of such invalid experiment sets and noted its appearance for prominent findings in experimental psychology. The present manuscript explores those investigative methods by using computer simulations to demonstrate their properties and limitations. The methods are shown to be a check on the statistical consistency of a set of experiments by comparing the reported power of the experiments with the reported frequency of statistical significance. Overall, the methods are extremely conservative about reporting inconsistency when experiments are run properly and reported fully. The manuscript also considers how to improve scientific practice to avoid inconsistency, and discusses criticisms of the investigative method. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:153 / 169
页数:17
相关论文
共 70 条
[1]   Observation of a new particle in the search for the Standard Model Higgs boson with the ATLAS detector at the LHC [J].
Aad, G. ;
Abajyan, T. ;
Abbott, B. ;
Abdallah, J. ;
Khalek, S. Abdel ;
Abdelalim, A. A. ;
Abdinov, O. ;
Aben, R. ;
Abi, B. ;
Abolins, M. ;
AbouZeid, U. S. ;
Abramowicz, H. ;
Abreu, H. ;
Acharya, B. S. ;
Adamczyk, L. ;
Adams, D. L. ;
Addy, T. N. ;
Adelman, J. ;
Adomeit, S. ;
Adragna, P. ;
Adye, T. ;
Aefsky, S. ;
Aguilar-Saavedra, J. A. ;
Agustoni, M. ;
Aharrouche, M. ;
Ahlen, S. P. ;
Ahles, F. ;
Ahmad, A. ;
Ahsan, M. ;
Aielli, G. ;
Akdogan, T. ;
Akesson, T. P. A. ;
Akimoto, G. ;
Akimov, A. V. ;
Alam, M. S. ;
Alam, M. A. ;
Albert, J. ;
Albrand, S. ;
Aleksa, M. ;
Aleksandrov, I. N. ;
Alessandria, F. ;
Alexa, C. ;
Alexander, G. ;
Alexandre, G. ;
Alexopoulos, T. ;
Alhroob, M. ;
Aliev, M. ;
Alimonti, G. ;
Alison, J. ;
Allbrooke, B. M. M. .
PHYSICS LETTERS B, 2012, 716 (01) :1-29
[2]  
[Anonymous], J STAT SOFTWARE
[3]  
[Anonymous], 2011, R LANG ENV STAT COMP
[4]  
[Anonymous], PSYCHONOMIC B REV
[5]  
[Anonymous], 1988, Statistical power analysis for the behavioral sciences
[6]  
[Anonymous], PSYCHOL METHODS
[7]  
[Anonymous], 1981, J EDUC STAT, DOI DOI 10.3102/10769986006002107
[8]  
[Anonymous], I PERCEPTION
[9]  
[Anonymous], 2012, APS OBSERVER
[10]  
[Anonymous], 2009, INT STAT REV