Bayesian Tests to Quantify the Result of a Replication Attempt

Cited by: 180
Authors
Verhagen, Josine [1]
Wagenmakers, Eric-Jan [1]
Affiliations
[1] Univ Amsterdam, Dept Psychol, NL-1018 XA Amsterdam, Netherlands
Funding
European Research Council
Keywords
effect size; prior distribution; Bayes factor; confidence intervals; prior sensitivity; model selection; hypothesis test; psychology; replicability; incentives; romance; design; future
DOI
10.1037/a0036731
Chinese Library Classification number
B84 [Psychology]
Discipline classification code
04; 0402
Abstract
Replication attempts are essential to the empirical sciences. Successful replication attempts increase researchers' confidence in the presence of an effect, whereas failed replication attempts induce skepticism and doubt. However, it is often unclear to what extent a replication attempt results in success or failure. To quantify replication outcomes we propose a novel Bayesian replication test that compares the adequacy of 2 competing hypotheses. The 1st hypothesis is that of the skeptic and holds that the effect is spurious; this is the null hypothesis that postulates a zero effect size, $H_0: \delta = 0$. The 2nd hypothesis is that of the proponent and holds that the effect is consistent with the one found in the original study, an effect that can be quantified by a posterior distribution. Hence, the 2nd hypothesis, the replication hypothesis, is given by $H_r: \delta \sim$ "posterior distribution from original study." The weighted-likelihood ratio between $H_0$ and $H_r$ quantifies the evidence that the data provide for replication success and failure. In addition to the new test, we present several other Bayesian tests that address different but related questions concerning a replication study. These tests pertain to the independent conclusions of the separate experiments, the difference in effect size between the original experiment and the replication attempt, and the overall conclusion based on the pooled results. Together, this suite of Bayesian tests allows a relatively complete formalization of the way in which the result of a replication attempt alters our knowledge of the phenomenon at hand. The use of all Bayesian replication tests is illustrated with 3 examples from the literature. For experiments analyzed using the t test, computation of the new replication test only requires the t values and the numbers of participants from the original study and the replication study.
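
To make the last sentence of the abstract concrete, the sketch below shows how a replication Bayes factor could be computed from only the t values and sample sizes. It is not the authors' procedure: it assumes a one-sample design, a flat prior on the effect size in the original study, and a normal approximation in which the observed effect size d = t / sqrt(n) has sampling variance 1/n. The function names and the direction of the ratio (evidence for the replication hypothesis over the null) are choices made for this illustration.

```python
import math


def normal_pdf(x, mean, var):
    """Density of a normal distribution with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def replication_bf_normal_approx(t_orig, n_orig, t_rep, n_rep):
    """Approximate Bayes factor for H_r over H_0 (illustrative sketch only).

    Assumptions (not taken from the paper):
      - one-sample t test, so the observed effect size is d = t / sqrt(n);
      - the sampling distribution of d is approximately normal with variance 1/n;
      - a flat prior in the original study, so the posterior for delta is
        approximately Normal(d_orig, 1/n_orig).
    """
    d_orig = t_orig / math.sqrt(n_orig)
    d_rep = t_rep / math.sqrt(n_rep)
    var_orig = 1.0 / n_orig
    var_rep = 1.0 / n_rep

    # Under H_r, delta ~ Normal(d_orig, var_orig); integrating delta out gives
    # d_rep ~ Normal(d_orig, var_orig + var_rep).
    marginal_hr = normal_pdf(d_rep, d_orig, var_orig + var_rep)

    # Under H_0, delta = 0, so d_rep ~ Normal(0, var_rep).
    marginal_h0 = normal_pdf(d_rep, 0.0, var_rep)

    return marginal_hr / marginal_h0


if __name__ == "__main__":
    # Invented numbers for illustration, not data from any study.
    print(replication_bf_normal_approx(t_orig=3.0, n_orig=25, t_rep=3.0, n_rep=25))
    print(replication_bf_normal_approx(t_orig=3.0, n_orig=25, t_rep=0.0, n_rep=25))
```

Under these assumptions, values above 1 favor the replication hypothesis and values below 1 favor the null. An exact treatment would replace the normal approximation with the t-based likelihood and the actual posterior from the original study.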
Pages: 1457-1475
Number of pages: 19