Calibration of p values for testing precise null hypotheses

被引:540
作者
Sellke, T [1 ]
Bayarri, MJ
Berger, JO
机构
[1] Purdue Univ, Dept Stat, W Lafayette, IN 47907 USA
[2] Univ Valencia, Dept Stat & Operat Res, E-46100 Valencia, Spain
[3] Duke Univ, Inst Stat & Decis Sci, Durham, NC 27708 USA
基金
美国国家科学基金会;
关键词
Bayes factors; Bayesian robustness; conditional frequentist error probabilities; odds;
D O I
10.1198/000313001300339950
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
P values are the most commonly used tool to measure evidence against a hypothesis or hypothesized model. Unfortunately, they are often incorrectly viewed as an error probability for rejection of the hypothesis or, even worse, as the posterior probability that the hypothesis is true. The fact that these interpretations can be completely misleading when testing precise hypotheses is first reviewed, through consideration of two revealing simulations. Then two calibrations of a p value are developed, the first being interpretable as odds and the second as either a (conditional) frequentist error probability or as the posterior probability of the hypothesis.
引用
收藏
页码:62 / 71
页数:10
相关论文
共 26 条
[1]  
[Anonymous], 1987, Statistical Science
[2]  
[Anonymous], SANKHAYA B
[3]  
Bayarri MJ, 1999, BAYESIAN STATISTICS 6, P53
[4]  
Berger JO, 1997, STAT SCI, V12, P133
[5]  
BERGER JO, 1987, J AM STAT ASSOC, V82, P112, DOI 10.2307/2289131
[6]   The intrinsic Bayes factor for model selection and prediction [J].
Berger, JO ;
Pericchi, LR .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (433) :109-122
[7]   A UNIFIED CONDITIONAL FREQUENTIST AND BAYESIAN TEST FOR FIXED AND SEQUENTIAL SIMPLE HYPOTHESIS-TESTING [J].
BERGER, JO ;
BROWN, LD ;
WOLPERT, RL .
ANNALS OF STATISTICS, 1994, 22 (04) :1787-1807
[8]   Default Bayes factors for nonnested hypothesis testing [J].
Berger, JO ;
Mortera, J .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (446) :542-554
[9]  
BIRNBAUM A, 1961, ANN MATH STAT, V32, P414, DOI 10.1214/aoms/1177705050
[10]   RECONCILING BAYESIAN AND FREQUENTIST EVIDENCE IN THE ONE-SIDED TESTING PROBLEM [J].
CASELLA, G ;
BERGER, RL .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1987, 82 (397) :106-111