P-values and confidence intervals: Two sides of the same unsatisfactory coin

被引:68
作者
Feinstein, AR [1 ]
机构
[1] Yale Univ, Sch Med, New Haven, CT 06510 USA
关键词
D O I
10.1016/S0895-4356(97)00295-3
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
For both P-values and confidence intervals, an ct level is chosen to set limits of acceptable probability for the role of chance in the observed distinctions. The level of a is used either for direct comparison with a single P-value, or for determining the extent of a confidence interval. "Statistical significance" is proclaimed if the calculations yield a P-value that is below alpha, or a 1 - alpha confidence interval whose range excludes the null result of "no difference." Both the P-value and confidence-interval methods are essentially reciprocal, since they use the same principles of probabilistic calculation; and both can yield distorted or misleading results if the data do not adequately conform to the underlying mathematical requirements. The major scientific disadvantage of both methods is that their "significance" is merely an inference derived from principles of mathematical probability, not an evaluation of substantive importance for the "big" or "small" magnitude of the observed distinction. The latter evaluation has not received adequate attention during the emphasis on probabilistic decisions; and cartful principles have not been developed either for the substantive reasoning or for setting appropriate boundaries for "big" or "small." After a century of "significance" inferred exclusively from probabilities, a basic scientific challenge is to develop methods for deciding what is substantively impressive or trivial. (C) 1998 Elsevier Science Inc.
引用
收藏
页码:355 / 360
页数:6
相关论文
共 16 条
[1]  
[Anonymous], 1965, J PHYSIOL-LONDON, DOI DOI 10.1113/JPHYSIOL.1965.SP007639
[2]  
[Anonymous], 1996, BAYESIAN BIOSTATISTI
[3]  
[Anonymous], 1925, MATH PROC CAMBRIDGE
[4]   INDEXES AND BOUNDARIES FOR QUANTITATIVE SIGNIFICANCE IN STATISTICAL DECISIONS [J].
BURNAND, B ;
KERNAN, WN ;
FEINSTEIN, AR .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 1990, 43 (12) :1273-1284
[5]  
Cohen J., 1965, HDB CLIN PSYCHOL, P95
[6]  
FISHER RA, 1970, STAT METHODS RES WOR, P44
[7]  
FISHER RA, 1959, STATISTICAL METHODS, P42
[8]  
FLEISS JL, 1986, AM J PUBLIC HEALTH, V76, P587, DOI 10.2105/AJPH.76.5.587
[9]   CONFIDENCE-INTERVALS RATHER THAN P-VALUES - ESTIMATION RATHER THAN HYPOTHESIS-TESTING [J].
GARDNER, MJ ;
ALTMAN, DG .
BMJ-BRITISH MEDICAL JOURNAL, 1986, 292 (6522) :746-750
[10]  
GAVARRET J, 1840, PRINCIPLES GENERAUX, P286