Null hypothesis testing: Problems, prevalence, and an alternative

被引:1383
作者
Anderson, DR
Burnham, KP
Thompson, WL
机构
[1] Colorado State Univ, Colorado Cooperat Fish & Wildlife Res Unit, Ft Collins, CO 80523 USA
[2] US Forest Serv, Rocky Mtn Res Stn, Boise, ID 83702 USA
关键词
AIC; Akaike weights; Ecology; information theory; Journal of Wildlife Management; Kullback-Leibler information; model selection; null hypothesis; P-values; significance tests;
D O I
10.2307/3803199
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 [生物信息与计算生物学]; 0713 [生态学];
摘要
This paper presents a review and critique of statistical null hypothesis testing in ecological studies in general, and wildlife studies in particular, and describes an alternative. Our review of Ecology and the journal of Wildlife Management found the use of null hypothesis testing to be pervasive. The estimated number of P-values appearing within articles of Ecology exceeded 8,000 in 1991 and has exceeded 3,000 in each year since 1984, whereas the estimated number of P-values in the Journal of Wildlife Management exceeded 8,000 in 1997 and has exceeded 3,000 in each year since 1991. We estimated that 47% (SE = 3.9%) of the P-values in the Journal of Wildlife;fe Management lacked estimates of means or effect sizes or even the sign of the difference in means or other parameters. We find that null hypothesis testing is uninformative when no estimates of means or effect size and their precision are given. Contrary to common dogma, tests of statistical null hypotheses have relatively little utility in science and are not a fundamental aspect of the scientific method. We recommend their use be reduced in favor of more informative approaches. Towards this objective, we describe a relatively new paradigm of data analysis based on Kullback-Leibler information. This paradigm is an extension of likelihood theory and, when used correctly, avoids many of the fundamental limitations and common misuses of null hypothesis testing. Information-theoretic methods focus on providing a strength of evidence for an a priori set of alternative hypotheses, rather than a statistical test of a null hypothesis. This paradigm allows the following types of evidence for the alternative hypotheses: the rank of each hypothesis, expressed as a model; an estimate of the formal likelihood of each model, given the data; a measure of precision that incorporates model selection uncertainty; and simple methods to allow the use of the set of alternative models in making formal inference. We provide an example of the information-theoretic approach using data on the effect of lead on survival in spectacled elder ducks (Somateria fischeri). Regardless of the analysis paradigm used, we strongly recommend inferences based on a priori considerations be clearly separated from those resulting from some form of data dredging.
引用
收藏
页码:912 / 923
页数:12
相关论文
共 55 条
[1]
NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]
Akaike H., 1973, 2 INT S INFORM THEOR, P267, DOI [DOI 10.1007/978-1-4612-1694-0_15, 10.1007/978-1-4612-1694-0_15]
[3]
[Anonymous], 1970, The Significance Test Controversy
[4]
[Anonymous], 1989, EMPIRE CHSNCE PROBAB
[5]
BERGER JO, 1988, AM SCI, V76, P159
[6]
BERGER JO, 1987, J AM STAT ASSOC, V82, P112, DOI 10.2307/2289131
[7]
Some difficulties of interpretation encountered in the application of the chi-square test [J].
Berkson, J .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1938, 33 (203) :526-536
[8]
Tests of significance considered as evidence [J].
Berkson, J .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1942, 37 (219) :325-335
[9]
Model selection: An integral part of inference [J].
Buckland, ST ;
Burnham, KP ;
Augustin, NH .
BIOMETRICS, 1997, 53 (02) :603-618
[10]
Burnham K. P., 1998, MODEL SELECTION INFE