Precis of Statistical significance: Rationale, validity, and utility

被引：63

作者：

Chow, SL ^{[1
]}

机构：

[1] Univ Regina, Dept Psychol, Regina, SK S4S 0A2, Canada

来源：

BEHAVIORAL AND BRAIN SCIENCES | 1998年 / 21卷 / 02期

关键词：

Bayesianism; effect size; null hypothesis; statistical hypothesis testing; statistical significance; theory corroboration;

D O I：

10.1017/S0140525X98001162

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

The null-hypothesis significance-test procedure (NHSTP) is defended in the context of the theory-corroboration experiment, as well as the following contrasts: (a) substantive hypotheses versus statistical hypotheses, (b) theory corroboration versus statistical hypothesis testing, (c) theoretical inference versus statistical decision, (d) experiments versus nonexperimental studies, and (e) theory corroboration versus treatment assessment. The null hypothesis can be true because it is the hypothesis that errors are randomly distributed in data. Moreover, the null hypothesis is never used as a categorical proposition. Statistical significance means only that chance influences can be excluded as an explanation of data; it does not identify the nonchance factor responsible. The experimental conclusion is drawn with the inductive principle underlying the experimental design. A chain of deductive arguments gives rise to the theoretical conclusion via the experimental conclusion. The anomalous relationship between statistical significance and the effect size often used to criticize NHSTP is more apparent than real. The absolute size of the effect is not an index of evidential support for the substantive hypothesis. Nor is the effect size, by itself, informative as to the practical importance of the the research result. Being a conditional probability, statistical power cannot be the a priori probability of statistical significance. The validity of statistical power is debatable because statistical significance is determined with a single sampling distribution of the test statistic based on H-0, whereas it takes two distributions to represent statistical power or effect size. Sample size should not be determined in the mechanical manner envisaged in power analysis. It is inappropriate to criticize NHSTP for nonstatistical reasons. At the same time, neither effect size, nor confidence interval estimate, nor posterior probability can be used to exclude chance as an explanation of data. Neither can any of them fulfill the nonstatistical functions expected of them by critics.

引用

页码：169 / +

页数：31

共 89 条

[1]

[Anonymous], 1965, HDB CLIN PSYCHOL

[2]

[Anonymous], 1957, STAT THEORY RELATION

[3]

[Anonymous], 1986, LOGIC SCI DISCOVERY

[4]

[Anonymous], 1986, EPIDEMIOLOGY RESOUR

[5] TEST OF SIGNIFICANCE IN PSYCHOLOGICAL RESEARCH [J].

BAKAN, D .

PSYCHOLOGICAL BULLETIN, 1966, 66 (06) :423-&

[6] THE NATURE AND HISTORY OF EXPERIMENTAL CONTROL [J].

BORING, EG .

AMERICAN JOURNAL OF PSYCHOLOGY, 1954, 67 (04) :573-589

[7]

Campbell D.T., 1963, Experimental and quasi-experimental designs for research

[8]

CAMPBELL DT, 1969, ARTIFACT BEHAV RES

[9]

CHOMSKY Noam, 1957, SYNTACTIC STRUCTURES, DOI 10.1515/9783112316009

[10] SIGNIFICANCE TEST OR EFFECT SIZE [J].

CHOW, SL .

PSYCHOLOGICAL BULLETIN, 1988, 103 (01) :105-110

← 1 2 3 4 5 6 7 8 9 →