Blurring the distinctions between p's and α's in psychological research

被引：71

作者：

Hubbard, R ^{[1
]}

机构：

[1] Drake Univ, Coll Business & Publ Adm, Des Moines, IA 50311 USA

来源：

THEORY & PSYCHOLOGY | 2004年 / 14卷 / 03期

关键词：

Fisher; hybrid statistical model; inductive behavior; inductive inference; Neyman-Pearson;

D O I：

10.1177/0959354304043638

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

Confusion over the reporting and interpretation of results of commonly employed classical statistical tests is recorded in a sample of 1,645 papers from 12 psychology journals for the period 1990 through 2002. The confusion arises because researchers mistakenly believe that their interpretation is guided by a single unified theory of statistical inference. But this is not so: classical statistical testing is a nameless amalgamation of the rival and often contradictory approaches developed by Ronald Fisher, on the one hand, and Jerzy Neyman and Egon Pearson, on the other. In particular, there is extensive failure to acknowledge the incompatibility of Fisher's evidential p value with the Type I error rate, alpha, of Neyman-Pearson statistical orthodoxy. The distinction between evidence (p's) and errors (alpha's) is not trivial. Rather, it reveals the basic differences underlying Fisher's ideas on significance testing and inductive inference, and Neyman-Pearson views on hypothesis testing and inductive behavior. So complete is this misunderstanding over measures of evidence versus error that it is not viewed as even being a problem among the vast majority of researchers and other relevant parties. These include the APA Task Force on Statistical Inference, and those writing the guidelines concerning statistical testing mandated in APA Publication Manuals. The result is that, despite supplanting Fisher's significance-testing paradigm some fifty years or so ago, recognizable applications of Neyman-Pearson theory are few and far between in psychology's empirical literature. On the other hand, Fisher's influence is ubiquitous.

引用

页码：295 / 327

页数：33

共 103 条

[1]

American Psychological Association, 2016, ETH PRINC PSYCH COD

[2] Null hypothesis testing: Problems, prevalence, and an alternative [J].

Anderson, DR ;

Burnham, KP ;

Thompson, WL .

JOURNAL OF WILDLIFE MANAGEMENT, 2000, 64 (04) :912-923

[3]

[Anonymous], 1979, Philosophical Problems of Statistical Inference: Learning from R. A. Fisher

[4]

[Anonymous], P SOC PSYCHICAL RES

[5]

[Anonymous], 1957, Review of the International Statistical Institute, DOI DOI 10.2307/1401671

[6]

[Anonymous], 1994, Educational and Psychological Measurement

[7]

[Anonymous], 1987, Statistical Science

[8]

[Anonymous], 1997, MULTIVAR APPL SER

[9]

[Anonymous], 1956, Statistical methods and scientific inference

[10]

[Anonymous], 1996, Statistical significance: Rationale, Validity and Utility

← 1 2 3 4 5 6 7 8 9 10 →