Type I and Type II error concerns in fMRI research: re-balancing the scale

被引:1058
作者
Lieberman, Matthew D. [1 ,2 ,3 ]
Cunningham, William A. [4 ]
机构
[1] Univ Calif Los Angeles, Dept Psychol, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Psychiat, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Dept Biobehav Sci, Los Angeles, CA 90095 USA
[4] Ohio State Univ, Dept Psychol, Columbus, OH 43210 USA
关键词
social cognitive neuroscience; MR statistics; type II error;
D O I
10.1093/scan/nsp052
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Statistical thresholding (i.e. P-values) in fMRI research has become increasingly conservative over the past decade in an attempt to diminish Type I errors (i.e. false alarms) to a level traditionally allowed in behavioral science research. In this article, we examine the unintended negative consequences of this single-minded devotion to Type I errors: increased Type II errors (i.e. missing true effects), a bias toward studying large rather than small effects, a bias toward observing sensory and motor processes rather than complex cognitive and affective processes and deficient meta-analyses. Power analyses indicate that the reductions in acceptable P-values over time are producing dramatic increases in the Type II error rate. Moreover, the push for a mapwide false discovery rate (FDR) of 0.05 is based on the assumption that this is the FDR in most behavioral research; however, this is an inaccurate assessment of the conventions in actual behavioral research. We report simulations demonstrating that combined intensity and cluster size thresholds such as P < 0.005 with a 10 voxel extent produce a desirable balance between Types I and II error rates. This joint threshold produces high but acceptable Type II error rates and produces a FDR that is comparable to the effective FDR in typical behavioral science articles (while a 20 voxel extent threshold produces an actual FDR of 0.05 with relatively common imaging parameters). We recommend a greater focus on replication and meta-analysis rather than emphasizing single studies as the unit of analysis for establishing scientific truth. From this perspective, Type I errors are self-erasing because they will not replicate, thus allowing for more lenient thresholding to avoid Type II errors.
引用
收藏
页码:423 / 428
页数:6
相关论文
共 9 条
  • [1] AFNI: Software for analysis and visualization of functional magnetic resonance neuroimages
    Cox, RW
    [J]. COMPUTERS AND BIOMEDICAL RESEARCH, 1996, 29 (03): : 162 - 173
  • [2] Fisher R. A., 1926, J MINISTRY AGR GREAT, V33, P503, DOI DOI 10.1007/978-1-4612-4380-9_8
  • [3] IMPROVED ASSESSMENT OF SIGNIFICANT ACTIVATION IN FUNCTIONAL MAGNETIC-RESONANCE-IMAGING (FMRI) - USE OF A CLUSTER-SIZE THRESHOLD
    FORMAN, SD
    COHEN, JD
    FITZGERALD, M
    EDDY, WF
    MINTUN, MA
    NOLL, DC
    [J]. MAGNETIC RESONANCE IN MEDICINE, 1995, 33 (05) : 636 - 647
  • [4] Thresholding of statistical maps in functional neuroimaging using the false discovery rate
    Genovese, CR
    Lazar, NA
    Nichols, T
    [J]. NEUROIMAGE, 2002, 15 (04) : 870 - 878
  • [5] GRIFFIN DW, 1991, ADV EXP SOC PSYCHOL, V24, P319
  • [6] Correlations in Social Neuroscience Aren't Voodoo: Commentary on Vul et al. (2009)
    Lieberman, Matthew D.
    Berkman, Elliot T.
    Wager, Tor D.
    [J]. PERSPECTIVES ON PSYCHOLOGICAL SCIENCE, 2009, 4 (03) : 299 - 307
  • [7] OCHSNER K, 2007, NEUROSCIENCE INTEGRA, P106
  • [8] Vul E, 2009, PERSPECT PSYCHOL SCI, V4, P274, DOI 10.1111/j.1745-6924.2009.01125.x
  • [9] Building memories: Remembering and forgetting of verbal experiences as predicted by brain activity
    Wagner, AD
    Schacter, DL
    Rotte, M
    Koutstaal, W
    Maril, A
    Dale, AM
    Rosen, BR
    Buckner, RL
    [J]. SCIENCE, 1998, 281 (5380) : 1188 - 1191