Pisces did not have increased heart failure: data-driven comparisons of binary proportions between levels of a categorical variable can result in incorrect statistical significance levels

被引:11
作者
Austin, Peter C. [1 ,2 ,3 ]
Goldwasser, Meredith A. [1 ]
机构
[1] Inst Clin Evaluat Sci, Toronto, ON M4N 3M5, Canada
[2] Univ Toronto, Dept Publ Hlth Sci, Toronto, ON, Canada
[3] Univ Toronto, Dept Hlth Policy Management & Evaluat, Toronto, ON, Canada
基金
加拿大健康研究院;
关键词
chi-squared test; maximal proportion; type I error rate; contingency table; astrology; significance testing;
D O I
10.1016/j.jclinepi.2007.05.007
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objective: We examined the impact on statistical inference when a chi(2) test is used to compare the proportion of successes in the level of a categorical variable that has the highest observed proportion of successes with the proportion of successes in all other levels of the categorical variable combined. Study Design and Setting: Monte Carlo simulations and a case study examining the association between astrological sign and hospitalization for heart failure. Results: A standard chi(2) test results in an inflation of the type I error rate, with the type I error rate increasing as the number of levels of the categorical. variable increases. Using a standard X test, the hospitalization rate for Pisces was statistically significantly different from that of the other 11 astrological signs combined (P = 0.026). After accounting for the fact that the selection of Pisces was based on it having the highest observed proportion of heart failure hospitalizations, subjects born under the sign of Pisces no longer had a significantly higher rate of heart failure hospitalization compared to the other residents of Ontario (P = 0.152). Conclusions: Post hoc comparisons of the proportions of successes across different levels of a categorical variable can result in incorrect inferences. (C) 2008 Elsevier Inc. All rights reserved.
引用
收藏
页码:295 / 300
页数:6
相关论文
共 13 条
[1]  
AGRESTI A., 2019, INTRO CATEGORICAL DA
[2]   Automated variable selection methods for logistic regression produced unstable models for predicting acute myocardial infarction mortality [J].
Austin, PC ;
Tu, JV .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2004, 57 (11) :1138-1146
[3]   Bootstrap methods for developing predictive models [J].
Austin, PC ;
Tu, JV .
AMERICAN STATISTICIAN, 2004, 58 (02) :131-137
[4]   Inflation of the type I error rate when a continuous confounding variable is categorized in logistic regression analyses [J].
Austin, PC ;
Brunner, LJ .
STATISTICS IN MEDICINE, 2004, 23 (07) :1159-1178
[5]   Testing multiple statistical hypotheses resulted in spurious associations: A study of astrological signs and health [J].
Austin, Peter C. ;
Mamdani, Muhammad M. ;
Juurlink, David N. ;
Hux, Janet E. .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2006, 59 (09) :964-969
[6]   Maximally selected χ2 statistics for kx2 tables [J].
Betensky, RA ;
Rabinowitz, D .
BIOMETRICS, 1999, 55 (01) :317-320
[7]   Maximally selected chi-square statistics for ordinal variables [J].
Boulesteix, AL .
BIOMETRICAL JOURNAL, 2006, 48 (03) :451-462
[8]   Maximally selected chi-square statistics and binary splits of nominal variables [J].
Boulesteix, Anne-Laure .
BIOMETRICAL JOURNAL, 2006, 48 (05) :838-848
[9]  
COPAS JB, 1991, STATISTICIAN, V40, P51
[10]   MAXIMALLY SELECTED CHI-SQUARE STATISTICS FOR SMALL SAMPLES [J].
HALPERN, J .
BIOMETRICS, 1982, 38 (04) :1017-1023