Maximally selected chi-square statistics and binary splits of nominal variables

Cited by: 21
Author
Boulesteix, Anne-Laure [1]
Affiliation
[1] Tech Univ Munich, Dept Med Stat & Epidemiol, D-8000 Munich, Germany
Keywords
categorical variables; association test; contingency table; exact distribution; variable selection; selection bias
DOI
10.1002/bimj.200510191
CLC number
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
We address the problem of maximally selected chi-square statistics in the case of a binary Y variable and a nominal X variable with several categories. The distribution of the maximally selected chi-square statistic has already been derived when the best cutpoint is chosen from a continuous or an ordinal X, but not when the best split is chosen from a nominal X. In this paper, we derive the exact distribution of the maximally selected chi-square statistic in this case using a combinatorial approach. Applications of the derived distribution to variable selection and hypothesis testing are discussed based on simulations. As an illustration, our method is applied to a birth data set.
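To make the statistic described in the abstract concrete, here is a minimal sketch of how a maximally selected chi-square statistic could be computed for a binary Y and a nominal X: every binary split of the categories of X is tried, and the largest 2x2 Pearson chi-square value is kept. The function names (`chi_square_2x2`, `max_selected_chi_square`) and the toy counts are illustrative assumptions, not taken from the paper, and the sketch computes only the statistic itself, not the exact distribution the paper derives.

```python
from itertools import combinations


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square statistic (no continuity correction) for [[a, b], [c, d]]."""
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:  # a zero margin: the statistic is undefined, treat it as 0
        return 0.0
    return n * (a * d - b * c) ** 2 / denom


def max_selected_chi_square(counts):
    """Maximize the 2x2 chi-square statistic over all binary splits of X.

    counts maps each category of X to a pair (n_y0, n_y1): counts for Y = 0 and Y = 1.
    Returns (best_statistic, categories_on_the_left_side).
    """
    cats = sorted(counts)
    total0 = sum(n0 for n0, _ in counts.values())
    total1 = sum(n1 for _, n1 in counts.values())
    first, rest = cats[0], cats[1:]
    best_stat, best_left = 0.0, None
    # Pin the first category to the left side so each split is visited once:
    # K categories give 2**(K - 1) - 1 distinct binary splits.
    for r in range(len(rest)):  # r < len(rest) keeps the right side non-empty
        for extra in combinations(rest, r):
            left = (first,) + extra
            a = sum(counts[c][0] for c in left)
            b = sum(counts[c][1] for c in left)
            stat = chi_square_2x2(a, b, total0 - a, total1 - b)
            if stat > best_stat:
                best_stat, best_left = stat, left
    return best_stat, best_left


# Hypothetical toy data, not taken from the paper.
toy_counts = {"A": (10, 0), "B": (0, 10), "C": (10, 0)}
best_stat, best_left = max_selected_chi_square(toy_counts)
```

Because the maximum is taken over a number of splits that grows exponentially with the number of categories, comparing this value against an ordinary chi-square reference distribution with one degree of freedom would overstate significance; that is the selection problem the derived exact distribution is meant to address.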
Pages: 838-848
Page count: 11
Related papers
27 in total
[1]   Admissibility of exact conditional tests of stochastic order [J].
Berger, VW .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1998, 66 (01) :39-50
[2]  
BERGER VW, 2002, JMASM, V1, P269, DOI 10.22237/JMASM/1036108980
[3]   Maximally selected χ² statistics for k×2 tables [J].
Betensky, RA ;
Rabinowitz, D .
BIOMETRICS, 1999, 55 (01) :317-320
[4]   Maximally selected chi-square statistics for ordinal variables [J].
Boulesteix, AL .
BIOMETRICAL JOURNAL, 2006, 48 (03) :451-462
[5]  
Breiman L., 1998, CLASSIFICATION REGRE
[6]   Obstacles to reducing cesarean rates in a low-cesarean setting: The effect of maternal age, height, and weight [J].
Cnattingius, R ;
Cnattingius, S ;
Notzon, FC .
OBSTETRICS AND GYNECOLOGY, 1998, 92 (04) :501-506
[7]   GENERALIZATION OF ONE-SIDED 2-SAMPLE KOLMOGOROV-SMIRNOV STATISTIC FOR EVALUATING DIAGNOSTIC TESTS [J].
GAIL, MH ;
GREEN, SB .
BIOMETRICS, 1976, 32 (03) :561-570
[8]   Minimally selected p and other tests for a single abrupt changepoint in a binary sequence [J].
Halpern, AL .
BIOMETRICS, 1999, 55 (04) :1044-1050
[9]   MAXIMALLY SELECTED CHI-SQUARE STATISTICS FOR SMALL SAMPLES [J].
HALPERN, J .
BIOMETRICS, 1982, 38 (04) :1017-1023
[10]  
Hothorn T, 2003, COMPUT STAT DATA AN, V43, P121, DOI 10.1016/S0167-9473(02)00225-6