Splitting a Predictor at the Upper Quarter or Third and the Lower Quarter or Third

被引:122
作者
Gelman, Andrew [1 ,2 ]
Park, David K. [3 ]
机构
[1] Columbia Univ, Dept Stat, New York, NY 10027 USA
[2] Columbia Univ, Dept Polit Sci, New York, NY 10027 USA
[3] George Washington Univ, Dept Polit Sci, Washington, DC 20052 USA
关键词
Discretizing; Linear regression; Statistical communication; Trichotomizing; EXTREME GROUPS; SELECTION;
D O I
10.1198/tast.2009.0001
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
070103 [概率论与数理统计]; 140311 [社会设计与社会创新];
摘要
A linear regression of y on x can be approximated by a simple difference: the average values of y corresponding to the highest quarter or third of x, minus the average values of y corresponding to the lowest quarter or third of x. A simple theoretical analysis, similar to analyses that have been done in psychometrics, shows this comparison to perform reasonably well, with 80%-90% efficiency compared to the regression if the predictor is uniformly or normally distributed. By discretizing x into three categories, we claw back about half the efficiency lost by the commonly used strategy of dichotomizing the predictor. We illustrate with the example that motivated our research: an analysis of income and voting which we had originally performed for a scholarly journal but then wanted to communicate to a general audience.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 22 条
[1]
[Anonymous], 2006, POLARIZED AM DANCE P
[2]
Income, economic voting, and long-term political change in the US, 1952-1996 [J].
Brooks, C ;
Brady, D .
SOCIAL FORCES, 1999, 77 (04) :1339-1374
[3]
NOTE ON GROUPING [J].
COX, DR .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1957, 52 (280) :543-547
[4]
THE UPPER AND LOWER 27 PER-CENT RULE [J].
CURETON, EE .
PSYCHOMETRIKA, 1957, 22 (03) :293-296
[5]
27 PERCENT RULE REVISITED [J].
DAGOSTINO, RB ;
CURETON, EE .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1975, 35 (01) :47-50
[6]
THE USE OF EXTREME GROUPS TO TEST FOR THE PRESENCE OF A RELATIONSHIP [J].
FELDT, LS .
PSYCHOMETRIKA, 1961, 26 (03) :307-316
[8]
Rich state, poor state, red state, blue state: What's the matter with connecticut? [J].
Gelman, Andrew ;
Shor, Boris ;
Bafumi, Joseph ;
Park, David .
QUARTERLY JOURNAL OF POLITICAL SCIENCE, 2007, 2 (04) :345-367
[9]
Scaling regression inputs by dividing by two standard deviations [J].
Gelman, Andrew .
STATISTICS IN MEDICINE, 2008, 27 (15) :2865-2873
[10]
AVERAGE PREDICTIVE COMPARISONS FOR MODELS WITH NONLINEARITY, INTERACTIONS, AND VARIANCE COMPONENTS [J].
Gelman, Andrew ;
Pardoe, Iain .
SOCIOLOGICAL METHODOLOGY 2007, VOL 37, 2007, 37 :23-51