ON THE EFFECTS OF PREDICTOR MISCLASSIFICATION IN MULTIPLE LINEAR-REGRESSION ANALYSIS

被引:5
作者
CHRISTOPHER, SR
KUPPER, LL
机构
[1] RES TRIANGLE INST,RES TRIANGLE PK,NC 27709
[2] UNIV N CAROLINA,DEPT BIOSTAT,CHAPEL HILL,NC 27599
关键词
MEASUREMENT ERROR; CATEGORICAL VARIABLES; LEAST SQUARES REGRESSION;
D O I
10.1080/03610929508831472
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Unweighted least squares regression analysis procedures are frequently used to model the relationship between a continuous response variable and one or more categorical predictor variables. Very commonly, these categorical predictors are subject to misclassification error. The majority of research in the area of mismeasurement of predictors in linear regression analysis has focused on continuous predictors (i.e., on errors-in-variables models). The theory developed for these situations relies on assumptions that do not apply to categorical predictors. We examine the impact of categorical predictor misclassification on unweighted least squares regression analysis results based on models that may also include any number of perfectly measured continuous or categorical predictors. Distributional properties of the response variable conditional on the potentially misclassified observed data are determined. These properties are used to examine the bias properties of estimators of regression coefficients and their estimated variances for models fitted using the observed data. In particular, we show that the bias of estimated regression coefficients based on the use of misclassified categorical predictors can be away from the null. The impact of predictor misclassification on certain test statistics is also explored.
引用
收藏
页码:13 / 37
页数:25
相关论文
共 25 条
[1]  
BARTLETT MS, 1949, BIOMETRICS, V6, P207
[2]  
BOX GEP, 1961, B INT STAT I, V38, P339
[3]  
CARROLL RJ, 1989, STAT MED, V8, P1075
[5]   ERRORS OF MEASUREMENT IN STATISTICS [J].
COCHRAN, WG .
TECHNOMETRICS, 1968, 10 (04) :637-&
[6]   ESTIMATION IN MIXTURES OF 2 NORMAL DISTRIBUTIONS [J].
COHEN, AC .
TECHNOMETRICS, 1967, 9 (01) :15-&
[7]   ESTIMATING COMPONENTS OF A MIXTURE OF NORMAL DISTRIBUTIONS [J].
DAY, NE .
BIOMETRIKA, 1969, 56 (03) :463-&
[8]  
Fuller W.A., 2009, MEASUREMENT ERROR MO, V305, DOI DOI 10.1002/9780470316665
[9]   MISCLASSIFICATION AND THE DESIGN OF ENVIRONMENTAL-STUDIES [J].
GLADEN, B ;
ROGAN, WJ .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1979, 109 (05) :607-616
[10]  
GRAYBILL FA, 1969, INTRO MATRICES APPLI