CASE-CONTROL STUDIES WITH ERRORS IN COVARIATES

被引：67

作者：

CARROLL, RJ ^{[1
]}

GAIL, MH ^{[1
]}

LUBIN, JH ^{[1
]}

机构：

[1] NCI, EPIDEMIOL METHODS SECT, BETHESDA, MD 20892 USA

来源：

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION | 1993年 / 88卷 / 421期

关键词：

ASYMPTOTICS; CASE-CONTROL STUDY; DIFFERENTIAL MISCLASSIFICATION; ERRORS IN VARIABLES; LOGISTIC REGRESSION; PSEUDOLIKELIHOOD;

D O I：

10.2307/2290713

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

We devise methods for estimating the parameters of a prospective logistic model with dichotomous response D and arbitrary covariates X from case-control data when these covariates are measured with error. We suppose that some fraction of the cases and controls provide only the error-prone covariate measurements, W (the ''incomplete'' or ''reduced'' data), whereas some of the cases and controls provide measurements on X and W (the ''complete'' data). We assume a measurement error density with a finite set of parameters alpha, namely f(W\XD)(w\x, d, alpha), and nondifferential error is treated as a special case of this model, f(W\X)(w\x, alpha). Our algorithm estimates both the logistic parameters and alpha from a pseudolikelihood. Because empirical distribution functions are used in place of needed distributions in the pseudolikelihoods, the required asymptotic theory is more elaborate than for pseudolikelihoods based on substitution for a finite number of nuisance parameters. We also examine computationally simpler methods under the assumptions that the disease is rare and that errors are nondifferential. Estimates of m(W) = E(X\W) are substituted for X in the logistic model when X is not available. Such estimates of m(W) can be obtained from the complete data described above or from an independent validation study. If measurements on X are not available, m(W) can still be estimated from replicated W measurements in some circumstances. A final approach uses approximate logistic regression techniques and is appropriate when a more accurate approximation is required than obtained by simply substituting m(W) for X. Asymptotic theory is presented for each of these procedures, and examples are used to illustrate the calculations.

引用

页码：185 / 199

页数：15

共 39 条

[1] MAXIMUM-LIKELIHOOD ESTIMATION OF PARAMETERS SUBJECT TO RESTRAINTS
AITCHISON, J
SILVEY, SD
[J]. ANNALS OF MATHEMATICAL STATISTICS, 1958, 29 (03): : 813 - 828
[2] SEPARATE SAMPLE LOGISTIC DISCRIMINATION
ANDERSON, JA
[J]. BIOMETRIKA, 1972, 59 (01) : 19 - 35
[3] MEASUREMENT ERROR IN THE GENERALIZED LINEAR-MODEL
ARMSTRONG, B
[J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 1985, 14 (03) : 529 - 544
[4] ANALYSIS OF CASE-CONTROL DATA WITH COVARIATE MEASUREMENT ERROR - APPLICATION TO DIET AND COLON CANCER
ARMSTRONG, BG
WHITTEMORE, AS
HOWE, GR
[J]. STATISTICS IN MEDICINE, 1989, 8 (09) : 1151 - 1163
[5] ARE THERE 2 LOGISTIC REGRESSIONS FOR RETROSPECTIVE STUDIES
BRESLOW, N
POWERS, W
[J]. BIOMETRICS, 1978, 34 (01) : 100 - 105
[6] DOUBLE SAMPLING FOR EXACT VALUES IN THE NORMAL DISCRIMINANT MODEL WITH APPLICATION TO BINARY REGRESSION
BUONACCORSI, JP
[J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1990, 19 (12) : 4569 - 4586
[7] ON ERRORS-IN-VARIABLES FOR BINARY REGRESSION-MODELS
CARROLL, RJ
SPIEGELMAN, CH
LAN, KKG
BAILEY, KT
ABBOTT, RD
[J]. BIOMETRIKA, 1984, 71 (01) : 19 - 25
[8] CARROLL RJ, 1991, J ROY STAT SOC B MET, V53, P573
[9] APPROXIMATE QUASI-LIKELIHOOD ESTIMATION IN MODELS WITH SURROGATE PREDICTORS
CARROLL, RJ
STEFANSKI, LA
[J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1990, 85 (411) : 652 - 663
[10] A REVIEW OF METHODS FOR MISCLASSIFIED CATEGORICAL-DATA IN EPIDEMIOLOGY
CHEN, TT
[J]. STATISTICS IN MEDICINE, 1989, 8 (09) : 1095 - 1106

← 1 2 3 4 →