On the use and misuse of scalar scores of confounders in design and analysis of observational studies

被引:10
作者
Pfeiffer, R. M. [1 ]
Riedl, R. [2 ]
机构
[1] NCI, Bethesda, MD 20892 USA
[2] Med Univ Graz, Inst Med Informat Stat & Documentat, Graz, Austria
关键词
misspecified models; matched case-control study; summary scores; confounder scores; balancing scores; treatment effect; MAXIMUM-LIKELIHOOD-ESTIMATION; DISEASE RISK SCORES; PROPENSITY-SCORE; REGRESSION; STRATIFICATION; MULTIVARIATE; POPULATION; COHORT;
D O I
10.1002/sim.6467
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We assess the asymptotic bias of estimates of exposure effects conditional on covariates when summary scores of confounders, instead of the confounders themselves, are used to analyze observational data. First, we study regression models for cohort data that are adjusted for summary scores. Second, we derive the asymptotic bias for case-control studies when cases and controls are matched on a summary score, and then analyzed either using conditional logistic regression or by unconditional logistic regression adjusted for the summary score. Two scores, the propensity score (PS) and the disease risk score (DRS) are studied in detail. For cohort analysis, when regression models are adjusted for the PS, the estimated conditional treatment effect is unbiased only for linear models, or at the null for non-linear models. Adjustment of cohort data for DRS yields unbiased estimates only for linear regression; all other estimates of exposure effects are biased. Matching cases and controls on DRS and analyzing them using conditional logistic regression yields unbiased estimates of exposure effect, whereas adjusting for the DRS in unconditional logistic regression yields biased estimates, even under the null hypothesis of no association. Matching cases and controls on the PS yield unbiased estimates only under the null for both conditional and unconditional logistic regression, adjusted for the PS. We study the bias for various confounding scenarios and compare our asymptotic results with those from simulations with limited sample sizes. To create realistic correlations among multiple confounders, we also based simulations on a real dataset. Copyright (c) 2015John Wiley & Sons, Ltd.
引用
收藏
页码:2618 / 2635
页数:18
相关论文
共 31 条
[1]   Effect of hydroxychloroquine on the survival of patients with systemic lupus erythematosus:: Data from LUMINA, a multiethnic US cohort (LUMINA L) [J].
Alarcon, Graciela S. ;
McGwin, Gerald ;
Bertoli, Ana M. ;
Fessler, Barri J. ;
Calvo-Alen, Jaime ;
Bastian, Holly M. ;
Vila, Luis M. ;
Reveille, John D. .
ANNALS OF THE RHEUMATIC DISEASES, 2007, 66 (09) :1168-1172
[2]   Control for Confounding in Case-Control Studies Using the Stratification Score, a Retrospective Balancing Score [J].
Allen, Andrew S. ;
Satten, Glen A. .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2011, 173 (07) :752-760
[3]   COX REGRESSION-MODEL FOR COUNTING-PROCESSES - A LARGE SAMPLE STUDY [J].
ANDERSEN, PK ;
GILL, RD .
ANNALS OF STATISTICS, 1982, 10 (04) :1100-1120
[4]  
[Anonymous], GEN LINEAR MODELS
[5]   Performance of Disease Risk Scores, Propensity Scores, and Traditional Multivariable Outcome Regression in the Presence of Multiple Confounders [J].
Arbogast, Patrick G. ;
Ray, Wayne A. .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2011, 174 (05) :613-620
[6]   Conditioning on the propensity score can result in biased estimation of common measures of treatment effect: A Monte Carlo study (p n/a) [J].
Austin, Peter C. ;
Grootendorst, Paul ;
Normand, Sharon-Lise T. ;
Anderson, Geoffrey M. .
STATISTICS IN MEDICINE, 2007, 26 (16) :3208-3210
[7]   An Introduction to Propensity Score Methods for Reducing the Effects of Confounding in Observational Studies [J].
Austin, Peter C. .
MULTIVARIATE BEHAVIORAL RESEARCH, 2011, 46 (03) :399-424
[8]   Statins Are Associated With a Reduced Risk of Hepatocellular Carcinoma in a Large Cohort of Patients With Diabetes [J].
El-Serag, Hashem B. ;
Johnson, Michael L. ;
Hachem, Christine ;
Morgana, Robert O. .
GASTROENTEROLOGY, 2009, 136 (05) :1601-1608
[9]   Statin use and risk of epilepsy A nested case-control study [J].
Etminan, Mahyar ;
Samii, Ali ;
Brophy, James M. .
NEUROLOGY, 2010, 75 (17) :1496-1500
[10]   Randomization does not justify logistic regression [J].
Freedman, David A. .
STATISTICAL SCIENCE, 2008, 23 (02) :237-249