Comparison of logistic regression versus propensity score when the number of events is low and there are multiple confounders

被引:668
作者
Cepeda, MS
Boston, R
Farrar, JT
Strom, BL
机构
[1] Univ Penn, Sch Med, Ctr Clin Epidemiol & Biostat, Philadelphia, PA 19104 USA
[2] Univ Penn, Sch Med, Dept Biostat & Epidemiol, Philadelphia, PA 19104 USA
[3] Javeriana Univ, Sch Med, Bogota, Colombia
[4] Univ Penn, Sch Vet Med, New Bolton Ctr, Philadelphia, PA 19104 USA
关键词
bias (epidemiology); confounding factors (epidemiology); logistic models; models; statistical;
D O I
10.1093/aje/kwg115
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
The aim of this study was to use Monte Carlo simulations to compare logistic regression with propensity scores in terms of bias, precision, empirical coverage probability, empirical power, and robustness when the number of events is low relative to the number of confounders. The authors simulated a cohort study and performed 252,480 trials. In the logistic regression, the bias decreased as the number of events per confounder increased. In the propensity score, the bias decreased as the strength of the association of the exposure with the outcome increased. Propensity scores produced estimates that were less biased, more robust, and more precise than the logistic regression estimates. when there were seven or fewer events per confounder. The logistic regression empirical coverage probability increased as the number of events per confounder increased. The propensity score empirical coverage probability decreased after eight or more events per confounder. Overall, the propensity score exhibited more empirical power than logistic regression. Propensity scores are a good alternative to control for imbalances when there are seven or fewer events per confounder; however, empirical power could range from 35% to 60%. Logistic regression is the technique of choice when there are at least eight events per confounder.
引用
收藏
页码:280 / 287
页数:8
相关论文
共 33 条
[1]  
[Anonymous], ENCY BIOSTATISTICS
[2]  
ARRESTI A, 1996, INTRO CATEGORICAL DA, P71
[3]  
Bergstralh EJ, 1996, EPIDEMIOLOGY, V7, P331
[4]  
BERRY WD, 1985, MULTIPLE REGRESSION, P9, DOI DOI 10.1016/S1352-2310(02)01000-2
[5]  
CASELLA GEORGE, 1990, STAT INFERENCE, V345, P345
[6]  
Cepeda MS, 2000, PHARMACOEPIDEM DR S, V9, P103, DOI 10.1002/(SICI)1099-1557(200003/04)9:2<103::AID-PDS477>3.0.CO
[7]  
2-6
[8]   EFFECTIVENESS OF ADJUSTMENT BY SUBCLASSIFICATION IN REMOVING BIAS IN OBSERVATIONAL STUDIES [J].
COCHRAN, WG .
BIOMETRICS, 1968, 24 (02) :295-&
[9]   Importance of events per independent variable in proportional hazards analysis .1. Background, goals, and general strategy [J].
Concato, J ;
Peduzzi, P ;
Holford, TR ;
Feinstein, AR .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 1995, 48 (12) :1495-1501
[10]   THE RISK OF DETERMINING RISK WITH MULTIVARIABLE MODELS [J].
CONCATO, J ;
FEINSTEIN, AR ;
HOLFORD, TR .
ANNALS OF INTERNAL MEDICINE, 1993, 118 (03) :201-210