Matching using estimated propensity scores: Relating theory to practice

被引:880
作者
Rubin, DB [1 ]
Thomas, N [1 ]
机构
[1] UNIV N CAROLINA, CHAPEL HILL, NC 27514 USA
关键词
bias reduction; nonrandomized studies; observational studies;
D O I
10.2307/2533160
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Matched sampling is a standard technique in the evaluation of treatments in observational studies. Matching on estimated propensity scores comprises an important class of procedures when there are numerous matching variables. Recent theoretical work (Rubin, D. B. and Thomas, N., 1992, The Annals of Statistics 20, 1079-1093) on affinely invariant matching methods with ellipsoidal distributions provides a general framework for evaluationg the operating characteristics of such methods. Moreover, Rubin and Thomas (1992, Biometrika 79, 797-809) uses this framework to derive several analytic approximations under normality for the distribution of the first two moments of the matching variables in samples obtained by matching on estimated linear propensity scores. Here we provide a bridge between these theoretical approximations and actual practice. First, we complete and refine the nomal-based analytic approximations, thereby making it possible to apply these results to practice. Second, we perform Monte Carlo evaluations of the analytic results under normal and nonnormal ellipsoidal distributions, which confirm the accuracy of the analytic approximations, and demonstrate the predictable ways in which the approximations deviate from simulation results when normal assumptions are violated within the ellipsoidal family. Third, we apply the analytic approximations to real data with clearly nonellipsoidal distributions, and show that the thoretical expressions, although derived under artificial distributional conditions, produce useful guidance for practice. Our results delineate the wide range of settings in which matching on estimated Linear propensity scores performs well, thereby providing useful information for the design of matching studies. When matching with a particular data set, our theoretical approximations provide benchmarks for expected performance under favorable conditions, thereby identifying matching variables requiring special treatment. After matching is complete and data analysis is at hand, our results provide the variances required to compute valid standard errors for common estimators.
引用
收藏
页码:249 / 264
页数:16
相关论文
共 26 条
[1]   LOWER MEDICARE MORTALITY AMONG A SET OF HOSPITALS KNOWN FOR GOOD NURSING-CARE [J].
AIKEN, LH ;
SMITH, HL ;
LAKE, ET .
MEDICAL CARE, 1994, 32 (08) :771-787
[2]   ROBUSTNESS OF FISHERS LINEAR DISCRIMINANT FUNCTION UNDER 2-COMPONENT MIXED NORMAL-MODELS [J].
ASHIKAGA, T ;
CHANG, PC .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1981, 76 (375) :676-680
[3]  
COCHRAN WG, 1973, SANKHYA SER A, V35, P417
[4]   THE PLANNING OF OBSERVATIONAL STUDIES OF HUMAN-POPULATIONS [J].
COCHRAN, WG .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-GENERAL, 1965, 128 (02) :234-266
[5]  
Cox D.R., 1974, THEORETICAL STAT
[6]   ASYMPTOTICS OF GRAPHICAL PROJECTION PURSUIT [J].
DIACONIS, P ;
FREEDMAN, D .
ANNALS OF STATISTICS, 1984, 12 (03) :793-815
[7]   EFFECTS OF MISSPECIFICATION OF THE PROPENSITY SCORE ON ESTIMATORS OF TREATMENT EFFECT [J].
DRAKE, C .
BIOMETRICS, 1993, 49 (04) :1231-1236
[8]  
EASTWOOD EA, 1988, AM J MENT RETARD, V93, P75
[9]   EFFICIENCY OF LOGISTIC REGRESSION COMPARED TO NORMAL DISCRIMINANT-ANALYSIS [J].
EFRON, B .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1975, 70 (352) :892-898
[10]  
Fang K -T., 1990, SYMMETRIC MULTIVARIA