The Essential Role of Pair Matching in Cluster-Randomized Experiments, with Application to the Mexican Universal Health Insurance Evaluation

被引:155
作者
Imai, Kosuke [1 ]
King, Gary
Nall, Clayton [2 ]
机构
[1] Princeton Univ, Dept Polit, Princeton, NJ 08544 USA
[2] Harvard Univ, Inst Quantitat Social Sci, Dept Govt, Cambridge, MA 02138 USA
关键词
Causal inference; community intervention trials; field experiments; group-randomized trials; place-randomized trials; health policy; matched-pair design; noncompliance; power; COMMUNITY INTERVENTION TRIALS; INFERENCE; DESIGN; ADJUSTMENT; MERITS; IDENTIFICATION; NONCOMPLIANCE; EXTENSION; CONSORT; POLICY;
D O I
10.1214/08-STS274
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A basic feature of many field experiments is that investigators are only able to randomize clusters of individuals-such as households, communities, firms, medical practices, schools or classrooms-even when the individual is the unit of interest. To recoup the resulting efficiency loss, some studies pair similar clusters and randomize treatment within pairs. However, many other studies avoid pairing, in part because of claims in the literature, echoed by clinical trials standards organizations, that this matched-pair, cluster-randomization design has serious problems. We argue that all such claims are unfounded. We also prove that the estimator recommended for this design in the literature is unbiased only in situations when matching is unnecessary; its standard error is also invalid. To overcome this problem without modeling assumptions, we develop a simple design-based estimator with much improved statistical properties. We also propose a model-based approach that includes some of the benefits of our design-based estimator as well as the estimator in the literature. Our methods also address ndividual-level noncompliance, which is common in applications but not allowed for in most existing methods. We show that from the perspective of bias, efficiency, power, robustness or research costs, and in large or small samples, pairing should be used in cluster-randomized experiments whenever feasible; failing to do so is equivalent to discarding a considerable fraction of one's data. We develop these techniques in the context of a randomized evaluation we are conducting of the Mexican Universal Health Insurance Program.
引用
收藏
页码:29 / 72
页数:44
相关论文
共 81 条
[1]  
Angrist JD, 1996, J AM STAT ASSOC, V91, P444, DOI 10.2307/2291629
[2]  
[Anonymous], 1990, Statistical Science, DOI DOI 10.1214/SS/1177012031
[3]  
[Anonymous], 9389 NAT BUR EC RES
[4]  
[Anonymous], 2006, CORE ANAL RANDOMIZED
[5]  
[Anonymous], MRC CLIN TRIALS SER
[6]   Using cluster randomized field experiments to study voting behavior [J].
Arceneaux, K .
ANNALS OF THE AMERICAN ACADEMY OF POLITICAL AND SOCIAL SCIENCE, 2005, 601 :169-179
[7]   ESTIMATING EFFECT OF TRAINING-PROGRAMS ON EARNINGS [J].
ASHENFELTER, O .
REVIEW OF ECONOMICS AND STATISTICS, 1978, 60 (01) :47-57
[8]  
BALL S, 1972, PR722 ED TEST SERV
[9]  
Box G.E., 1978, STAT EXPT
[10]   Optimal permutation tests for the analysis of group randomized trials [J].
Braun, TM ;
Feng, ZD .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1424-1432