ANALYSIS OF LARGE HEALTH SURVEYS - ACCOUNTING FOR THE SAMPLING DESIGN

Cited by: 90
Authors
KORN, EL
GRAUBARD, BI
Source
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY | 1995 / Vol. 158
Keywords
CAUSALITY; CLUSTERING; DESIGN-BASED INFERENCE; DESIGN EFFECTS; MODEL-BASED INFERENCE; SAMPLE WEIGHTS;
DOI
10.2307/2983292
Chinese Library Classification number
O1 [Mathematics]; C [Social Sciences, General];
Discipline classification code
03 ; 0303 ; 0701 ; 070101 ;
Abstract
Large scale health surveys offer an opportunity to study associations between risk factors and outcomes in a population-based setting. Their complicated multistage sampling designs with differential probabilities of sampling individuals can make their analysis unstraightforward. Classical 'design-based' methods that yield approximately unbiased estimators of associations and standard errors can be highly inefficient. Model-based methods require assumptions which, if wrong, can lead to biased estimators of associations and standard errors. This paper examines the implications of utilizing the sample clustering and sample weights in the analysis of survey data. The approach is to estimate the inefficiency of using these aspects of the sampling design in a design-based analysis when actually it was unnecessary to do so. If the inefficiency is small, then that aspect of the design is used in a design-based fashion. Otherwise, additional modelling assumptions are incorporated into the analysis. By focusing attention on risk factor-outcome associations in large health surveys, specific recommendations for practitioners are given. The issues are demonstrated with real survey data including two controversial analyses previously published in medical references.
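The abstract's idea of gauging how much efficiency is lost by using the sample weights can be illustrated with a rough, widely used approximation. The sketch below computes Kish's approximate design effect from unequal weighting, n * sum(w^2) / (sum(w))^2, and the implied fractional loss of efficiency, 1 - 1/deff. This is an assumption chosen for illustration and is not necessarily the inefficiency estimator used by Korn and Graubard; the function names and the example weights are hypothetical.

```python
def kish_weighting_deff(weights):
    """Kish's approximate design effect due to unequal weighting.

    deff_w = n * sum(w_i^2) / (sum(w_i))^2, which equals 1 when all
    weights are equal and grows as the weights become more variable.
    """
    n = len(weights)
    if n == 0:
        raise ValueError("weights must be non-empty")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    total_sq = sum(w * w for w in weights)
    return n * total_sq / (total * total)


def weighting_inefficiency(weights):
    """Approximate fractional loss of efficiency from weighting: 1 - 1/deff_w.

    Illustrative only: this assumes the weights were unnecessary, so that
    an unweighted analysis would have been valid and more precise.
    """
    return 1.0 - 1.0 / kish_weighting_deff(weights)


if __name__ == "__main__":
    # Hypothetical sample weights, not taken from any survey.
    example_weights = [1.0, 1.0, 2.0, 4.0]
    print("deff_w:", kish_weighting_deff(example_weights))
    print("inefficiency:", weighting_inefficiency(example_weights))
```

A value of the inefficiency near zero would suggest that little is lost by keeping the weights in a design-based analysis; a large value is the situation in which the abstract proposes adding modelling assumptions instead.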
Pages: 263-295
Number of pages: 33