A SPLIT QUESTIONNAIRE SURVEY DESIGN

被引:111
作者
RAGHUNATHAN, TE [1 ]
GRIZZLE, JE [1 ]
机构
[1] FRED HUTCHINSON CANC RES CTR,CANC PREVENT RES PROGRAM,SEATTLE,WA 98104
关键词
GIBBS SAMPLING; MULTIPLE IMPUTATION; NONRESPONSE; RESPONDER BURDEN;
D O I
10.2307/2291129
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This article develops a survey design where the questionnaire is split into components and individuals are administered the varying subsets of the components. A multiple imputation method for analyzing data from this design is developed, in which the imputations are created by random draws from the posterior predictive distribution of the missing parts, given the observed parts by using Gibbs sampling under a general location scale model. Results from two simulation studies that investigate the properties of the inferences using this design are reported. In the first study several random split questionnaire designs are imposed on the complete data from an existing survey collected using a long questionnaire, and the corresponding data elements are extracted to form split data sets. Inferences obtained using the complete data and the split data are then compared. This comparison suggests that little is lost, at least in the example considered, by administering only parts of the questionnaire to each sampled individual. The second simulation study reports on the investigation of the efficiency of the split questionnaire design and the robustness of the estimates to the distributional assumptions used to create imputations. In this study several complete and split data sets were generated under a variety of distributional assumptions, and the imputations for the split data sets were created assuming the normality of the distributions. The sampling properties of the point and interval estimates of the regression coefficient in a particular logistic regression model using both the complete and split data sets were compared. This comparison suggests that the loss in efficiency of the split questionnaire design decreases as the correlation among the variables that are within different parts increases. The proposed multiple imputation method seems to be sensitive to the skewness and relatively insensitive to the kurtosis, contrary to the assumed normality of the distribution for the observables.
引用
收藏
页码:54 / 63
页数:10
相关论文
共 30 条
[1]   SOLVING THE QUANDARY BETWEEN QUESTIONNAIRE LENGTH AND RESPONSE RATE IN EDUCATIONAL-RESEARCH [J].
ADAMS, LL ;
GALE, D .
RESEARCH IN HIGHER EDUCATION, 1982, 17 (03) :231-240
[2]  
Bishop Y, 1975, DISCRETE MULTIVARIAT
[3]   EFFECTS OF QUESTIONNAIRE LENGTH, RESPONDENT-FRIENDLY DESIGN, AND A DIFFICULT QUESTION ON RESPONSE RATES FOR OCCUPANT-ADDRESSED CENSUS MAIL SURVEYS [J].
DILLMAN, DA ;
SINCLAIR, MD ;
CLARK, JR .
PUBLIC OPINION QUARTERLY, 1993, 57 (03) :289-304
[4]  
DIXON WJ, 1988, BMDP STATISTICAL SOF, V2
[5]  
GELFAND A. E., 1992, BAYESIAN STATISTICS, V4, P147
[6]   SAMPLING-BASED APPROACHES TO CALCULATING MARGINAL DENSITIES [J].
GELFAND, AE ;
SMITH, AFM .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1990, 85 (410) :398-409
[7]  
Gelman A, 1992, STAT SCI, V7, P457, DOI [DOI 10.1214/SS/1177011136, 10.1214/ss/1177011136]
[8]  
GILKS WR, 1992, J R STAT SOC C-APPL, V41, P337
[9]  
GRIZZLE JE, 1969, BIOMETRICS, V29, P489
[10]  
HERZOG AR, 1980, PUBLIC OPIN QUART, V45, P549