HYPOTHESIS-TESTING WITH COMPLEX SURVEY DATA - THE USE OF CLASSICAL QUADRATIC TEST STATISTICS WITH PARTICULAR REFERENCE TO REGRESSION PROBLEMS

被引:18
作者
GRAUBARD, BI [1 ]
KORN, EL [1 ]
机构
[1] NCI,BIOMETR RES BRANCH,BETHESDA,MD 20892
关键词
BALANCED HALF-SAMPLE REPEATED REPLICATION; COMPLEX SURVEY SAMPLE DATA; FAY JACKNIFE CHI-SQUARED TEST; MULTIPLE LINEAR REGRESSION; RAO-SCOTT TESTS; WALD TEST;
D O I
10.2307/2290345
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Sample surveys often have complex sample designs with multistage cluster sampling, stratification, and differential selection probabilities. This article is concerned with testing the null hypothesis H-0: theta = 0, where the p-dimensional parameter theta = g(mu) and mu is a q-dimensional vector of means. The asymptotic framework that consists of a sequence of increasing finite populations is used to define mu as the limit of finite population means. As part of the inference, we use replicated estimates of variances that take into account the complex sample design. The Wald statistic can be used to test H-0. But inference for theta based on the Wald statistic can have low power. Thus an alternative to using a Wald test is pursued in this article. First, define a classical quadratic test statistic that would be used if one had a simple random sample of the population. Second, treating this quadratic form as a population parameter, use design-based methods to estimate it from the observed survey data. Last, use a replication method to approximate the distribution of this estimated quadratic form to perform the hypothesis test. Specific applications of this general approach have been used previously in contingency table analysis. For small numbers of sampled first-stage clusters and large p, modified versions of the Fay procedure are proposed. Simulations show that these modified procedures maintain nominal levels better than the original Fay and the Rao-Scott procedures for testing a vector of means and a vector of regression coefficients. An application is given for testing whether design-based regression coefficients differ from ordinary least squares regression coefficients.
引用
收藏
页码:629 / 641
页数:13
相关论文
共 23 条
[1]  
BEAN JA, 1975, VITAL HLTH STATIS 65, V2
[2]  
Cochran WG., 1963, SAMPLING TECHNIQUES, V2nd ed
[3]   USING SAMPLE SURVEY WEIGHTS IN MULTIPLE-REGRESSION ANALYSES OF STRATIFIED SAMPLES [J].
DUMOUCHEL, WH ;
DUNCAN, GJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1983, 78 (383) :535-543
[4]  
Efron B, 1982, JACKKNIFE BOOTSTRAP, DOI 10.1137/1.9781611970319
[6]  
Frankel M. R., 1971, INFERENCE SURVEY SAM
[7]  
Fuller W.A., 1984, SURV METHODOL, V10, P97
[8]  
GRAUBARD BI, 1991, THESSI U MARYLAND
[9]  
JOHNSON NL, 1970, DISTRIBUTIONS STATIS, V2
[10]   STRATEGIES IN MULTIVARIATE-ANALYSIS OF DATA FROM COMPLEX SURVEYS [J].
KOCH, GG ;
FREEMAN, DH ;
FREEMAN, JL .
INTERNATIONAL STATISTICAL REVIEW, 1975, 43 (01) :59-78