THE EFFECTS OF MODEL SELECTION ON CONFIDENCE-INTERVALS FOR THE SIZE OF A CLOSED POPULATION

被引:58
作者
REGAL, RR
HOOK, EB
机构
[1] Department of Mathematics and Statistics, Univerisity of Minnesota-Duluth, Duluth, Minnesota
[2] School of Public Health, University of California, Berkeley, California
关键词
D O I
10.1002/sim.4780100506
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
One encounters in the literature estimates of some rates of genetic and congenital disorders based on log-linear methods to model possible interactions among sources. Often the analyst chooses the simplest model consistent with the data for estimation of the size of a closed population and calculates confidence intervals on the assumption that this simple model is correct. However, despite an apparent excellent fit of the data to such a model, we note here that the resulting confidence intervals may well be misleading in that they can fail to provide an adequate coverage probability. We illustrate this with a simulation for a hypothetical population based on data reported in the literature from three sources. The simulated nominal 95 per cent confidence intervals contained the modelled population size only 30 per cent of the time. Only if external considerations justify the assumption of plausible interactions of sources would use of the simpler model's interval be justified.
引用
收藏
页码:717 / 721
页数:5
相关论文
共 8 条
[1]  
Wittes J.T., (1970)
[2]  
Wittes J.T., Colton T., Sidel V.W., Capture‐recapture methods for assessing the completeness of case ascertainment when using multiple information sources, Journal of Chronic Diseases, 27, pp. 25-36, (1974)
[3]  
Bishop Y.M.M., Fienberg S.E., Holland P.W., Discrete Multivariate Analysis: Theory and Practice, (1975)
[4]  
Hook E.B., Albright S.G., Cross P.K., Use of Bernoulli census and log‐linear methods for estimating the prevalence of spina bifida in live births and the completeness of vital records in New York State, American Journal of Epidemiology, 112, pp. 750-758, (1980)
[5]  
Pickands J., Raghavachari M., Exact and asymptotic inference for the size of a population, Biometrika, 74, pp. 355-363, (1987)
[6]  
Regal R.R., Hook E.B., Goodness‐of‐fit based confidence intervals for estimates of the size of a closed population, Statistics in Medicine, 3, pp. 287-291, (1984)
[7]  
McCullagh P., Nelder J.A., Generalized Linear Models, (1983)
[8]  
Hurvich C.M., Tsai C., The impact of model selection on inference in linear regression, The American Statistician, 44, pp. 214-217, (1990)