Geostatistical analysis of disease data: Visualization and propagation of spatial uncertainty in cancer mortality risk using Poisson kriging and p-field simulation

被引:39
作者
Goovaerts P. [1 ]
机构
[1] BioMedware Inc., Ann Arbor, MI
关键词
Pancreatic Cancer; Spatial Uncertainty; Kriging Variance; Spatial Outlier; Cancer Mortality Risk;
D O I
10.1186/1476-072X-5-7
中图分类号
学科分类号
摘要
Background: Smoothing methods have been developed to improve the reliability of risk cancer estimates from sparsely populated geographical entities. Filtering local details of the spatial variation of the risk leads however to the detection of larger clusters of low or high cancer risk while most spatial outliers are filtered out. Static maps of risk estimates and the associated prediction variance also fail to depict the uncertainty attached to the spatial distribution of risk values and does not allow its propagation through local cluster analysis. This paper presents a geostatistical methodology to generate multiple realizations of the spatial distribution of risk values. These maps are then fed into spatial operators, such as in local cluster analysis, allowing one to assess how risk spatial uncertainty translates into uncertainty about the location of spatial clusters and outliers. This novel approach is applied to age-adjusted breast and pancreatic cancer mortality rates recorded for white females in 295 US counties of the Northeast (1970-1994). A public-domain executable with example datasets is provided. Results: Geostatistical simulation generates risk maps that are more variable than the smooth risk map estimated by Poisson kriging and reproduce better the spatial pattern captured by the risk semivariogram model. Local cluster analysis of the set of simulated risk maps leads to a clear visualization of the lower reliability of the classification obtained for pancreatic cancer versus breast cancer: only a few counties in the large cluster of low risk detected in West Virginia and Southern Pennsylvania are significant over 90% of all simulations. On the other hand, the cluster of high breast cancer mortality in Niagara county, detected after application of Poisson kriging, appears on 60% of simulated risk maps. Sensitivity analysis shows that 500 realizations are needed to achieve a stable classification for pancreatic cancer, while convergence is reached for less than 300 realizations for breast cancer. Conclusion: The approach presented in this paper enables researchers to generate a set of simulated risk maps that are more realistic than a single map of smoothed mortality rates and allow the propagation of cancer risk uncertainty through local cluster analysis. Coupled with visualization and querying capabilities of geographical information systems, animated display of realizations can highlight areas that depart consistently from the general behavior observed across the region, guiding further investigation and control activities. © 2006 Goovaerts; licensee BioMed Central Ltd.
引用
收藏
页数:26
相关论文
共 38 条
[1]  
Mungiole M., Pickle L.W., Hansen Simonson K., Application of a weighted head-banging algorithm to mortality data maps, Statistics in Medicine, 18, pp. 3201-3209, (1999)
[2]  
Pickle L.W., Spatial analysis of disease, Biostatistical Applications in Cancer Research Volume Chapter 7, pp. 113-150, (2002)
[3]  
Waller L.A., Gotway C.A., Applied Spatial Statistics for Public Health Data, (2004)
[4]  
Christensen O.F., Waagepetersen R., Bayesian prediction of spatial count data using generalized linear mixed models, Biometrics, 58, pp. 280-286, (2002)
[5]  
Best N., Richardson S., Thomson A., A comparison of Bayesian spatial models for disease mapping, Statistical Methods in Medical Research, 14, pp. 35-59, (2005)
[6]  
Clayton D.G., Kaldor J., Empirical Bayes estimates of age-standardized relative risks for use in disease mapping, Biometrics, 43, pp. 671-681, (1987)
[7]  
Marshall R.J., Mapping disease and mortality rates using empirical Bayes estimators, Applied Statistics, 40, 2, pp. 283-294, (1991)
[8]  
Goovaerts P., Geostatistical analysis of disease data: Estimation of cancer mortality risk from empirical frequencies using Poisson kriging, International Journal of Health Geographics, 4, (2005)
[9]  
Goovaerts P., Jacquez G.M., Accounting for regional background and population size in the detection of spatial clusters and outliers using geostatistical filtering and spatial neutral models: The case of lung cancer in Long Island, New York, International Journal of Health Geographics, 3, (2004)
[10]  
Clayton D.G., Bernardinelli L., Bayesian methods for mapping disease risk, Geographical and Environmental Epidemiology: Methods for Small-Area Studies, pp. 265-276, (1992)