Measuring similarity between geospatial lifelines in studies of environmental health

被引:10
作者
Sinha G. [1 ]
Mark D.M. [1 ]
机构
[1] Department of Geography, Natl. Ctr. for Geogr. Info./Analysis, University at Buffalo, Buffalo
关键词
Geospatial lifelines; Mobile objects; Spatio-temporal clustering;
D O I
10.1007/s10109-005-0153-8
中图分类号
学科分类号
摘要
Many epidemiological studies involve analysis of clusters of diseases to infer locations of environmental hazards that could be responsible for the disease. This approach is however only suitable for sedentary populations or diseases with small latency periods. For migratory populations and diseases with long latency periods, people may change their residential location between time of exposure and onset of ill health. For such situations, clusters are diffused and diluted by in- and out-migration and may become very difficult to detect. One way to address the problem of diffused clusters is to include in analyses not only current residential locations, but all past locations at which cases might have been exposed to environmental hazardous. In this paper, we assume that a person's residential history provides such information and represent it through a discrete geospatial lifeline data model. Clusters of similar geospatial lifelines represent individuals who have similar residential histories - and therefore represent people who are more likely to have had similar environmental exposure histories. We therefore introduce a lifeline distance (dissimilarity) measure to detect clusters of cases, providing a basis for revealing possible regions in space-time where environmental hazards might have existed in the past. The ability of the measure to distinguish cases from controls is tested using two sets of synthetically generated cases and controls. Results indicate that the measure is able to consistently distinguish between populations of cases and controls with statistically significant results. The lifeline distance measure consistently outperforms another measure which uses only the distance between subjects' residences at time of diagnosis. However, the advantages of using the entire residential history are only partly realized, since the ability to distinguish between cases and controls is only moderately better for the lifeline distance function. Future work is needed to investigate modifications to the inter-lifeline distance measure in order to enhance the potential of this approach to detect locations of environmental hazards over the lifespan. © Springer-Verlag Berlin Heidelberg 2005.
引用
收藏
页码:115 / 136
页数:21
相关论文
共 31 条
[1]
Besag J., Newell J., The Detection of Clusters in Rare Diseases, Journal of the Royal Statistical Society, A154, PART 1, pp. 143-155, (1991)
[2]
Bonner M.B., Han D., Nie J., Rogerson P., Vena J., Freudenheim J.L., Positional Accuracy of Geocoded Addresses in Epidemiologic Research, Epidemiology, 14, pp. 408-412, (2003)
[3]
Das G., Mannila H., Finding Similar Time Series, Proceedings of the Conference on Principles of Knowledge Discovery and Data Mining, (1997)
[4]
Duda R.O., Hart P.E., Stork D.G., Pattern Classification, (2001)
[5]
Gibson J.J., The Ecological Approach to Visual Perception, (1979)
[6]
Hagerstrand T., What About People in Regional Science?, Papers of the Regional Science Association, 24, pp. 1-21, (1970)
[7]
Hagerstrand T., The Time-Space Trajectory Model and its Use in the Evaluation of Systems of Transportation, International Conference on Transportation Research, (1976)
[8]
Han D., Geographical epidemiology of breast cancer in Western New York: Exploring spatiotemporal clustering in GIS, (2002)
[9]
Hornsby K., Egenhofer M., Modeling Moving Objects Over Multiple Granularities, Annals of Mathematics and Artificial Intelligence, 36, pp. 177-194, (2002)
[10]
Keogh E., A Fast and Robust Method for Pattern Matching in Time Series Databases, 9th International Conference on Tools With Artificial Intelligence (TAI '97), (1997)