On detecting spatial outliers

被引:67
作者
Chen, Dechang [2 ]
Lu, Chang-Tien [1 ]
Kou, Yufeng [1 ]
Chen, Feng [1 ]
机构
[1] Virginia Polytech Inst & State Univ, Dept Comp Sci, Falls Church, VA 22043 USA
[2] Uniformed Serv Univ Hlth Sci, Dept Prevent Med & Biometr, Bethesda, MD 20814 USA
关键词
algorithm; outlier detection; spatial data mining;
D O I
10.1007/s10707-007-0038-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The ever-increasing volume of spatial data has greatly challenged our ability to extract useful but implicit knowledge from them. As an important branch of spatial data mining, spatial outlier detection aims to discover the objects whose non-spatial attribute values are significantly different from the values of their spatial neighbors. These objects, called spatial outliers, may reveal important phenomena in a number of applications including traffic control, satellite image analysis, weather forecast, and medical diagnosis. Most of the existing spatial outlier detection algorithms mainly focus on identifying single attribute outliers and could potentially misclassify normal objects as outliers when their neighborhoods contain real spatial outliers with very large or small attribute values. In addition, many spatial applications contain multiple non-spatial attributes which should be processed altogether to identify outliers. To address these two issues, we formulate the spatial outlier detection problem in a general way, design two robust detection algorithms, one for single attribute and the other for multiple attributes, and analyze their computational complexities. Experiments were conducted on a real-world data set, West Nile virus data, to validate the effectiveness of the proposed algorithms.
引用
收藏
页码:455 / 475
页数:21
相关论文
共 45 条
[41]  
Wong WK, 2002, EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, P217
[42]   Bayesian Ying-Yang machine, clustering and number of clusters [J].
Xu, L .
PATTERN RECOGNITION LETTERS, 1997, 18 (11-13) :1167-1178
[43]   On-line unsupervised outlier detection using finite mixtures with discounting learning algorithms [J].
Yamanishi, K ;
Takeuchi, JI ;
Williams, G ;
Milne, P .
DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 8 (03) :275-300
[44]  
Zanero S., 2004, P 2004 ACM S APPL CO, P412, DOI DOI 10.1145/967900.967988
[45]  
Zhang T., 1996, Proceedings of the 1996 ACM SIGMOD international conference on Management of data, V25, P103, DOI [DOI 10.1145/235968.233324, 10.1145/235968.233324]