On detecting spatial outliers

Cited by: 67
Authors
Chen, Dechang [2 ]
Lu, Chang-Tien [1 ]
Kou, Yufeng [1 ]
Chen, Feng [1 ]
Affiliations
[1] Virginia Polytech Inst & State Univ, Dept Comp Sci, Falls Church, VA 22043 USA
[2] Uniformed Serv Univ Hlth Sci, Dept Prevent Med & Biometr, Bethesda, MD 20814 USA
Keywords
algorithm; outlier detection; spatial data mining
DOI
10.1007/s10707-007-0038-8
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
The ever-increasing volume of spatial data has greatly challenged our ability to extract useful but implicit knowledge from it. As an important branch of spatial data mining, spatial outlier detection aims to discover objects whose non-spatial attribute values differ significantly from those of their spatial neighbors. These objects, called spatial outliers, may reveal important phenomena in applications including traffic control, satellite image analysis, weather forecasting, and medical diagnosis. Most existing spatial outlier detection algorithms focus on identifying single-attribute outliers and can misclassify normal objects as outliers when their neighborhoods contain true spatial outliers with very large or very small attribute values. In addition, many spatial applications involve multiple non-spatial attributes that should be processed together to identify outliers. To address these two issues, we formulate the spatial outlier detection problem in a general way, design two robust detection algorithms, one for a single attribute and the other for multiple attributes, and analyze their computational complexity. Experiments on a real-world data set, West Nile virus data, validate the effectiveness of the proposed algorithms.
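The single-attribute idea in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm, which the abstract does not spell out. It assumes a k-nearest-neighbor neighborhood, the median of the neighbors' values as a robust neighborhood summary, and a median/MAD standardization of the differences. The function names `knn_indices` and `spatial_outlier_scores`, the parameter `k`, and the toy grid are all hypothetical.

```python
import math
from statistics import median


def knn_indices(points, i, k):
    """Indices of the k points closest to points[i], excluding i itself."""
    ranked = sorted(
        (math.dist(points[i], points[j]), j)
        for j in range(len(points))
        if j != i
    )
    return [j for _, j in ranked[:k]]


def spatial_outlier_scores(points, values, k=4):
    """Robust score per object: how far its value is from its neighbors' median.

    Assumption: the median (not the mean) summarizes a neighborhood, so one
    extreme neighbor does not drag normal objects toward outlier status.
    """
    diffs = []
    for i in range(len(points)):
        neighbor_values = [values[j] for j in knn_indices(points, i, k)]
        diffs.append(values[i] - median(neighbor_values))

    # Standardize with median and MAD instead of mean and standard deviation.
    center = median(diffs)
    mad = median([abs(d - center) for d in diffs])
    scale = 1.4826 * mad if mad > 0 else 1.0  # 1.4826: MAD-to-sigma factor
    return [abs(d - center) / scale for d in diffs]


# Toy data (hypothetical): a 5 x 5 grid with a smooth trend and one spike.
points = [(x, y) for x in range(5) for y in range(5)]
values = [10 + 0.1 * (x + y) for x, y in points]
values[12] = 50.0  # object at grid position (2, 2)

scores = spatial_outlier_scores(points, values, k=4)
top = max(range(len(scores)), key=scores.__getitem__)
print(top, points[top])
```

In this toy grid the spiked object receives by far the largest score, while its four direct neighbors keep low scores because the median ignores the single extreme value in their neighborhoods. A mean-based neighborhood summary would inflate the neighbors' scores, which is the misclassification problem the abstract describes.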
Pages: 455-475
Number of pages: 21