SLOM: a new measure for local spatial outliers

被引:177
作者
Chawla, S [1 ]
Sun, P [1 ]
机构
[1] Univ Sydney, Sch Informat Technol, Sydney, NSW 2006, Australia
关键词
spatial local outlier; spatial neighbourhood; oscillating parameter; R-trees index; complexity;
D O I
10.1007/s10115-005-0200-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
We propose a measure, spatial local outlier measure (SLOM), which captures the local behaviour of datum in their spatial neighbourhood. With the help of SLOM, we are able to discern local spatial outliers that are usually missed by global techniques, like "three standard deviations away from the mean". Furthermore, the measure takes into account the local stability around a data point and suppresses the reporting of outliers in highly unstable areas, where data are too heterogeneous and the notion of outliers is not meaningful. We prove several properties of SLOM and report experiments on synthetic and real data sets that show that our approach is novel and scalable to large datasets.
引用
收藏
页码:412 / 429
页数:18
相关论文
共 15 条
[1]
AGGARWAL CC, 2001, P 2000 ACM SIGMOD IN
[2]
ANGIULLI F, 2002, P 6 EUR C PRINC PRAC
[3]
[Anonymous], 2003, P 9 ACM SIGKDD INT C
[4]
[Anonymous], 2003, TECHNIQUES
[5]
LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[6]
Hawkins D.M, 1980, IDENTIFICATION OUTLI, V11, DOI [10.1007/978-94-015-3994-4, DOI 10.1007/978-94-015-3994-4]
[7]
Knorr E. M., 1998, Proceedings of the Twenty-Fourth International Conference on Very-Large Databases, P392
[8]
Lu CT, 2003, THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, P597
[9]
Detecting spatial Outliers with multiple attributes [J].
Lu, CT ;
Chen, DC ;
Kou, YF .
15TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, :122-128
[10]
McPhaden M. J., 2002, Encyclopedia Global Environ. Change, V1, P353