Random walks to identify anomalous free-form spatial scan windows

被引:16
作者
Janeja, Vandana P. [1 ]
Atluri, Vijayalakshmi [2 ,3 ]
机构
[1] Univ Maryland Baltimore Cty, Dept Informat Syst, Baltimore, MD 21250 USA
[2] Rutgers State Univ, Management Sci & Informat Syst Dept, Newark, NJ 07102 USA
[3] Rutgers State Univ, CIMIC, Newark, NJ 07102 USA
基金
美国国家科学基金会;
关键词
spatial scan statistics; random walk; spatial scan window;
D O I
10.1109/TKDE.2008.96
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Often, it is required to identify anomalous windows over a spatial region that reflect unusual rate of occurrence of a specific event of interest. A spatial scan statistic-based approach essentially considers a scan window and computes the statistic of a parameter(s) of interest, and identifies anomalous windows by moving the scan window in the region. While this approach has been successfully employed in identifying anomalous windows, earlier proposals adopting spatial scan statistic suffer from two limitations: 1) In general, the scan window should be of a regular shape (e.g., circle, rectangle, cylinder). Thus, most approaches are capable of identifying anomalous windows of fixed shapes only. However, the region of anomaly, in general, is not necessarily of a regular shape. Recent proposals to identify windows of irregular shapes identify windows much larger than the true anomalies or penalize large-sized windows. 2) These techniques take into account autocorrelation among spatial data but not spatial heterogeneity. As a result, they often result in inaccurate anomalous windows. To address these limitations, in this paper, we propose a random-walk-based Free-Form Spatial Scan Statistic (FS3). We construct a weighted Delaunay nearest neighbor (WDNN) graph to capture both spatial autocorrelation and heterogeneity. We then use random walks to identify natural free-form scan windows that are not restricted to a predefined shape. We use spatial scan statistics to identify anomalous windows and prove that they are not random but indeed are formed as a result of an anomaly. Application of FS3 on real data sets has shown that it can identify more refined anomalous windows with better likelihood ratio of it being an anomaly than those identified by earlier spatial scan statistic approaches.
引用
收藏
页码:1378 / 1392
页数:15
相关论文
共 41 条
[1]  
Anselin L., 2004, Advances in Spatial Econometrics, P1, DOI 10.1007/978-3-662-05617-2_1
[2]  
ATHAS WF, 1993, LOS ALAMOS CANC RATE
[3]  
AURENHAMMER F, 1991, COMPUT SURV, V23, P345, DOI 10.1145/116873.116880
[4]  
Barber M., 1970, RANDOM RESTRICTED WA
[5]  
Barnett V., 1984, Outliers in Statistical Data, V2nd
[6]   THE DETECTION OF CLUSTERS IN RARE DISEASES [J].
BESAG, J ;
NEWELL, J .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1991, 154 :143-155
[7]  
Breunig MM, 1999, LECT NOTES ARTIF INT, V1704, P262
[8]  
Chen YX, 2007, IEEE DATA MINING, P113, DOI 10.1109/ICDM.2007.10
[9]  
*DEP HLTH HUM SERV, 1999, INC MORT WEB BAS REP
[10]   Evaluation of spatial scan statistics for irregularly shaped clusters [J].
Duczmal, L ;
Kulldorff, M ;
Huang, L .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2006, 15 (02) :428-442