Detecting and cleaning outliers for robust estimation of variogram models in insect count data

被引:13
作者
Park, Jung-Joon [5 ]
Shin, Key-Il [4 ]
Lee, Joon-Ho [3 ]
Lee, Sung Eun [2 ]
Lee, Woo-Kyun [1 ]
Cho, Kijong [1 ]
机构
[1] Korea Univ, Div Environm Sci & Ecol Engn, Seoul 136701, South Korea
[2] Nanotoxtech Inc, Ansan 426901, South Korea
[3] Seoul Natl Univ, Dept Agr Biotechnol, Entomol Program, Seoul 151921, South Korea
[4] Hankuk Univ Foreign Studies, Dept Stat, Yongin 449791, South Korea
[5] Korea Univ, Inst Life Sci & Nat Resources, Seoul 136701, South Korea
关键词
Variogram models; Spatial additive model; Outlier cleaner; Western flower thrips; Greenhouse whitefly; Box-Cox transformation; GEOSTATISTICS; THYSANOPTERA; RESISTANCE; THRIPIDAE; CORN;
D O I
10.1007/s11284-011-0863-y
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Outlier detection and cleaning procedures were evaluated to estimate mathematical restricted variogram models with discrete insect population count data. Because variogram modeling is significantly affected by outliers, methods to detect and clean outliers from data sets are critical for proper variogram modeling. In this study, we examined spatial data in the form of discrete measurements of insect counts on a rectangular grid. Two well-known insect pest population data were analyzed; one data set was the western flower thrips, Frankliniella occidentalis (Pergande) on greenhouse cucumbers and the other was the greenhouse whitefly, Trialeurodes vaporariorum (Westwood) on greenhouse cherry tomatoes. A spatial additive outlier model was constructed to detect outliers in both the isolated and patchy spatial distributions of outliers, and the outliers were cleaned with the neighboring median cleaner. To analyze the effect of outliers, we compared the relative nugget effects of data cleaned of outliers and data still containing outliers after transformation. In addition, the correlation coefficients between the actual and predicted values were compared using the leave-one-out cross-validation method with data cleaned of outliers and non-cleaned data after unbiased back transformation. The outlier detection and cleaning procedure improved geostatistical analysis, particularly by reducing the nugget effect, which greatly impacts the prediction variance of kriging. Consequently, the outlier detection and cleaning procedures used here improved the results of geostatistical analysis with highly skewed and extremely fluctuating data, such as insect counts.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 61 条
[1]  
[Anonymous], 1995, J Environ Sci, DOI [10.13671/j.hjkxxb.1995.03.001, DOI 10.13671/J.HJKXXB.1995.03.001]
[2]  
[Anonymous], TEMPORAL GEOGRAPHICA
[3]  
[Anonymous], 1989, Applied Geostatistics
[4]   GEOSTATISTICAL METHODS FOR DETECTION OF OUTLIERS IN GROUNDWATER QUALITY SPATIAL FIELDS [J].
BARDOSSY, A ;
KUNDZEWICZ, ZW .
JOURNAL OF HYDROLOGY, 1990, 115 (1-4) :343-359
[5]  
Barnett V., 1994, Outliers in statistical data
[6]  
Bartlett MS, 1936, J STAT SOC S, V3, P22
[7]  
Ben-Gal I, 2005, DATA MINING AND KNOWLEDGE DISCOVERY HANDBOOK, P131, DOI 10.1007/0-387-25465-X_7
[8]   SAMPLING INSECT POPULATIONS FOR THE PURPOSE OF IPM DECISION-MAKING [J].
BINNS, MR ;
NYROP, JP .
ANNUAL REVIEW OF ENTOMOLOGY, 1992, 37 :427-453
[9]  
Birkhoff G., 1967, Amer. Math. Soc. Colloq. Publ., V25
[10]   AN ANALYSIS OF TRANSFORMATIONS [J].
BOX, GEP ;
COX, DR .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1964, 26 (02) :211-252