A fuzzy vector valued KNN-algorithm for automatic outlier detection

被引:42
作者
Ostermark, Ralf [1 ]
机构
[1] Abo Akad Univ, Turku 20900, Finland
关键词
K nearest neighbors; Fuzzy distance; Outlier detection; Genetic search; Geno-mathematical programming; LINEAR-REGRESSION; MODEL;
D O I
10.1016/j.asoc.2009.03.009
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
The K nearest neighbors approach is a viable technique in time series analysis when dealing with ill-conditioned and possibly chaotic processes. Such problems are frequently encountered in, e. g., finance and production economics. More often than not, the observed processes are distorted by nonnormal disturbances, incomplete measurements, etc. that undermine the identification, estimation and performance of multivariate techniques. If outliers can be duly recognized, many crisp statistical techniques may perform adequately as such. Geno-mathematical programming provides a connection between statistical time series theory and fuzzy regression models that may be utilized e. g., in the detection of outliers. In this paper we propose a fuzzy distance measure for detecting outliers via geno-mathematical parametrization. Fuzzy KNN is connected as a linkable library to the genetic hybrid algorithm (GHA) of the author, in order to facilitate the determination of the LR-type fuzzy number for automatic outlier detection in time series data. We demonstrate that GHA[Fuzzy KNN] provides a platform for automatically detecting outliers in both simulated and real world data. (C) 2009 Elsevier B. V. All rights reserved.
引用
收藏
页码:1263 / 1272
页数:10
相关论文
共 43 条
[1]
Abdi Herve., 2007, ENCY MEASUREMENT STA, P1
[2]
NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[3]
[Anonymous], 1936, P NATL I SCI ENCES, DOI [DOI 10.1007/S13171-019-00164-5, 10.1007/s13171-019-00164-5]
[4]
[Anonymous], 1991, Introduction to Fuzzy Arithmetic
[5]
Aoki M., 1990, State space modeling of time series, V2nd
[6]
BROCKWELL P, 2002, INTRO TIME SERIES FO, V36
[7]
Burago D., 2001, CRM P LECT NOTES
[8]
A theoretical development on a fuzzy distance measure for fuzzy numbers [J].
Chakraborty, C ;
Chakraborty, D .
MATHEMATICAL AND COMPUTER MODELLING, 2006, 43 (3-4) :254-261
[9]
CHANG PT, 1994, PROCEEDINGS OF THE THIRD IEEE CONFERENCE ON FUZZY SYSTEMS - IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, VOLS I-III, P1365, DOI 10.1109/FUZZY.1994.343613