Nearest-Neighbor Guided Evaluation of Data Reliability and Its Applications

被引:54
作者
Boongoen, Tossapon [1 ]
Shen, Qiang [1 ]
机构
[1] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3DB, Dyfed, Wales
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2010年 / 40卷 / 06期
基金
英国工程与自然科学研究理事会;
关键词
Alias detection; data reliability; nearest neighbor; ordered weighted averaging (OWA) aggregation; unsupervised feature selection; weight determination; FEATURE-SELECTION; DIMENSIONALITY REDUCTION; CONSENSUS; ALGORITHMS; SEARCH;
D O I
10.1109/TSMCB.2010.2043357
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The intuition of data reliability has recently been incorporated into the main stream of research on ordered weighted averaging (OWA) operators. Instead of relying on human-guided variables, the aggregation behavior is determined in accordance with the underlying characteristics of the data being aggregated. Data-oriented operators such as the dependent OWA (DOWA) utilize centralized data structures to generate reliable weights, however. Despite their simplicity, the approach taken by these operators neglects entirely any local data structure that represents a strong agreement or consensus. To address this issue, the cluster-based OWA (Clus-DOWA) operator has been proposed. It employs a cluster-based reliability measure that is effective to differentiate the accountability of different input arguments. Yet, its actual application is constrained by the high computational requirement. This paper presents a more efficient nearest-neighbor-based reliability assessment for which an expensive clustering process is not required. The proposed measure can be perceived as a stress function, from which the OWA weights and associated decision-support explanations can be generated. To illustrate the potential of this measure, it is applied to both the problem of information aggregation for alias detection and the problem of unsupervised feature selection (in which unreliable features are excluded from an actual learning process). Experimental results demonstrate that these techniques usually outperform their conventional state-of-the-art counterparts.
引用
收藏
页码:1622 / 1633
页数:12
相关论文
共 69 条
[61]   Centered OWA operators [J].
Yager, R. R. .
SOFT COMPUTING, 2007, 11 (07) :631-639
[62]   Using stress functions to obtain OWA operators [J].
Yager, Ronald R. .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2007, 15 (06) :1122-1129
[63]  
Yager RR, 1996, INT J INTELL SYST, V11, P49, DOI 10.1002/(SICI)1098-111X(199601)11:1<49::AID-INT3>3.3.CO
[64]  
2-L
[65]   ON ORDERED WEIGHTED AVERAGING AGGREGATION OPERATORS IN MULTICRITERIA DECISION-MAKING [J].
YAGER, RR .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1988, 18 (01) :183-190
[66]   FAMILIES OF OWA OPERATORS [J].
YAGER, RR .
FUZZY SETS AND SYSTEMS, 1993, 59 (02) :125-148
[67]  
ZADEH LA, 1975, INFORM SCIENCES, V8, P199, DOI [10.1016/0020-0255(75)90036-5, 10.1016/0020-0255(75)90046-8]
[68]   Constraint Score: A new filter method for feature selection with pairwise constraints [J].
Zhang, Daoqiang ;
Chen, Songcan ;
Zhou, Zhi-Hua .
PATTERN RECOGNITION, 2008, 41 (05) :1440-1451
[69]   Locality sensitive semi-supervised feature selection [J].
Zhao, Jidong ;
Lu, Ke ;
He, Xiaofei .
NEUROCOMPUTING, 2008, 71 (10-12) :1842-1849