Theoretical and empirical analysis of ReliefF and RReliefF

被引:2189
作者
Robnik-Sikonja, M [1 ]
Kononenko, I [1 ]
机构
[1] Univ Ljubljana, Fac Comp & Informat Sci, Ljubljana 1001, Slovenia
关键词
attribute evaluation; feature selection; Relief algorithm; classification; regression;
D O I
10.1023/A:1025667309714
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Relief algorithms are general and successful attribute estimators. They are able to detect conditional dependencies between attributes and provide a unified view on the attribute estimation in regression and classification. In addition, their quality estimates have a natural interpretation. While they have commonly been viewed as feature subset selection methods that are applied in prepossessing step before a model is learned, they have actually been used successfully in a variety of settings, e. g., to select splits or to guide constructive induction in the building phase of decision or regression tree learning, as the attribute weighting method and also in the inductive logic programming. A broad spectrum of successful uses calls for especially careful investigation of various features Relief algorithms have. In this paper we theoretically and empirically investigate and discuss how and why they work, their theoretical and practical properties, their parameters, what kind of dependencies they detect, how do they scale up to large number of examples and features, how to sample data for them, how robust are they regarding the noise, how irrelevant and redundant attributes influence their output and how different metrics influences them.
引用
收藏
页码:23 / 69
页数:47
相关论文
共 40 条
[1]  
[Anonymous], P AN WAR MIN DAT
[2]  
[Anonymous], 1995, P 14 INT JOINT C ART
[3]  
[Anonymous], KNOWLEDGE DISCOVERY
[4]  
[Anonymous], P INT S METH INT SYS
[5]   MULTIDIMENSIONAL BINARY SEARCH TREES USED FOR ASSOCIATIVE SEARCHING [J].
BENTLEY, JL .
COMMUNICATIONS OF THE ACM, 1975, 18 (09) :509-517
[6]  
Breiman L., 1984, BIOMETRICS, DOI DOI 10.2307/2530946
[7]  
Brodley C. E., 1995, Machine Learning. Proceedings of the Twelfth International Conference on Machine Learning, P73
[8]  
Cestnik B, 1987, Progress in Machine Learning, P31
[9]   Modelling the effects of environmental conditions on apparent photosynthesis of Stipa bromoides by machine learning tools [J].
Dalaka, A ;
Kompare, B ;
Robnik-Sikonja, M ;
Sgardelis, SP .
ECOLOGICAL MODELLING, 2000, 129 (2-3) :245-257
[10]  
DENG K, 1995, P 12 INT JOINT C ART, P1233