A study of statistical techniques and performance measures for genetics-based machine learning: accuracy and interpretability

被引:602
作者
Garcia, S. [1 ]
Fernandez, A. [2 ]
Luengo, J. [2 ]
Herrera, F. [2 ]
机构
[1] Univ Jaen, Dept Comp Sci, Jaen 23071, Spain
[2] Univ Granada, Dept Comp Sci & Artificial Intelligence, E-18071 Granada, Spain
关键词
Genetics-based machine learning; Genetic algorithms; Statistical tests; Non-parametric tests; Cohen's kappa; Interpretability; Classification; COEVOLUTIONARY ALGORITHM; CLASSIFIER SYSTEMS; COMPLEXITY; INTERVALS; ROC;
D O I
10.1007/s00500-008-0392-y
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
The experimental analysis on the performance of a proposed method is a crucial and necessary task to carry out in a research. This paper is focused on the statistical analysis of the results in the field of genetics-based machine Learning. It presents a study involving a set of techniques which can be used for doing a rigorous comparison among algorithms, in terms of obtaining successful classification models. Two accuracy measures for multi-class problems have been employed: classification rate and Cohen's kappa. Furthermore, two interpretability measures have been employed: size of the rule set and number of antecedents. We have studied whether the samples of results obtained by genetics-based classifiers, using the performance measures cited above, check the necessary conditions for being analysed by means of parametrical tests. The results obtained state that the fulfillment of these conditions are problem-dependent and indefinite, which supports the use of non-parametric statistics in the experimental analysis. In addition, non-parametric tests can be satisfactorily employed for comparing generic classifiers over various data-sets considering any performance measure. According to these facts, we propose the use of the most powerful non-parametric statistical tests to carry out multiple comparisons. However, the statistical analysis conducted on interpretability must be carefully considered.
引用
收藏
页码:959 / 977
页数:19
相关论文
共 47 条
[1]
AGUILARRUIZ JS, 2000, IEEE T EVOLUT COMPUT, V11, P466
[2]
KEEL: a software tool to assess evolutionary algorithms for data mining problems [J].
Alcala-Fdez, J. ;
Sanchez, L. ;
Garcia, S. ;
del Jesus, M. J. ;
Ventura, S. ;
Garrell, J. M. ;
Otero, J. ;
Romero, C. ;
Bacardit, J. ;
Rivas, V. M. ;
Fernandez, J. C. ;
Herrera, F. .
SOFT COMPUTING, 2009, 13 (03) :307-318
[3]
ALPAYDIN E, 2004, INTRO MACHINE LEARNI, V452
[4]
NOW G-Net: Learning classification programs on networks of workstations [J].
Anglano, C ;
Botta, M .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2002, 6 (05) :463-480
[5]
Asuncion Arthur, 2007, Uci machine learning repository
[6]
Bacardit J, 2004, LECT NOTES COMPUT SC, V3103, P726
[7]
Bacardit J, 2003, LECT NOTES COMPUT SC, V2724, P1818
[8]
BACARDIT J, 2007, LNCS, V4399, P61
[9]
Bacardit J., 2004, PITTSBURGH GENETIC B
[10]
Strategies for learning in class imbalance problems [J].
Barandela, R ;
Sánchez, JS ;
García, V ;
Rangel, E .
PATTERN RECOGNITION, 2003, 36 (03) :849-851