Lexicographic multi-objective evolutionary induction of decision trees

被引:25
作者
Basgalupp, Marcia P. [1 ]
de Carvalho, Andre C. P. L. F. [1 ]
Barros, Rodrigo C. [2 ]
Ruiz, Duncan D. [2 ]
Freitas, Alex A. [3 ]
机构
[1] Univ Sao Paulo, Inst Math Sci & Comp, BR-668 Sao Carlos, SP, Brazil
[2] Pontifical Catholic Univ RS, Fac Informat, BR-90619900 Porto Alegre, RS, Brazil
[3] Univ Kent, Comp Lab, Canterbury CT2 7NZ, Kent, England
基金
巴西圣保罗研究基金会;
关键词
lexicographic multi-objective genetic algorithms; decision tree induction; classification tasks; data mining;
D O I
10.1504/IJBIC.2009.022779
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Among the several tasks that evolutionary algorithms have successfully employed, the induction of classification rules and decision trees has been shown to be a relevant approach for several application domains. Decision tree induction algorithms represent one of the most popular techniques for dealing with classification problems. However, conventionally used decision trees induction algorithms present limitations due to the strategy they usually implement: recursive top-down data partitioning through a greedy split evaluation. The main problem with this strategy is quality loss during the partitioning process, which can lead to statistically insignificant rules. In this paper, we propose a new GA-based algorithm for decision tree induction. The proposed algorithm aims to prevent the greedy strategy and to avoid converging to local optima. For such, it is based on a lexicographic multi-objective approach. In order to evaluate the proposed algorithm, it is compared with a well-known and frequently used decision tree induction algorithm using different public datasets. According to the experimental results, the proposed algorithm is able to avoid the previously described problems, reporting accuracy gains. Even more important, the proposed algorithm induced models with a significantly reduction in the complexity considering tree sizes.
引用
收藏
页码:105 / 117
页数:13
相关论文
共 36 条
[1]  
[Anonymous], 1993, C4.5: Programs for machine learning
[2]  
[Anonymous], 2001, An Introduction to Genetic Algorithms. Complex Adaptive Systems
[3]  
[Anonymous], 2006, Evolutionary Algorithms for Solving Multi-Objective Problems (Genetic and Evolutionary Computation)
[4]  
Bandar Z., 1999, ICONIP'99. ANZIIS'99 & ANNES'99 & ACNN'99. 6th International Conference on Neural Information Processing. Proceedings (Cat. No.99EX378), P429, DOI 10.1109/ICONIP.1999.845633
[5]  
Bot MCJ, 2000, LECT NOTES COMPUT SC, V1802, P247
[6]  
Breiman L., 1984, BIOMETRICS, V40, P874, DOI 10.1201/9781315139470
[7]  
Buntine W., 1993, ARTIF, P182, DOI [10.1007/978-1-4899-4537-2_15, DOI 10.1007/978-1-4899-4537-2_15]
[8]  
Cantu-Paz E., 2000, Proc. Genetic Evolutionary Computation Conf, San Francisco, P1053
[9]   A hybrid decision tree/genetic algorithm method for data mining [J].
Carvalho, DR ;
Freitas, AA .
INFORMATION SCIENCES, 2004, 163 (1-3) :13-35
[10]   The role of occam's razor in knowledge discovery [J].
Domingos, P .
DATA MINING AND KNOWLEDGE DISCOVERY, 1999, 3 (04) :409-425