Forest PA: Constructing a decision forest by penalizing attributes used in previous trees

被引:79
作者
Adnan, Md Nasim [1 ]
Islam, Md Zahidul [1 ]
机构
[1] Charles Sturt Univ, Sch Comp & Math, Bathurst, NSW 2795, Australia
关键词
Classification; Decision Tree; Decision Forest; Random Forest; Ensemble Accuracy; CLASSIFIER ENSEMBLES; DIVERSITY; ACCURACY;
D O I
10.1016/j.eswa.2017.08.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a new decision forest algorithm that builds a set of highly accurate decision trees by exploiting the strength of all non-class attributes available in a data set, unlike some existing algorithms that use a subset of the non-class attributes. At the same time to promote strong diversity, the proposed algorithm imposes penalties (disadvantageous weights) to those attributes that participated in the latest tree in order to generate the subsequent trees. Besides, some other weight-related concerns are taken into account so that the trees generated by the proposed algorithm remain individually accurate and retain strong diversity. In order to show the worthiness of the proposed algorithm, we carry out experiments on 20 well known data sets that are publicly available from the UCI Machine Learning Repository. The experimental results indicate that the proposed algorithm is effective in generating highly accurate and more balanced decision forests compared to other prominent decision forest algorithms. Accordingly, the proposed algorithm is expected to be very effective in the domain of expert and intelligent systems. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:389 / 403
页数:15
相关论文
共 60 条
[1]  
Adnan M. N., 2014, P 17 INT C COMP INF
[2]  
Adnan Md.N., 2015, C RES PRACTICE INFOR, V168, P89
[3]   Optimizing the number of trees in a decision forest to discover a subforest with high ensemble accuracy using a genetic algorithm [J].
Adnan, Md Nasim ;
Islam, Md Zahidul .
KNOWLEDGE-BASED SYSTEMS, 2016, 110 :86-97
[4]   Forest CERN: A New Decision Forest Building Technique [J].
Adnan, Md. Nasim ;
Islam, Md. Zahidul .
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT I, 2016, 9651 :304-315
[5]  
Adnan MN, 2014, LECT NOTES ARTIF INT, V8933, P370, DOI 10.1007/978-3-319-14717-8_29
[6]  
Adnan MN., 2015, P ESANN, P385
[7]  
Amasyali M. F., 2014, IEEE T KNOWL DATA EN, V16, P145
[8]  
[Anonymous], 2009, THESIS
[9]  
[Anonymous], 2016, Uci machine learning repository
[10]  
[Anonymous], 2011, Pei. data mining concepts and techniques, DOI 10.1016/C2009-0-61819-5