Forest CERN: A New Decision Forest Building Technique

被引:20
作者
Adnan, Md. Nasim [1 ]
Islam, Md. Zahidul [1 ]
机构
[1] Charles Sturt Univ, Sch Comp & Math, Bathurst, NSW 2795, Australia
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT I | 2016年 / 9651卷
关键词
Decision tree; Decision forest; Ensemble accuracy;
D O I
10.1007/978-3-319-31753-3_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Persistent efforts are going on to propose more accurate decision forest building techniques. In this paper, we propose a new decision forest building technique called "Forest by Continuously Excluding Root Node (Forest CERN)". The key feature of the proposed technique is that it strives to exclude attributes that participated in the root nodes of previous trees by imposing penalties on them to obstruct them appear in some subsequent trees. Penalties are gradually lifted in such a manner that those attributes can reappear after a while. Other than that, our technique uses bootstrap samples to generate predefined number of trees. The target of the proposed algorithm is to maximize tree diversity without impeding individual tree accuracy. We present an elaborate experimental results involving fifteen widely used data sets from the UCI Machine Learning Repository. The experimental results indicate the effectiveness of the proposed technique in most of the cases.
引用
收藏
页码:304 / 315
页数:12
相关论文
共 22 条
[11]   Extremely randomized trees [J].
Geurts, P ;
Ernst, D ;
Wehenkel, L .
MACHINE LEARNING, 2006, 63 (01) :3-42
[12]  
Ho TK, 1998, IEEE T PATTERN ANAL, V20, P832, DOI 10.1109/34.709601
[13]  
Hu H, 2006, WISB '06, P35
[14]  
Islam Z, 2011, P 9 AUSTR DAT MIN C
[15]   CAIM discretization algorithm [J].
Kurgan, LA ;
Cios, KJ .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (02) :145-153
[16]  
Li JY, 2003, THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, P585
[17]   Out-of-bag estimation of the optimal sample size in bagging [J].
Martinez-Munoz, Gonzalo ;
Suarez, Alberto .
PATTERN RECOGNITION, 2010, 43 (01) :143-152
[18]  
Polikar R., 2006, IEEE Circuits and Systems Magazine, V6, P21, DOI 10.1109/MCAS.2006.1688199
[19]  
Quinlan J.R., 1993, C4 5 PROGRAMS MACHIN, V1
[20]   Rotation forest:: A new classifier ensemble method [J].
Rodriguez, Juan J. ;
Kuncheva, Ludmila I. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (10) :1619-1630