Investigation of the random forest framework for classification of hyperspectral data

被引:980
作者
Ham, J
Chen, YC
Crawford, MM
Ghosh, J
机构
[1] Univ Texas, Ctr Space Res, Austin, TX 78759 USA
[2] Univ Texas, Dept Elect & Comp Engn, Austin, TX 78712 USA
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2005年 / 43卷 / 03期
关键词
binary hierarchical classifier (BHC); classification; classification and regression trees (CART); Hyperion; hyperspectral; Okavango Delta; random forests; random subspace feature selection; RANDOM SUBSPACE METHOD; COVER;
D O I
10.1109/TGRS.2004.842481
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Statistical classification of hyperspectral data is challenging because the inputs are high in dimension and represent multiple classes that are sometimes quite mixed, while the amount and quality of ground truth in the form of labeled data is typically limited. The resulting classifiers are often unstable and have poor generalization. This paper investigates two approaches based on the concept of random forests of classifiers implemented within a binary hierarchical multiclassifier system, with the goal of achieving improved generalization of the classifier in analysis of hyperspectral data, particularly when the quantity of training data is limited. A new classifier is proposed that incorporates bagging of training samples and adaptive random subspace feature selection within a binary hierarchical classifier (BHC), such that the number of features that is selected at each node of the tree is dependent on the quantity of associated training data. Results are compared to a random forest implementation based on the framework of classification and regression trees. For both methods, classification results obtained from experiments on data acquired by the National Aeronautics and Space Administration (NASA) Airborne Visible/Infrared Imaging Spectrometer instrument over the Kennedy Space Center, Florida, and by Hyperion on the NASA Earth Observing I satellite over the Okavango Delta of Botswana are superior to those from the original best basis BHC algorithm and a random subspace extension of the BHC.
引用
收藏
页码:492 / 501
页数:10
相关论文
共 26 条
[1]   Spectral unmixing of vegetation, soil and dry carbon cover in arid regions: comparing multispectral and hyperspectral observations [J].
Asner, GP ;
Heidebrecht, KB .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2002, 23 (19) :3939-3958
[2]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[3]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
[4]  
BOARDMAN JW, 1994, INT GEOSCI REMOTE SE, P2369, DOI 10.1109/IGARSS.1994.399740
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]  
CRAWFORD M, 2003, 2003 TYRRH INT WORKS
[8]  
Dietterich T. G., 1995, Journal of Artificial Intelligence Research, V2, P263
[9]   Round robin classification [J].
Fürnkranz, J .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (04) :721-747
[10]  
Ho TK, 1998, IEEE T PATTERN ANAL, V20, P832, DOI 10.1109/34.709601