Adaptive feature spaces for land cover classification with limited ground truth data

被引:18
作者
Morgan, JT [1 ]
Ham, J
Crawford, MM
机构
[1] Univ Texas, Ctr Space Res, Austin, TX 78712 USA
[2] Univ Texas, Dept Elect & Comp Engn, Austin, TX 78712 USA
基金
美国国家航空航天局; 美国国家科学基金会;
关键词
multiclass problems; multiple classifier systems; hierarchical classifiers; error correcting output codes; small sample size problem; remote sensing;
D O I
10.1142/S0218001404003411
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification of land cover based on hyperspectral data is very challenging because typically tens of classes with uneven priors are involved, the inputs are high dimensional, and there is often scarcity of labeled data. Several researchers have observed that it is often preferable to decompose a multiclass problem into multiple two-class problems, solve each such subproblem using a suitable binary classifier, and then combine the outputs of this collection of classifiers in a suitable manner to obtain the answer to the original multiclass problem. This approach is taken by the popular error correcting output codes (ECOC) technique, as well by the binary hierarchical classifier (BHC). Classical techniques for dealing with small sample sizes include regularization of covariance matrices and feature reduction. In this paper we address the twin problems of small sample sizes and multiclass settings by proposing a feature reduction scheme that adaptively adjusts to the amount of labeled data available. This scheme can be used in conjunction with ECOC and the BHC, as well as other approaches such as round-robin classification that decompose a multiclass problem into a number of two (meta)class problems. In particular, we develop the best-basis binary hierarchical classifier (BB-BHC) and best basis ECOC (BB-ECOC) families of models that are adapted to "small sample size" situations. Currently, there are few studies that compare the efficacy of different approaches to multiclass problems in general settings as well as in the specific context of small sample sizes. Our experiments on two sets of remote sensing data show that both BB-BHC and BB-ECOC methods are superior to their nonadaptive versions when faced with limited data, with the BB-BHC showing a slight edge in terms of classification accuracy as well as interpretability.
引用
收藏
页码:777 / 799
页数:23
相关论文
共 56 条
[1]   EFFICIENT CLASSIFICATION FOR MULTICLASS PROBLEMS USING MODULAR NEURAL NETWORKS [J].
ANAND, R ;
MEHROTRA, K ;
MOHAN, CK ;
RANKA, S .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1995, 6 (01) :117-124
[2]  
Anderson TW., 1984, INTRO MULTIVARIATE S
[3]  
[Anonymous], 1996, NEURAL NETWORKS TRIC
[4]  
[Anonymous], ADV KERNEL METHODS S
[5]   BACKPROPAGATION USES PRIOR INFORMATION EFFICIENTLY [J].
BARNARD, E ;
BOTHA, EC .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1993, 4 (05) :794-802
[6]  
Bishop C. M., 1996, Neural networks for pattern recognition
[7]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]  
BREIMAN L, 1999, COMBINING ARTIFICIAL, P331
[10]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482