A genetic-algorithm-based selective principal component analysis (GA-SPCA) method for high-dimensional data feature extraction

被引:54
作者
Yao, HB [1 ]
Tian, L [1 ]
机构
[1] Univ Illinois, Dept Agr Engn, Urbana, IL 61801 USA
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2003年 / 41卷 / 06期
关键词
feature extraction; genetic algorithm; hyperspectral image; selective principal component analysis; supervised dimension reduction;
D O I
10.1109/TGRS.2003.811691
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In this paper, a genetic-algorithm-based selective principal component analysis (GA-SPCA) method is proposed and tested using hyperspectral remote sensing data and ground reference data collected within an agricultural field. The proposed method uses a global optimizer, the genetic algorithms, to select a subset of the original image bands, which first reduces the data dimension. A principal component transformation is subsequently applied to the selected bands. By extracting features from the resulting eigenimage, the remote sensing data, originally high in dimension, will be further reduced to a feature space with one to several principal component bands. Subsequent image processing on the reduced feature space can thus be performed with improved accuracy. Experiments were conducted using three sets of ground reference data: corn chlorophyll content, corn plant population, and various corn hybrids. The results showed that with GA-SPCA, the number of original bands used for principal component analysis (PCA) could be reduced to 17, 26, and 25 from a 60-band hyperspectral image, respectively. In all cases, the correlation coefficients between image and ground reference data were greater when using GA-SPCA than that for PCA results with all original bands. This indicates that bands with no contribution to a specific application were removed prior to PCA. The variance related to a specific application within the image was transformed with more emphasis by using bands sensitive to that application. The selected bands can also provide useful information for future imaging sensor development.
引用
收藏
页码:1469 / 1478
页数:10
相关论文
共 34 条
[1]   Delineation of soil variability using geostatistics and fuzzy clustering analyses of hyperspectral data [J].
Ahn, CW ;
Baumgardner, MF ;
Biehl, LL .
SOIL SCIENCE SOCIETY OF AMERICA JOURNAL, 1999, 63 (01) :142-150
[2]   Pixel classification using variable string genetic algorithms with chromosome differentiation [J].
Bandyopadhyay, S ;
Pal, SK .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2001, 39 (02) :303-308
[3]   Information theory-based band selection for multispectral systems [J].
Bassett, EM ;
Shen, SS .
IMAGING SPECTROMETRY III, 1997, 3118 :28-35
[4]   LIGHT REFLECTANCE COMPARED WITH OTHER NITROGEN STRESS MEASUREMENTS IN CORN LEAVES [J].
BLACKMER, TM ;
SCHEPERS, JS ;
VARVEL, GE .
AGRONOMY JOURNAL, 1994, 86 (06) :934-938
[5]   Comparing prediction power and stability of broadband and hyperspectral vegetation indices for estimation of green leaf area index and canopy chlorophyll density [J].
Broge, NH ;
Leblanc, E .
REMOTE SENSING OF ENVIRONMENT, 2001, 76 (02) :156-172
[6]  
CHAVEZ PS, 1989, PHOTOGRAMM ENG REM S, V55, P339
[7]  
DEJONG KA, 1975, THESIS U MICHIGAN AN
[8]   EVALUATING WHEAT NITROGEN STATUS WITH CANOPY REFLECTANCE INDEXES AND DISCRIMINANT-ANALYSIS [J].
FILELLA, I ;
SERRANO, L ;
SERRA, J ;
PENUELAS, J .
CROP SCIENCE, 1995, 35 (05) :1400-1405
[9]  
Goldberg D. E., 1989, GENETIC ALGORITHMS S
[10]  
GOLDBERG DE, 1989, PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON GENETIC ALGORITHMS, P70