A 2-D wavelet decomposition-based bag-of-visual-words model for land-use scene classification

被引:75
作者
Zhao, Lijun [1 ,2 ]
Tang, Ping [1 ]
Huo, Lianzhi [1 ]
机构
[1] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, Beijing 100101, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
FEATURE-EXTRACTION; COVER CLASSIFICATION; TEXTURE; FEATURES; IMAGERY; KERNELS; SCALE;
D O I
10.1080/01431161.2014.890762
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Previous works about spatial information incorporation into a traditional bag-of-visual-words (BOVW) model mainly consider the spatial arrangement of an image, ignoring the rich textural information in land-use remote-sensing images. Hence, this article presents a 2-D wavelet decomposition (WD)-based BOVW model for land-use scene classification, since the 2-D wavelet decomposition method does well not only in textural feature extraction, but also in the multi-resolution representation of an image, which is favourable for the use of both spatial arrangement and textural information in land-use images. The proposed method exploits the textural structures of an image with colour information transformed into greyscale. Moreover, it works first by decomposing the greyscale image into different sub-images using 2-D discrete wavelet transform (DWT) and then by extracting local features of the greyscale image and all the decomposed images with dense regions in which a given image is evenly sampled by a regular grid with a specified grid space. After that, the method generates the corresponding visual vocabularies and computes histograms of visual word occurrences of local features found in each former image. Specifically, the soft-assignment or multi-assignment (MA) technique is employed, accounting for the impact of clustering on visual vocabulary creation that two similar image patches may be clustered into different clusters when increasing the size of visual vocabulary. The proposed method is evaluated on a ground truth image dataset of 21 land-use classes manually extracted from high-resolution remote-sensing images. Experimental results demonstrate that the proposed method significantly outperforms previous methods, such as the traditional BOVW model, the spatial pyramid representation-based BOVW method, the multi-resolution representation-based BOVW method, and so on, and even exceeds the best result obtained from the creator of the land-use dataset. Therefore, the proposed approach is very suitable for land-use scene classification tasks.
引用
收藏
页码:2296 / 2310
页数:15
相关论文
共 44 条
  • [1] [Anonymous], 2007, P 6 ACM INT C IMAGE
  • [2] Texture classification using wavelet transform
    Arivazhagan, S
    Ganesan, L
    [J]. PATTERN RECOGNITION LETTERS, 2003, 24 (9-10) : 1513 - 1521
  • [3] Barla A, 2003, 2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, P513
  • [4] Bag of spatio-visual words for context inference in scene classification
    Bolovinou, A.
    Pratikakis, I.
    Perantonis, S.
    [J]. PATTERN RECOGNITION, 2013, 46 (03) : 1039 - 1053
  • [5] Scene classification using a hybrid generative/discriminative approach
    Bosch, Anna
    Zisserman, Andrew
    Munoz, Xavier
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (04) : 712 - 727
  • [6] Which is the best way to organize/classify images by content?
    Bosch, Anna
    Munoz, Xavier
    Marti, Robert
    [J]. IMAGE AND VISION COMPUTING, 2007, 25 (06) : 778 - 791
  • [7] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [8] Automatic landslide detection from remote-sensing imagery using a scene classification method based on BoVW and pLSA
    Cheng, Gong
    Guo, Lei
    Zhao, Tianyun
    Han, Junwei
    Li, Huihui
    Fang, Jun
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2013, 34 (01) : 45 - 59
  • [9] Latent semantic kernels
    Cristianini, N
    Shawe-Taylor, J
    Lodhi, H
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2002, 18 (2-3) : 127 - 152
  • [10] Csurka G., 2004, WORKSH STAT LEARN CO, V1, P1, DOI DOI 10.1234/12345678