Object Detection in Optical Remote Sensing Images Based on Weakly Supervised Learning and High-Level Feature Learning

被引:627
作者
Han, Junwei [1 ]
Zhang, Dingwen [1 ]
Cheng, Gong [1 ]
Guo, Lei [1 ]
Ren, Jinchang [2 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
[2] Univ Strathclyde, Dept Elect & Elect Engn, Glasgow G1 1XW, Lanark, Scotland
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2015年 / 53卷 / 06期
基金
美国国家科学基金会;
关键词
Bayesian framework; deep Boltzmann machine (DBM); object detection; weakly supervised learning (WSL); VISUAL SALIENCY; TARGET DETECTION; CLASSIFICATION; EXTRACTION; FRAMEWORK;
D O I
10.1109/TGRS.2014.2374218
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The abundant spatial and contextual information provided by the advanced remote sensing technology has facilitated subsequent automatic interpretation of the optical remote sensing images (RSIs). In this paper, a novel and effective geospatial object detection framework is proposed by combining the weakly supervised learning (WSL) and high-level feature learning. First, deep Boltzmann machine is adopted to infer the spatial and structural information encoded in the low-level and middle-level features to effectively describe objects in optical RSIs. Then, a novel WSL approach is presented to object detection where the training sets require only binary labels indicating whether an image contains the target object or not. Based on the learnt high-level features, it jointly integrates saliency, intraclass compactness, and interclass separability in a Bayesian framework to initialize a set of training examples from weakly labeled images and start iterative learning of the object detector. A novel evaluation criterion is also developed to detect model drift and cease the iterative learning. Comprehensive experiments on three optical RSI data sets have demonstrated the efficacy of the proposed approach in bench-marking with several state-of-the-art supervised-learning-based object detection approaches.
引用
收藏
页码:3325 / 3337
页数:13
相关论文
共 40 条
[21]   A New Method on Inshore Ship Detection in High-Resolution Satellite Images Using Shape and Context Information [J].
Liu, Ge ;
Zhang, Yasen ;
Zheng, Xinwei ;
Sun, Xian ;
Fu, Kun ;
Wang, Hongqi .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2014, 11 (03) :617-621
[22]   Automated Vehicle Extraction and Speed Determination From QuickBird Satellite Images [J].
Liu, Wen ;
Yamazaki, Fumio ;
Tuong Thuy Vu .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2011, 4 (01) :75-82
[23]   Distinctive image features from scale-invariant keypoints [J].
Lowe, DG .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) :91-110
[24]   Optical Image Classification: A Ground-Truth Design Framework [J].
Pasolli, Edoardo ;
Melgani, Farid ;
Alajlan, Naif ;
Conci, Nicola .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2013, 51 (06) :3580-3597
[25]   Low-Level Visual Saliency With Application on Aerial Imagery [J].
Rigas, Ioannis ;
Economou, George ;
Fotopoulos, Spiros .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2013, 10 (06) :1389-1393
[26]  
Salakhutdinov R., 2009, P 12 INT C ART INT S, Vvol 5, ppp 448
[27]  
Shao L., IEEE T NEUR IN PRESS
[28]   Learning Deep and Wide: A Spectral Method for Learning Deep Networks [J].
Shao, Ling ;
Wu, Di ;
Li, Xuelong .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (12) :2303-2308
[29]   Urban-Area and Building Detection Using SIFT Keypoints and Graph Theory [J].
Sirmacek, Berit ;
Uensalan, Cern .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2009, 47 (04) :1156-1167
[30]  
Siva P., P EUR C COMP VIS, P594