Learning Hierarchical Features for Scene Labeling

被引:1860
作者
Farabet, Clement [1 ,2 ]
Couprie, Camille [1 ]
Najman, Laurent [2 ]
LeCun, Yann [1 ]
机构
[1] NYU, Courant Inst Math Sci, New York, NY 10003 USA
[2] Univ Paris Est, Equipe A3SI, ESIEE Paris, Lab Informat Gaspard Monge, F-93160 Noisy Le Grand, France
关键词
Convolutional networks; deep learning; image segmentation; image classification; scene parsing; ENERGY MINIMIZATION; OBJECT RECOGNITION;
D O I
10.1109/TPAMI.2012.231
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene labeling consists of labeling each pixel in an image with the category of the object it belongs to. We propose a method that uses a multiscale convolutional network trained from raw pixels to extract dense feature vectors that encode regions of multiple sizes centered on each pixel. The method alleviates the need for engineered features, and produces a powerful representation that captures texture, shape, and contextual information. We report results using multiple postprocessing methods to produce the final labeling. Among those, we propose a technique to automatically retrieve, from a pool of segmentation components, an optimal set of components that best explain the scene; these components are arbitrary, for example, they can be taken from a segmentation tree or from any family of oversegmentations. The system yields record accuracies on the SIFT Flow dataset (33 classes) and the Barcelona dataset (170 classes) and near-record accuracy on Stanford background dataset (eight classes), while being an order of magnitude faster than competing approaches, producing a 320 x 240 image labeling in less than a second, including feature extraction.
引用
收藏
页码:1915 / 1929
页数:15
相关论文
共 47 条
  • [11] An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision
    Boykov, Y
    Kolmogorov, V
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (09) : 1124 - 1137
  • [12] Boykov YY, 2001, EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL I, PROCEEDINGS, P105, DOI 10.1109/ICCV.2001.937505
  • [13] CPMC: Automatic Object Segmentation Using Constrained Parametric Min-Cuts
    Carreira, Joao
    Sminchisescu, Cristian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (07) : 1312 - 1328
  • [14] Ciresan D, 2011, 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), P1918, DOI 10.1109/IJCNN.2011.6033458
  • [15] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [16] Farabet C., 2010, P INT S CIRC SYST MA
  • [17] Farabet C., 2012, P INT C MACH LEARN J
  • [18] Farabet C., 2012, CORR, Vabs/1202.2160
  • [19] Farabet C., 2011, P 5 IEEE WORKSH EMB
  • [20] Efficient graph-based image segmentation
    Felzenszwalb, PF
    Huttenlocher, DP
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 59 (02) : 167 - 181