Do Convolutional Neural Networks Learn Class Hierarchy?

被引:124
作者
Alsallakh, Bilal [1 ]
Jourabloo, Amin [2 ]
Ye, Mao [1 ]
Liu, Xiaoming [2 ]
Ren, Liu [1 ]
机构
[1] Bosch Res North Amer, Palo Alto, CA 94304 USA
[2] Michigan State Univ, E Lansing, MI 48824 USA
关键词
Convolutional Neural Networks; deep learning; image classification; large-scale classification; confusion matrix;
D O I
10.1109/TVCG.2017.2744683
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Convolutional Neural Networks (CNNs) currently achieve state-of-the-art accuracy in image classification. With a growing number of classes. the accuracy usually drops as the possibilities of confusion increase. Interestingly, the class confusion patterns follow a hierarchical structure over the classes. We present visual-analytics methods to reveal and analyze this hierarchy of similar classes in relation with CNN-internal data. We found that this hierarchy not only dictates the confusion patterns between the classes, it furthermore dictates the learning behavior of CNNs. In particular, the early layers in these networks develop feature detectors that can separate high-level groups of classes quite well. even after a few training epochs. In contrast, the latter layers require substantially more epochs to develop specialized feature detectors that can separate individual classes. We demonstrate how these insights are key to significant improvement in accuracy by designing hierarchy-aware CNNs that accelerate model convergence and alleviate overfitting. We further demonstrate how our methods help in identifying various quality issues in the training data.
引用
收藏
页码:152 / 162
页数:11
相关论文
共 77 条
  • [51] Mäkinen E, 2005, INFORM-J COMPUT INFO, V29, P357
  • [52] WORDNET - A LEXICAL DATABASE FOR ENGLISH
    MILLER, GA
    [J]. COMMUNICATIONS OF THE ACM, 1995, 38 (11) : 39 - 41
  • [53] Deep Decision Network for Multi-Class Image Classification
    Murthy, Venkatesh N.
    Singh, Vivek
    Chen, Terrence
    Manmatha, R.
    Comaniciu, Dorin
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2240 - 2248
  • [54] Visual Boosting in Pixel-based Visualizations
    Oelke, Daniela
    Janetzko, Halldor
    Simon, Svenja
    Neuhaus, Klaus
    Keim, Daniel A.
    [J]. COMPUTER GRAPHICS FORUM, 2011, 30 (03) : 871 - 880
  • [55] Patel Kayur, 2010, P 23 ANN ACM S US IN, P37, DOI DOI 10.1145/1866029.1866038
  • [56] Visualizing the Hidden Activity of Artificial Neural Networks
    Rauber, Paulo E.
    Fadel, Samuel G.
    Falcao, Alexandre X.
    Telea, Alexandru C.
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2017, 23 (01) : 101 - 110
  • [57] Squares: Supporting Interactive Performance Analysis for Multiclass Classifiers
    Ren, Donghao
    Amershi, Saleema
    Lee, Bongshin
    Suh, Jina
    Williams, Jason D.
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2017, 23 (01) : 61 - 70
  • [58] "Why Should I Trust You?" Explaining the Predictions of Any Classifier
    Ribeiro, Marco Tulio
    Singh, Sameer
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1135 - 1144
  • [59] Deep learning
    Rusk, Nicole
    [J]. NATURE METHODS, 2016, 13 (01) : 35 - 35
  • [60] ImageNet Large Scale Visual Recognition Challenge
    Russakovsky, Olga
    Deng, Jia
    Su, Hao
    Krause, Jonathan
    Satheesh, Sanjeev
    Ma, Sean
    Huang, Zhiheng
    Karpathy, Andrej
    Khosla, Aditya
    Bernstein, Michael
    Berg, Alexander C.
    Fei-Fei, Li
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 115 (03) : 211 - 252