Do Convolutional Neural Networks Learn Class Hierarchy?

被引：124

作者：

Alsallakh, Bilal ^{[1
]}

Jourabloo, Amin ^{[2
]}

Ye, Mao ^{[1
]}

Liu, Xiaoming ^{[2
]}

Ren, Liu ^{[1
]}

机构：

[1] Bosch Res North Amer, Palo Alto, CA 94304 USA

[2] Michigan State Univ, E Lansing, MI 48824 USA

来源：

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS | 2018年 / 24卷 / 01期

关键词：

Convolutional Neural Networks; deep learning; image classification; large-scale classification; confusion matrix;

D O I：

10.1109/TVCG.2017.2744683

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Convolutional Neural Networks (CNNs) currently achieve state-of-the-art accuracy in image classification. With a growing number of classes. the accuracy usually drops as the possibilities of confusion increase. Interestingly, the class confusion patterns follow a hierarchical structure over the classes. We present visual-analytics methods to reveal and analyze this hierarchy of similar classes in relation with CNN-internal data. We found that this hierarchy not only dictates the confusion patterns between the classes, it furthermore dictates the learning behavior of CNNs. In particular, the early layers in these networks develop feature detectors that can separate high-level groups of classes quite well. even after a few training epochs. In contrast, the latter layers require substantially more epochs to develop specialized feature detectors that can separate individual classes. We demonstrate how these insights are key to significant improvement in accuracy by designing hierarchy-aware CNNs that accelerate model convergence and alleviate overfitting. We further demonstrate how our methods help in identifying various quality issues in the training data.

引用

页码：152 / 162

页数：11

共 77 条

[1] Visual Methods for Analyzing Probabilistic Classification Data
Alsallakh, Bilal
Hanbury, Allan
Hauser, Helwig
Miksch, Silvia
Rauber, Andreas
[J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2014, 20 (12) : 1703 - 1712
[2] ModelTracker: Redesigning Performance Analysis Tools for Machine Learning
Amershi, Saleema
Chickering, Max
Drucker, Steven M.
Lee, Bongshin
Simard, Patrice
Suh, Jina
[J]. CHI 2015: PROCEEDINGS OF THE 33RD ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2015, : 337 - 346
[3] Nguyen A, 2015, PROC CVPR IEEE, P427, DOI 10.1109/CVPR.2015.7298640
[4] [Anonymous], 2016, P ADV NEURAL INFORM
[5] [Anonymous], 2009, CHI2009 P 27 ANN CHI
[6] [Anonymous], 2009, P INT C ART INT STAT
[7] [Anonymous], ICML WORKSH VIS DEEP
[8] [Anonymous], 2016, Multifaceted feature visualization: Uncovering the different types of features learned by each neuron in deep neural networks
[9] [Anonymous], 2015, ICLR
[10] [Anonymous], Understanding neural networks through deep visualization

← 1 2 3 4 5 6 7 8 →