PCANet: A Simple Deep Learning Baseline for Image Classification?

被引:1448
作者
Chan, Tsung-Han [1 ]
Jia, Kui [2 ]
Gao, Shenghua [3 ]
Lu, Jiwen
Zeng, Zinan [4 ]
Ma, Yi [3 ,5 ]
机构
[1] MediaTek Inc, Hsinchu 30078, Taiwan
[2] Univ Macau, Fac Sci & Technol, Dept Comp & Informat Sci, Macau, Peoples R China
[3] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai 200031, Peoples R China
[4] Adv Digital Sci Ctr, Singapore 138632, Singapore
[5] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA
关键词
Convolution neural network; deep learning; PCA network; random network; LDA network; face recognition; handwritten digit recognition; object classification; FACE-RECOGNITION; PATTERNS; MAGNITUDES; ALGORITHM; MODELS;
D O I
10.1109/TIP.2015.2475625
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
In this paper, we propose a very simple deep learning network for image classification that is based on very basic data processing components: 1) cascaded principal component analysis (PCA); 2) binary hashing; and 3) blockwise histograms. In the proposed architecture, the PCA is employed to learn multistage filter banks. This is followed by simple binary hashing and block histograms for indexing and pooling. This architecture is thus called the PCA network (PCANet) and can be extremely easily and efficiently designed and learned. For comparison and to provide a better understanding, we also introduce and study two simple variations of PCANet: 1) RandNet and 2) LDANet. They share the same topology as PCANet, but their cascaded filters are either randomly selected or learned from linear discriminant analysis. We have extensively tested these basic networks on many benchmark visual data sets for different tasks, including Labeled Faces in the Wild (LFW) for face verification; the MultiPIE, Extended Yale B, AR, Facial Recognition Technology (FERET) data sets for face recognition; and MNIST for hand-written digit recognition. Surprisingly, for all tasks, such a seemingly naive PCANet model is on par with the state-of-the-art features either prefixed, highly hand-crafted, or carefully learned [by deep neural networks (DNNs)]. Even more surprisingly, the model sets new records for many classification tasks on the Extended Yale B, AR, and FERET data sets and on MNIST variations. Additional experiments on other public data sets also demonstrate the potential of PCANet to serve as a simple but highly competitive baseline for texture classification and object recognition.
引用
收藏
页码:5017 / 5032
页数:16
相关论文
共 65 条
[1]
Face description with local binary patterns:: Application to face recognition [J].
Ahonen, Timo ;
Hadid, Abdenour ;
Pietikainen, Matti .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (12) :2037-2041
[2]
[Anonymous], 2010, Advances in Neural Information Processing Systems (NIPS 2010)
[3]
[Anonymous], 2012, ICML
[4]
[Anonymous], 2008, 2008 IEEE C COMP VIS, DOI DOI 10.1109/CVPR.2008.4587369
[5]
[Anonymous], 2013, Caffe: An open source convolutional architecture for fast feature embedding
[6]
[Anonymous], 2010, Adv. Neural Inf. Process. Syst
[7]
[Anonymous], P BRIT MACH VIS C
[8]
[Anonymous], 2010, Proceedings of the 27th International Conference on Machine Learning (ICML-10)
[9]
Fast High Dimensional Vector Multiplication Face Recognition [J].
Barkan, Oren ;
Weill, Jonathan ;
Wolf, Lior ;
Aronowitz, Hagai .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :1960-1967
[10]
Shape matching and object recognition using shape contexts [J].
Belongie, S ;
Malik, J ;
Puzicha, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) :509-522