Discriminative Unsupervised Feature Learning with Exemplar Convolutional Neural Networks

被引:482
作者
Dosovitskiy, Alexey [1 ]
Fischer, Philipp [1 ]
Springenberg, Jost Tobias [1 ]
Riedmiller, Martin [1 ]
Brox, Thomas [1 ]
机构
[1] Univ Freiburg, Dept Comp Sci, Freiburg, Germany
基金
欧洲研究理事会;
关键词
Convolutional networks; unsupervised learning; feature learning; image classification; descriptor matching;
D O I
10.1109/TPAMI.2015.2496141
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep convolutional networks have proven to be very successful in learning task specific features that allow for unprecedented performance on various computer vision tasks. Training of such networks follows mostly the supervised learning paradigm, where sufficiently many input-output pairs are required for training. Acquisition of large training sets is one of the key challenges, when approaching a new task. In this paper, we aim for generic feature learning and present an approach for training a convolutional network using only unlabeled data. To this end, we train the network to discriminate between a set of surrogate classes. Each surrogate class is formed by applying a variety of transformations to a randomly sampled 'seed' image patch. In contrast to supervised network training, the resulting feature representation is not class specific. It rather provides robustness to the transformations that have been applied during training. This generic feature representation allows for classification results that outperform the state of the art for unsupervised learning on several popular datasets (STL-10, CIFAR-10, Caltech-101, Caltech-256). While features learned with our approach cannot compete with class specific features from supervised training on a classification task, we show that they are advantageous on geometric matching problems, where they also outperform the SIFT descriptor.
引用
收藏
页码:1734 / 1747
页数:14
相关论文
共 46 条
[1]   Training Hierarchical Feed-Forward Visual Recognition Models Using Transfer Learning from Pseudo-Tasks [J].
Ahmed, Amr ;
Yu, Kai ;
Xu, Wei ;
Gong, Yihong ;
Xing, Eric .
COMPUTER VISION - ECCV 2008, PT III, PROCEEDINGS, 2008, 5304 :69-+
[2]  
Amini MR, 2002, FR ART INT, V77, P390
[3]  
[Anonymous], 2010, P ADV NEUR INF PROC
[4]  
[Anonymous], 2012, ICML
[5]  
[Anonymous], ARXIVCS12070580V3
[6]  
Bo L., 2012, P INT S EXP ROB ISER, P387, DOI DOI 10.1007/978-3-319-00065-7_27
[7]   Multipath Sparse Coding Using Hierarchical Matching Pursuit [J].
Bo, Liefeng ;
Ren, Xiaofeng ;
Fox, Dieter .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :660-667
[8]  
Boureau YL, 2011, IEEE I CONF COMP VIS, P2651, DOI 10.1109/ICCV.2011.6126555
[9]  
Coates A., 2011, P 14 INT C ART INT S, P215
[10]  
Coates A., 2011, ADV NEURAL INFORM PR, V24, P2528, DOI DOI 10.1016/J.PSYCHRES.2009.03.008