Deep Clustering for Unsupervised Learning of Visual Features

被引:1741
作者
Caron, Mathilde [1 ]
Bojanowski, Piotr [1 ]
Joulin, Armand [1 ]
Douze, Matthijs [1 ]
机构
[1] Facebook AI Res, Paris, France
来源
COMPUTER VISION - ECCV 2018, PT XIV | 2018年 / 11218卷
关键词
Unsupervised learning; Clustering;
D O I
10.1007/978-3-030-01264-9_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering is a class of unsupervised learning methods that has been extensively applied and studied in computer vision. Little work has been done to adapt it to the end-to-end training of visual features on large-scale datasets. In this work, we present DeepCluster, a clustering method that jointly learns the parameters of a neural network and the cluster assignments of the resulting features. DeepCluster iteratively groups the features with a standard clustering algorithm, k-means, and uses the subsequent assignments as supervision to update the weights of the network. We apply DeepCluster to the unsupervised training of convolutional neural networks on large datasets like ImageNet and YFCC100M. The resulting model outperforms the current state of the art by a significant margin on all the standard benchmarks.
引用
收藏
页码:139 / 156
页数:18
相关论文
共 73 条
[1]   Learning to See by Moving [J].
Agrawal, Pulkit ;
Carreira, Joao ;
Malik, Jitendra .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :37-45
[2]  
[Anonymous], 1994, Advances in Neural Information Processing Systems
[3]  
[Anonymous], 2009, U MONTREAL
[4]  
[Anonymous], 2010, ICML
[5]  
[Anonymous], Understanding neural networks through deep visualization
[6]  
Bach F., 2012, ARXIV12066413
[7]  
Bach F.R., 2008, Advances in Neural Information Processing Systems, P49
[8]  
Bengio Y., 2006, ADV NEURAL INFORM PR, V19
[9]  
Bojanowski P., 2017, arXiv preprint arXiv:1707.05776
[10]  
Bottou Leon, 2012, Neural Networks: Tricks of the Trade. Second Edition: LNCS 7700, P421, DOI 10.1007/978-3-642-35289-8_25