Unsupervised Learning of Hierarchical Representations with Convolutional Deep Belief Networks

被引:291
作者
Lee, Honglak [1 ]
Grosse, Roger [2 ]
Ranganath, Rajesh [3 ]
Ng, Andrew Y. [3 ]
机构
[1] Univ Michigan, Comp Sci & Engn Div, Ann Arbor, MI 48109 USA
[2] MIT, CSAIL, Cambridge, MA 02139 USA
[3] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
INFERENCE;
D O I
10.1145/2001269.2001295
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
There has been much interest in unsupervised learning of hierarchical generative models such as deep belief networks (DBNs); however, scaling such models to full-sized, high-dimensional images remains a difficult problem. To address this problem, we present the convolutional deep belief network, a hierarchical generative model that scales to realistic image sizes. This model is translation-invariant and supports efficient bottom-up and top-down probabilistic inference. Key to our approach is probabilistic max-pooling, a novel technique that shrinks the representations of higher layers in a probabilistically sound way. Our experiments show that the algorithm learns useful high-level visual features, such as object parts, from unlabeled images of objects and natural scenes. We demonstrate excellent performance on several visual recognition tasks and show that our model can perform hierarchical (bottom-up and top-down) inference over full-sized images.
引用
收藏
页码:95 / 103
页数:9
相关论文
共 38 条
[1]  
[Anonymous], P INT C MACH LEARN
[2]  
[Anonymous], ADV NEURAL INFORM PR
[3]  
[Anonymous], 2006, PROC IEEE C COMPUTER
[4]  
[Anonymous], CVPR WORKSH GEN MOD
[5]  
[Anonymous], 2007, NEURIPS
[6]  
[Anonymous], P INT C MACH LEARN
[7]  
[Anonymous], 2009, JMLR P
[8]  
[Anonymous], P IEEE C COMP VIS PA
[9]  
[Anonymous], P IEEE C COMP VIS PA
[10]  
[Anonymous], 2008, ADV NEURAL INFORM PR