A sparse-response deep belief network based on rate distortion theory

被引:87
作者
Ji, Nan-Nan [1 ]
Zhang, Jiang-She [1 ]
Zhang, Chun-Xia [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Shaanxi, Peoples R China
基金
高等学校博士学科点专项科研基金; 中国国家自然科学基金;
关键词
Deep belief network; Kullback-Leibler divergence; Information entropy; Rate distortion theory; Unsupervised feature learning; NEURAL-NETWORKS; VISUAL-CORTEX; STIMULI; LEARN; V2;
D O I
10.1016/j.patcog.2014.03.025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep belief networks (DBNs) are currently the dominant technique for modeling the architectural depth of brain, and can be trained efficiently in a greedy layer-wise unsupervised learning manner. However, DBNs without a narrow hidden bottleneck typically produce redundant, continuous-valued codes and unstructured weight patterns. Taking inspiration from rate distortion (RD) theory, which encodes original data using as few bits as possible, we introduce in this paper a variant of DBN, referred to as sparse-response DBN (SR-DBN). In this approach, Kullback-Leibler divergence between the distribution of data and the equilibrium distribution defined by the building block of DBN is considered as a distortion function, and the sparse response regularization induced by L-1-norm of codes is used to achieve a small code rate. Several experiments by extracting features from different scale image datasets show that our approach SR-DBN learns codes with small rate, extracts features at multiple levels of abstraction mimicking computations in the cortical hierarchy, and obtains more discriminative representation than PCA and several basic algorithms of DBNs. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3179 / 3191
页数:13
相关论文
共 52 条
[1]  
[Anonymous], 2011, P 25 AAAI C ART INT
[2]  
[Anonymous], 2008, Advances in neural information processing systems
[3]  
[Anonymous], SPARSE PENALTY DEEP
[4]  
[Anonymous], 1991, ELEMENTS INFORM THEO, DOI [DOI 10.1002/0471200611, 10.1002/0471200611]
[5]  
[Anonymous], 2006, NeurIPS
[6]  
[Anonymous], 1992, ADV NEURAL INFORM PR
[7]  
[Anonymous], 2008, P 25 INT C MACH LEAR
[8]  
Barlow H B, 1972, Perception, V1, P371, DOI 10.1068/p010371
[9]   The ''independent components'' of natural scenes are edge filters [J].
Bell, AJ ;
Sejnowski, TJ .
VISION RESEARCH, 1997, 37 (23) :3327-3338
[10]  
Bengio Y., 2007, P ADV NEUR INF PROC, P153