Bilinear sparse coding for invariant vision

Cited by: 47
Authors
Grimes, DB [1]
Rao, RPN [1]
Affiliations
[1] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
DOI
10.1162/0899766052530893
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recent algorithms for sparse coding and independent component analysis (ICA) have demonstrated how localized features can be learned from natural images. However, these approaches do not take image transformations into account. We describe an unsupervised algorithm for learning both localized features and their transformations directly from images using a sparse bilinear generative model. We show that from an arbitrary set of natural images, the algorithm produces oriented basis filters that can simultaneously represent features in an image and their transformations. The learned generative model can be used to translate features to different locations, thereby reducing the need to learn the same feature at multiple locations, a limitation of previous approaches to sparse coding and ICA. Our results suggest that by explicitly modeling the interaction between local image features and their transformations, the sparse bilinear approach can provide a basis for achieving transformation-invariant vision.
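As a rough illustration of what "bilinear" means in the abstract, the sketch below assumes the commonly used form z = sum_i sum_j w_ij x_i y_j, where x holds sparse feature coefficients and y holds transformation coefficients. The function name `bilinear_generate`, the toy basis `w`, and all dimensions are hypothetical and are not taken from the paper; the learning procedure itself is not shown.

```python
# Minimal sketch of a bilinear generative model (assumed form, toy values).
def bilinear_generate(w, x, y):
    """Return z with z[k] = sum_i sum_j w[i][j][k] * x[i] * y[j]."""
    n_pixels = len(w[0][0])
    z = [0.0] * n_pixels
    for i, xi in enumerate(x):
        for j, yj in enumerate(y):
            for k in range(n_pixels):
                z[k] += w[i][j][k] * xi * yj
    return z

# Hypothetical basis: 2 features x 2 shifts over a 3-pixel "image".
w = [
    [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]],  # feature 0 at shift 0, shift 1
    [[0.0, 1.0, 0.0], [0.0, 0.0, 1.0]],  # feature 1 at shift 0, shift 1
]
x = [2.0, 0.0]         # sparse feature code: only feature 0 is active
y_shift0 = [1.0, 0.0]  # transformation code selecting shift 0
y_shift1 = [0.0, 1.0]  # transformation code selecting shift 1

z0 = bilinear_generate(w, x, y_shift0)
z1 = bilinear_generate(w, x, y_shift1)
```

In this toy setup the same feature code `x` yields the feature at two different positions depending only on `y`, which is the sense in which one learned feature need not be relearned at every location.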
Pages: 47-73
Number of pages: 27