Statistical modeling and conceptualization of visual patterns

Citations: 69
Author
Zhu, SC
Affiliations
[1] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
Keywords
perceptual organization; descriptive models; generative models; causal Markov models; discriminative methods; minimax entropy learning; mixed Markov models
DOI
10.1109/TPAMI.2003.1201820
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Natural images contain an overwhelming number of visual patterns generated by diverse stochastic processes. Defining and modeling these patterns is of fundamental importance for generic vision tasks, such as perceptual organization, segmentation, and recognition. The objective of this epistemological paper is to summarize various threads of research in the literature and to pursue a unified framework for conceptualization, modeling, learning, and computing visual patterns. This paper starts with reviewing four research streams: 1) the study of image statistics, 2) the analysis of image components, 3) the grouping of image elements, and 4) the modeling of visual patterns. The models from these research streams are then divided into four categories according to their semantic structures: 1) descriptive models, i.e., Markov random fields (MRF) or Gibbs, 2) variants of descriptive models (causal MRF and "pseudodescriptive" models), 3) generative models, and 4) discriminative models. The objectives, principles, theories, and typical models are reviewed in each category and the relationships between the four types of models are studied. Two central themes emerge from the relationship studies. 1) In representation, the integration of descriptive and generative models is the future direction for statistical modeling and should lead to richer and more advanced classes of vision models. 2) To make visual models computationally tractable, discriminative models are used as computational heuristics for inferring generative models. Thus, the roles of four types of models are clarified. The paper also addresses the issue of conceptualizing visual patterns and their components (vocabularies) from the perspective of statistical mechanics. 
Under this unified framework, a visual pattern is equalized to a statistical ensemble, and, furthermore, statistical models for various visual patterns form a "continuous" spectrum in the sense that they belong to a series of nested probability families in the space of attributed graphs.
Pages: 691-712
Number of pages: 22