Dynamic trees for image modelling

被引:18
作者
Adams, NJ [1 ]
Williams, CKI [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Inst Adapt & Neural Computat, Edinburgh EH1 2QL, Midlothian, Scotland
关键词
Bayesian image modelling; belief networks; dynamic tree; variational inference; mean field; expectation-maximisation;
D O I
10.1016/S0262-8856(03)00073-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a new class of image model which we call dynamic trees or DTs. A dynamic tree model specifies a prior over structures of trees, each of which is a forest of one or more tree-structured belief networks (TSBN). In the literature standard tree-structured belief network models have been found to produce 'blocky' segmentations when naturally occurring boundaries within an image did not coincide with those of the subtrees in the rigid fixed structure of the network. Dynamic trees have a flexible architecture which allows the structure to vary to create configurations where the subtree and image boundaries align, and experimentation with the model has shown significant improvements. For large models the number of tree configurations quickly becomes intractable to enumerate over, presenting a problem for exact inference. Techniques such as Gibbs sampling over trees and search using simulated annealing have been considered, but a variational approximation based upon mean field was found to work faster while still producing a good approximation to the true model probability distribution. We look briefly at this mean field approximation before deriving an EM-style update based upon mean field inference for learning the parameters of the dynamic tree model. After development of algorithms for learning the dynamic tree model is applied to a database of images of outdoor scenes where all of its parameters are learned. DTs are seen to offer significant improvement in performance over the fixed-architecture TSBN and in a coding comparison the DT achieves 0.294 bits per pixel (bpp) compression compared to 0.378 bpp for lossless JPEG on images of seven colours. (C) 2003 Published by Elsevier B.V.
引用
收藏
页码:865 / 877
页数:13
相关论文
共 33 条
[1]  
Adams NJ, 1999, IEE CONF PUBL, P527, DOI 10.1049/cp:19991163
[2]  
ADAMS NJ, 2001, 4 INT ICSC S SOFT CO
[3]  
ADAMS NJ, 2001, THESIS U EDINBURGH 5
[4]   Oil splitting in industrial cleaning systems as studied by conductivity and interfacial tension [J].
Adamy, ST ;
Cala, FR .
JOURNAL OF SURFACTANTS AND DETERGENTS, 2000, 3 (02) :151-158
[5]  
[Anonymous], HDB BRAIN THEORY NEU
[6]   A MULTISCALE RANDOM-FIELD MODEL FOR BAYESIAN IMAGE SEGMENTATION [J].
BOUMAN, CA ;
SHAPIRO, M .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1994, 3 (02) :162-177
[7]  
Charniak Eugene, 1993, STAT LANGUAGE LEARNI
[8]   CLASSIFICATION OF TEXTURES USING GAUSSIAN MARKOV RANDOM-FIELDS [J].
CHELLAPPA, R ;
CHATTERJEE, S .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (04) :959-963
[9]  
CHOU PA, 1989, P SOC PHOTO-OPT INS, V1199, P852
[10]   Wavelet-based statistical signal processing using hidden Markov models [J].
Crouse, MS ;
Nowak, RD ;
Baraniuk, RG .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1998, 46 (04) :886-902