Blobworld: Image segmentation using expectation-maximization and its application to image querying

Cited by: 862
Authors
Carson, C
Belongie, S
Greenspan, H
Malik, J
Affiliations
[1] Univ Calif Berkeley, Comp Sci Div, Berkeley, CA 94720 USA
[2] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[3] Tel Aviv Univ, Fac Engn, Dept Biomed Engn, IL-69978 Tel Aviv, Israel
[4] Univ Calif Berkeley, Comp Sci Div, Berkeley, CA 94720 USA
Funding
National Science Foundation (USA);
Keywords
segmentation and grouping; image retrieval; image querying; clustering; expectation-maximization;
DOI
10.1109/TPAMI.2002.1023800
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Retrieving images from large and varied collections using image content as a key is a challenging and important problem. We present a new image representation that provides a transformation from the raw pixel data to a small set of image regions that are coherent in color and texture. This "Blobworld" representation is created by clustering pixels in a joint color-texture-position feature space. The segmentation algorithm is fully automatic and has been run on a collection of 10,000 natural images. We describe a system that uses the Blobworld representation to retrieve images from this collection. An important aspect of the system is that the user is allowed to view the internal representation of the submitted image and the query results. Similar systems do not offer the user this view into the workings of the system; consequently, query results from these systems can be inexplicable, despite the availability of knobs for adjusting the similarity metrics. By finding image regions that roughly correspond to objects, we allow querying at the level of objects rather than global image properties. We present results indicating that querying for images using Blobworld produces higher precision than does querying using color and texture histograms of the entire image in cases where the image contains distinctive objects.
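The clustering step named in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It fits a Gaussian mixture with expectation-maximization to per-pixel features and labels each pixel with its most probable component. Several things here are assumptions for illustration only: the function names `pixel_features` and `em_segment` are hypothetical, texture features are left out (only color and position are used), the number of regions `k` is fixed by the caller, and the farthest-point seeding is a convenience choice rather than anything stated in the abstract.

```python
import numpy as np

def pixel_features(image):
    """Build one feature row per pixel: color channels plus normalized (row, col).

    Texture features are deliberately omitted in this simplified sketch.
    """
    h, w, c = image.shape
    rows, cols = np.mgrid[0:h, 0:w]
    pos = np.stack([rows / max(h - 1, 1), cols / max(w - 1, 1)], axis=-1)
    return np.concatenate([image.astype(float), pos], axis=-1).reshape(h * w, c + 2)

def em_segment(features, k, n_iter=30, seed=0, reg=1e-6):
    """Fit a k-component full-covariance Gaussian mixture with EM.

    Returns a hard label per pixel and the component means.
    """
    x = np.asarray(features, dtype=float)
    n, d = x.shape
    rng = np.random.default_rng(seed)

    # Assumed initialization: farthest-point seeds, then nearest-seed assignment.
    seeds = [x[rng.integers(n)]]
    for _ in range(k - 1):
        dist = np.min([np.sum((x - s) ** 2, axis=1) for s in seeds], axis=0)
        seeds.append(x[np.argmax(dist)])
    dist = np.stack([np.sum((x - s) ** 2, axis=1) for s in seeds], axis=1)
    resp = np.eye(k)[dist.argmin(axis=1)]  # one-hot responsibilities

    for _ in range(n_iter):
        # M-step: update weights, means, and covariances from responsibilities.
        nk = resp.sum(axis=0) + 1e-12
        weights = nk / n
        means = (resp.T @ x) / nk[:, None]

        # E-step: recompute responsibilities from Gaussian log densities.
        log_r = np.empty((n, k))
        for j in range(k):
            diff = x - means[j]
            cov = (resp[:, j, None] * diff).T @ diff / nk[j] + reg * np.eye(d)
            _, logdet = np.linalg.slogdet(cov)
            maha = np.einsum("ni,ij,nj->n", diff, np.linalg.inv(cov), diff)
            log_r[:, j] = np.log(weights[j]) - 0.5 * (maha + logdet + d * np.log(2 * np.pi))
        log_r -= log_r.max(axis=1, keepdims=True)  # stabilize before exponentiating
        resp = np.exp(log_r)
        resp /= resp.sum(axis=1, keepdims=True)

    return resp.argmax(axis=1), means
```

Calling `em_segment(pixel_features(image), k)` and reshaping the labels to the image height and width gives a region map. Including position in the feature vector is what encourages spatially compact regions instead of clusters based on color alone.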
Pages: 1026-1038
Number of pages: 13