A Bayesian network-based framework for semantic image understanding

被引:84
作者
Luo, JB
Savakis, AE
Singhal, A
机构
[1] Eastman Kodak Co, Res & Dev Labs, Rochester, NY 14650 USA
[2] Rochester Inst Technol, Dept Comp Engn, Rochester, NY 14623 USA
关键词
semantic image understanding; low-level features; semantic features; Bayesian networks; domain knowledge;
D O I
10.1016/j.patcog.2004.11.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current research in content-based semantic image understanding is largely confined to exemplar-based approaches built on low-level feature extraction and classification. The ability to extract both low-level and semantic features and perform knowledge integration of different types of features is expected to raise semantic image understanding to a new level. Belief networks, or Bayesian networks (BN), have proven to be an effective knowledge representation and inference engine in artificial intelligence and expert systems research. Their effectiveness is due to the ability to explicitly integrate domain knowledge in the network structure and to reduce a joint probability distribution to conditional independence relationships. In this paper, we present a general-purpose knowledge integration framework that employs BN in integrating both low-level and semantic features. The efficacy of this framework is demonstrated via three applications involving semantic understanding of pictorial images. The first application aims at detecting main photographic subjects in an image, the second aims at selecting the most appealing image in an event, and the third aims at classifying images into indoor or outdoor scenes. With these diverse examples, we demonstrate that effective inference engines can be built within this powerful and flexible framework according to specific domain knowledge and available training data to solve inherently uncertain vision problems. (c) 2005 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:919 / 934
页数:16
相关论文
共 45 条
  • [1] ANTONISSE J, 1988, P DAT FUS S
  • [2] Ballard D.H., 1982, Computer Vision
  • [3] BARAGHIMIAN GA, 1990, P INT SOC OPT ENG AE, P46
  • [4] BROTHERTON T, 1994, P INT C MULT FUS INT
  • [5] BUNTINE W, 1994, ARTIF INTELL RES, P2
  • [6] DEDOMBAL FT, 1991, BRIT MED J, P302
  • [7] Dudgeon D. E., 1993, Lincoln Laboratory Journal, V6, P3
  • [8] ETZ S, 2000, P IEEE INT C IM PROC
  • [9] ETZ S, 2000, P HUMAN VISION ELECT
  • [10] FUNG RM, 1995, COMMUN ACM, P38