Semantic-friendly indexing and quering of images based on the extraction of the objective semantic cues

Cited by: 65
Authors
Mojsilovic, A [1]
Gomes, J [1]
Rogowitz, B [1]
Affiliations
[1] IBM Corp, TJ Watson Res Ctr, Hawthorne, NY 10532 USA
Keywords
image semantic categorization; image browsing and retrieval; color naming; perceptual features;
DOI
10.1023/B:VISI.0000004833.39906.33
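Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification code
081104 [Pattern Recognition and Intelligent Systems]; 0812 [Computer Science and Technology]; 0835 [Software Engineering]; 1405 [Intelligent Science and Technology];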
Abstract
Abstract image semantics resists all forms of modeling, very much like any kind of intelligence does. However, in order to develop more satisfying image navigation systems, we need tools to construct a semantic bridge between the user and the database. In this paper we present an image indexing scheme and a query language, which allow the user to introduce cognitive dimension to the search. At an abstract level, this approach consists of: (1) learning the "natural language" that humans speak to communicate their semantic experience of images, (2) understanding the relationships between this language and objective measurable image attributes, and then (3) developing corresponding feature extraction schemes. More precisely, we have conducted a number of subjective experiments in which we asked human subjects to group images, and then explain verbally why they did so. The results of this study indicated that a part of the abstraction involved in image interpretation is often driven by semantic categories, which can be broken into more tangible semantic entities, i.e. objective semantic indicators. By analyzing our experimental data, we have identified some candidate semantic categories (i.e. portraits, people, crowds, cityscapes, landscapes, etc.) and their underlying semantic indicators (i.e. skin, sky, water, object, etc.). These experiments also helped us derive important low-level image descriptors, accounting for our perception of these indicators. We have then used these findings to develop an image feature extraction and indexing scheme. In particular, our feature set has been carefully designed to match the way humans communicate image meaning. This led us to the development of a "semantic-friendly" query language for browsing and searching diverse collections of images. We have implemented our approach into an Internet search engine, and tested it on a large number of images. The results we obtained are very promising.
Pages: 79-107
Number of pages: 29