A model of multimedia information retrieval

被引:68
作者
Meghini, C [1 ]
Sebastiani, F [1 ]
Straccia, U [1 ]
机构
[1] CNR, Ist Elaboraz Informaz, I-56124 Pisa, Italy
关键词
description logics; fuzzy logics; multimedia information retrieval;
D O I
10.1145/502102.502103
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Research on multimedia information retrieval (MIR) has recently witnessed a booming interest. A prominent feature of this research trend is its simultaneous but independent materialization within several fields of computer science. The resulting richness of paradigms, methods and systems may, on the long run, result in a fragmentation of efforts and slow down progress. The primary goal of this study is to promote an integration of methods and techniques for MIR by contributing a conceptual model that encompasses in a unified and coherent perspective the many efforts that are being produced Under the label of MIR. The model offers a retrieval capability that spans two media, text and images, but also several dimensions: form, content and structure. In this way, it reconciles similarity-based methods with semantics-based ones, providing the guidelines for the design of systems that are able to provide a generalized multimedia retrieval service, in which the existing forms of retrieval not only coexist, but can be combined in any desired manner. The model is formulated in terms of a fuzzy description logic, which plays a twofold role: (1) it directly models semantics-based retrieval, and (2) it offers an ideal framework for the integration of the multimedia and multidimensional aspects of retrieval mentioned above. The model also accounts for relevance feedback in both text and image retrieval, integrating known techniques for taking into account user judgments. The implementation of the model is addressed by presenting a decomposition technique that reduces query evaluation to the processing of simpler requests, each of which can be solved by means of widely known methods for text and image retrieval, and semantic processing. A prototype for multidimensional image retrieval is presented that shows this decomposition technique at work in a significant case.
引用
收藏
页码:909 / 970
页数:62
相关论文
共 101 条
[81]  
Smeaton A. F., 1996, SIGIR Forum, P174
[82]   Food allergy and intolerance: an international chemical safety perspective [J].
Smith, E .
ENVIRONMENTAL TOXICOLOGY AND PHARMACOLOGY, 1997, 4 (1-2) :3-7
[83]  
Smith J. R., 1996, Proceedings ACM Multimedia 96, P87, DOI 10.1145/244130.244151
[84]  
SMITH JR, 1994, IEEE IMAGE PROC, P407, DOI 10.1109/ICIP.1994.413817
[85]  
Sowa John F, 1984, Conceptual structures: Information processing in mind and machine
[86]   AUTOMATIC-INDEXING AND CONTENT-BASED RETRIEVAL OF CAPTIONED IMAGES [J].
SRIHARI, RK .
COMPUTER, 1995, 28 (09) :49-56
[87]  
STRACCI AU, 2000, SOFT COMPUTING INFOR, P332
[88]  
Straccia U, 1997, LECT NOTES ARTIF INT, V1227, P343, DOI 10.1007/BFb0027425
[89]  
Straccia U, 1997, INT JOINT CONF ARTIF, P128
[90]  
Straccia U, 1998, FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, P594