Predicate rewriting for translating Boolean queries in a heterogeneous information system

Cited by: 31
Authors
Chang, CCK [1]
García-Molina, H
Paepcke, A
Affiliations
[1] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
Keywords
algorithms; experimentation; languages; measurement; Boolean queries; content-based retrieval; filtering; predicate rewriting; query subsumption; query translation
DOI
10.1145/297117.297120
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Searching over heterogeneous information sources is difficult in part because their query languages are nonuniform. Our approach is to allow users to compose Boolean queries in one rich front-end language. For each user query and target source, we transform the user query into a subsuming query that can be supported by the source but that may return extra documents. The results are then processed by a filter query to yield the correct final results. In this article we introduce the architecture and associated mechanism for query translation. In particular, we discuss techniques for rewriting predicates in Boolean queries into native subsuming forms, which is a basis for translating complex queries. In addition, we present experimental results for evaluating the cost of postfiltering. We also discuss the drawbacks of this approach and cases when it may not be effective. We have implemented prototype versions of these mechanisms and demonstrated them on heterogeneous Boolean systems.
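The subsume-then-filter idea described in the abstract can be illustrated with a small sketch. This is not the authors' algorithm; it is a toy example under stated assumptions: the front-end language is assumed to offer a proximity operator (`near`), the target source is assumed to support only a Boolean AND of words (`contains_all`), and the corpus `DOCS` and every function name are hypothetical.

```python
import re

# Hypothetical toy corpus standing in for the documents held by a remote source.
DOCS = {
    1: "query translation for heterogeneous information sources",
    2: "translation of poetry; later chapters discuss query logs",
    3: "boolean retrieval systems",
}


def tokens(text):
    """Split text into lowercase word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def near(a, b, k):
    """Assumed front-end predicate: a and b occur within k word positions."""
    def pred(text):
        toks = tokens(text)
        pos_a = [i for i, t in enumerate(toks) if t == a]
        pos_b = [i for i, t in enumerate(toks) if t == b]
        return any(abs(i - j) <= k for i in pos_a for j in pos_b)
    return pred


def contains_all(*words):
    """Assumed native predicate: the source only supports AND of words."""
    def pred(text):
        present = set(tokens(text))
        return all(w in present for w in words)
    return pred


def source_search(native_pred):
    """Simulate the source evaluating a native query over its documents."""
    return {doc_id for doc_id, text in DOCS.items() if native_pred(text)}


def translate_and_run(a, b, k):
    """Rewrite near(a, b, k) into a subsuming native query, then post-filter."""
    # Every document satisfying near(a, b, k) also contains both words,
    # so the AND query subsumes the original but may return extra documents.
    candidates = source_search(contains_all(a, b))
    # The filter query re-applies the original predicate to drop the extras.
    filt = near(a, b, k)
    final = {d for d in candidates if filt(DOCS[d])}
    return candidates, final


candidates, final = translate_and_run("query", "translation", 1)
print(sorted(candidates), sorted(final))
```

In this toy setting the source returns documents 1 and 2 for the subsuming query, and the filter keeps only document 1, where the two words are adjacent. The gap between the candidate set and the final set is the postfiltering cost that the abstract says the article evaluates experimentally.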
Pages: 1-39
Number of pages: 39