Two-dimensional filters for structured text

Cited by: 1
Authors
Kuikka, E
Salminen, A
Affiliations
[1] UNIV WATERLOO,DEPT COMP SCI,WATERLOO,ON N2L 3G1,CANADA
[2] UNIV JYVASKYLA,DEPT COMP SCI & INFORMAT SYST,SF-40351 JYVASKYLA,FINLAND
Keywords
RETRIEVAL; DATABASES; HYPERTEXT; ALGEBRA;
DOI
10.1016/S0306-4573(96)00040-4
Chinese Library Classification
TP [Automation and computer technology]
Discipline classification code
0812
Abstract
The paper introduces a method for defining filters for structured text. In the method, the text structure is originally defined by a grammar consisting of a set of productions. To describe the information interests, a two-dimensional template is first created interactively from the grammar to show the structure of a set of textual elements at a chosen level of detail. The template depicts the hierarchical structure of the elements and also indicates optionality, alternatives, and iteration in the structure. The template is then filled in with constraints and annotations. The constraints allow conditions to be placed on the content of parts, on the position of parts in an ordered set of parts, and on the number of parts satisfying a specified property. In a compound filter, several templates are connected by annotations. The method is intended as a theoretical framework for developing flexible and powerful graphical interfaces for filtering structured text. A prototype implementation is described. Copyright (C) 1997 Elsevier Science Ltd
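The three kinds of constraint named in the abstract (content, position, and number of parts) can be illustrated with a small sketch. This is not the authors' prototype or notation: the names `Element`, `TemplateNode`, `matches`, and the sample document are hypothetical, and the template is reduced to a plain tree, so the grammar, alternatives, and annotations are left out.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class Element:
    """A parsed text element: a name, its text, and ordered child elements."""
    name: str
    text: str = ""
    children: List["Element"] = field(default_factory=list)


@dataclass
class TemplateNode:
    """One node of a (hypothetical) filter template."""
    name: str
    # Content constraint: a predicate on the element's text.
    content: Optional[Callable[[str], bool]] = None
    # Position constraint: 1-based index among same-named siblings.
    position: Optional[int] = None
    # Count constraint: how many parts must match (0 makes the part optional).
    min_count: int = 1
    children: List["TemplateNode"] = field(default_factory=list)


def matches(node: TemplateNode, elem: Element) -> bool:
    """Return True if elem satisfies the template rooted at node."""
    if node.name != elem.name:
        return False
    if node.content is not None and not node.content(elem.text):
        return False
    for child in node.children:
        candidates = [e for e in elem.children if e.name == child.name]
        if child.position is not None:
            # Keep only the sibling at the requested position, if any.
            candidates = candidates[child.position - 1:child.position]
        hits = sum(1 for e in candidates if matches(child, e))
        if hits < child.min_count:
            return False
    return True


# Assumed sample document, for illustration only.
doc = Element("article", children=[
    Element("title", "Two-dimensional filters"),
    Element("author", "Kuikka"),
    Element("author", "Salminen"),
    Element("section", "grammar and productions"),
    Element("section", "templates from a grammar"),
])

# Second author is Salminen, and at least two sections mention "grammar".
second_author = TemplateNode("article", children=[
    TemplateNode("author", content=lambda t: "Salminen" in t, position=2),
    TemplateNode("section", content=lambda t: "grammar" in t, min_count=2),
])

# Same content condition, but required of the first author instead.
first_author = TemplateNode("article", children=[
    TemplateNode("author", content=lambda t: "Salminen" in t, position=1),
])

accepted = matches(second_author, doc)
rejected = matches(first_author, doc)
```

Here `second_author` accepts the sample document and `first_author` rejects it, because only the position condition differs. A compound filter in the sense of the abstract would additionally link several such templates, which this sketch does not attempt.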
Pages: 37-54
Page count: 18
Related papers
38 in total
[1] Aho Alfred V., 1972, The theory of parsing, translation, and compiling, V1
[2] [Anonymous], 1990, SGML HDB
[3] INFORMATION FILTERING AND INFORMATION-RETRIEVAL - 2 SIDES OF THE SAME COIN [J]. BELKIN, NJ; CROFT, WB. COMMUNICATIONS OF THE ACM, 1992, 35 (12): 29-38
[4] QUERY-PROCESSING IN A MULTI-MEDIA DOCUMENT SYSTEM [J]. BERTINO, E; RABITTI, F; GIBBS, S. ACM TRANSACTIONS ON OFFICE INFORMATION SYSTEMS, 1988, 6 (01): 1-41
[5] BLAKE GE, 1995, CS9525 U WAT
[6] AN ALGEBRA FOR HIERARCHICALLY ORGANIZED TEXT-DOMINATED DATABASES [J]. BURKOWSKI, FJ. INFORMATION PROCESSING & MANAGEMENT, 1992, 28 (03): 333-348
[7] BURKOWSKI FJ, 1991, INT C MULT INF SYST
[8] BYRD RJ, 1989, 81789 IBM RES DV TJ
[9] CHRISTOPHIDES V, 1994, 1994 ACM SIGMOD INT, V23, P313
[10] AN ALGEBRA FOR STRUCTURED TEXT SEARCH AND A FRAMEWORK FOR ITS IMPLEMENTATION [J]. CLARKE, CLA; CORMACK, GV; BURKOWSKI, FJ. COMPUTER JOURNAL, 1995, 38 (01): 43-56