FAST PARALLEL AND SERIAL MULTIDIMENSIONAL APPROXIMATE ARRAY MATCHING

被引:38
作者
AMIR, A
LANDAU, GM
机构
[1] UNIV MARYLAND,INST ADV COMP STUDIES,COLLEGE PK,MD 20742
[2] POLYTECH INST NEW YORK,DEPT COMP SCI,BROOKLYN,NY 11201
基金
美国国家科学基金会;
关键词
D O I
10.1016/0304-3975(91)90318-V
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Consider the multidimensional array matching problem, where differences between characters of the pattern and characters of the text are permitted. A difference may be due to a mismatch between a text and pattern character, superfluous text character or superfluous pattern character. Given a d-dimensional array of size n(d) (text) and a d-dimensional array of size m(d) (pattern) we present the following algorithms: For a given k, find all occurrences of the pattern in the text with at most k differences. Our serial algorithm runs in time O(n(d)(dk + k2)) and the parallel algorithm runs in time O(d((d)log n + k) + k2) using n(d) processors. If superfluous characters are not allowed and the only permitted errors are mismatches, we solve the problem serially in time O(n(d)dk) and in parallel in time O(d((d)log n + k)) using n(d) processors. We present an alternate algorithm for the mismatches problem which runs serially in time O(2(d)n(d) log2 m) and in parallel in time O(d log n) using n(d) processors. This algorithm is more efficient for large k. We also give an efficient solution to the close-match problem. Here a mismatch weight function f:SIGMA x SIGMA --> [0, 1] is assigned. The weight function gives weight to the mismatches, some mismatches being worse than others. We present a serial algorithm for finding all appearances of the pattern in the text with a bounded total error in time O(2(d)n(d) log2 m). Our parallel algorithm is again of time complexity O(d log n) using n(d) processors.
引用
收藏
页码:97 / 115
页数:19
相关论文
共 31 条
[1]  
Aho Alfred V., 1974, DESIGN ANAL COMPUTER
[2]   EFFICIENT STRING MATCHING - AID TO BIBLIOGRAPHIC SEARCH [J].
AHO, AV ;
CORASICK, MJ .
COMMUNICATIONS OF THE ACM, 1975, 18 (06) :333-340
[3]  
AMIR A, 1987, UMIACSTR8741 U MAR D
[4]   PARALLEL CONSTRUCTION OF A SUFFIX TREE WITH APPLICATIONS [J].
APOSTOLICO, A ;
ILIOPOULOS, C ;
LANDAU, GM ;
SCHIEBER, B ;
VISHKIN, U .
ALGORITHMICA, 1988, 3 (03) :347-365
[6]  
Bird R. S., 1977, Information Processing Letters, V6, P168, DOI 10.1016/0020-0190(77)90017-5
[7]   FAST STRING SEARCHING ALGORITHM [J].
BOYER, RS ;
MOORE, JS .
COMMUNICATIONS OF THE ACM, 1977, 20 (10) :762-772
[8]  
Fischer M.J., 1974, SIAM AMS P, VVII, P113
[9]  
Galil Z., 1986, SIGACT News, V17, P52, DOI 10.1145/8307.8309
[10]   PARALLEL STRING MATCHING WITH K-MISMATCHES [J].
GALIL, Z ;
GIANCARLO, R .
THEORETICAL COMPUTER SCIENCE, 1987, 51 (03) :341-348