Faster algorithms for string matching with k mismatches

被引:95
作者
Amir, A
Lewenstein, M [1 ]
Porat, E
机构
[1] Bar Ilan Univ, Dept Math & Comp Sci, IL-52900 Ramat Gan, Israel
[2] Georgia Inst Technol, Coll Comp, Atlanta, GA 30332 USA
来源
JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC | 2004年 / 50卷 / 02期
基金
美国国家科学基金会;
关键词
design and analysis of algorithms; combinatorial algorithms on words; approximate string matching; Hamming distance;
D O I
10.1016/S0196-6774(03)00097-X
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The string matching with mismatches problem is that of finding the number of mismatches between a pattern P of length in and every length in substring of the text T. Currently, the fastest algorithms for this problem are the following. The Galil-Giancarlo algorithm finds all locations where the pattern has at most k errors (where k is part of the input) in time O (nk). The Abrahamson algorithm finds the number of mismatches at every location in time O (nrootm log m). We present an algorithm that is faster than both. Our algorithm finds all locations where the pattern has at most k errors in time O(nrootk log k). We also show an algorithm that solves the above problem in time O ((n + (nk(3))/m) log k). (C) 2003 Elsevier Inc. All rights reserved.
引用
收藏
页码:257 / 275
页数:19
相关论文
共 28 条
[1]   GENERALIZED STRING MATCHING [J].
ABRAHAMSON, K .
SIAM JOURNAL ON COMPUTING, 1987, 16 (06) :1039-1051
[2]   EFFICIENT STRING MATCHING - AID TO BIBLIOGRAPHIC SEARCH [J].
AHO, AV ;
CORASICK, MJ .
COMMUNICATIONS OF THE ACM, 1975, 18 (06) :333-340
[3]   EFFICIENT 2-DIMENSIONAL APPROXIMATE MATCHING OF HALF-RECTANGULAR FIGURES [J].
AMIR, A ;
FARACH, M .
INFORMATION AND COMPUTATION, 1995, 118 (01) :1-11
[4]   Pattern matching with swaps [J].
Amir, A ;
Aumann, Y ;
Landau, GM ;
Lewenstein, M ;
Lewenstein, N .
JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 2000, 37 (02) :247-266
[5]   Efficient special cases of Pattern Matching with Swaps [J].
Amir, A ;
Landau, GM ;
Lewenstein, M ;
Lewenstein, N .
INFORMATION PROCESSING LETTERS, 1998, 68 (03) :125-132
[6]  
Baker B. S., 1993, Proceedings of the Twenty-Fifth Annual ACM Symposium on the Theory of Computing, P71, DOI 10.1145/167088.167115
[7]  
Berkman O., 1989, Proceedings of the Twenty First Annual ACM Symposium on Theory of Computing, P309, DOI 10.1145/73007.73036
[8]   FAST STRING SEARCHING ALGORITHM [J].
BOYER, RS ;
MOORE, JS .
COMMUNICATIONS OF THE ACM, 1977, 20 (10) :762-772
[9]  
Fischer Michael J., 1974, SIAM AMS P, V7, P113
[10]  
Galil Z., 1986, SIGACT News, V17, P52, DOI 10.1145/8307.8309