AN APPROXIMATE STRING-MATCHING ALGORITHM

被引:14
作者
KIM, JY
SHAWETAYLOR, J
机构
[1] Department of Computer Science, Royal Holloway and Bedford New College, University of London, Egham
关键词
D O I
10.1016/0304-3975(92)90138-6
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
An approximate string-matching algorithm is described based on earlier attribute-matching algorithms. The algorithm involves building a trie from the text string which takes time O(N log2 N), for a text string of length N. Once this data structure has been built any number of approximate searches can be made for pattern strings of length m. The expected complexity analysis is given for the look-up phase of the algorithm based on certain regularity assumptions about the background language. The expected look-up time for each pattern is O(m log2 N). The ideas employed in the algorithm have been shown effective in practice before, but have not previously received any theoretical analysis.
引用
收藏
页码:107 / 117
页数:11
相关论文
共 13 条
[1]   RETRIEVAL OF MISSPELLED NAMES IN AN AIRLINES PASSENGER RECORD SYSTEM [J].
DAVIDSON, L .
COMMUNICATIONS OF THE ACM, 1962, 5 (03) :169-171
[2]  
DOWLING GR, 1980, ACM COMPUT SURV, V12, P381
[3]   STRING-PROCESSING ON THE HYPERCUBE [J].
IBARRA, OH ;
PONG, TC ;
SOHN, SM .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1990, 38 (01) :160-164
[4]  
Knuth, 1997, SORTING SEARCHING
[5]   FAST PARALLEL AND SERIAL APPROXIMATE STRING MATCHING [J].
LANDAU, GM ;
VISHKIN, U .
JOURNAL OF ALGORITHMS, 1989, 10 (02) :157-169
[6]  
MICHAEL AB, 1987, COMMUN ACM, P224
[7]   FAST APPROXIMATE STRING MATCHING [J].
OWOLABI, O ;
MCGREGOR, DR .
SOFTWARE-PRACTICE & EXPERIENCE, 1988, 18 (04) :387-393
[8]  
RISEMAN EM, 1974, IEEE T COMPUT, V5, P480
[9]  
Sankoff D, 1983, TIME WARPS STRING ED
[10]  
SHAWETAYLOR J, 1990, RHBNC CSDTR633 U LON