COMPUTATION OF NORMALIZED EDIT DISTANCE AND APPLICATIONS

被引:177
作者
MARZAL, A
VIDAL, E
机构
[1] Departamento de Sistemas Informàticos y Computatión, Universidad Politécnica de Valencia, Valencia
关键词
EDITING; LEVENSHTEIN DISTANCE; NORMALIZED EDIT DISTANCE; OPTICAL CHARACTER RECOGNITION; PATTERN RECOGNITION; SPEECH RECOGNITION; SPELLING CORRECTION; STRING CORRECTION;
D O I
10.1109/34.232078
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given two strings X and Y over a finite alphabet, the normalized edit distance between X and Y, d(X, Y) is defined as the minimum of W(P)/L(P), where P is an editing path between X and Y, W(P) is the sum of the weights of the elementary edit operations of P, and L(P) is the number of these operations (length of P). In this paper, it is shown that in general, d(X, Y) cannot be computed by first obtaining the conventional (unnormalized) edit distance between X and Y and then normalizing this value by the length of the corresponding editing path. In order to compute normalized edit distances, a new algorithm that can be implemented to work in O(m . n2) time and O(n2) memory space is proposed, where m and n are the lengths of the strings under consideration, and m greater-than-or-equal-to n. Experiments in hand-written digit recognition are presented, revealing that the normalized edit distance consistently provides better results than both unnormalized or post-normalized classical edit distances.
引用
收藏
页码:926 / 932
页数:7
相关论文
共 14 条
[1]  
[Anonymous], [No title captured]
[2]  
CASACUBERTA F, 1987, RECONOMCIMIENTO AUTO
[3]  
Fu K.S., 2019, APPL PATTERN RECOGNI
[4]   APPROXIMATE STRING MATCHING [J].
HALL, PAV ;
DOWLING, GR .
COMPUTING SURVEYS, 1980, 12 (04) :381-402
[5]   LSI IMPLEMENTATION OF A PATTERN-MATCHING ALGORITHM FOR SPEECH RECOGNITION [J].
KITAZUME, Y ;
OHIRA, E ;
ENDO, T .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (01) :1-4
[6]  
MARZAL A, DSICII151991 U POL V
[7]   A FASTER ALGORITHM COMPUTING STRING EDIT DISTANCES [J].
MASEK, WJ ;
PATERSON, MS .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1980, 20 (01) :18-31
[8]  
RABINER L, 1981, IEEE T COMMUN, V29, P6251
[9]  
RULOT H, 1987, PATTERN RECOGN, P451
[10]  
Sankoff D, 1983, TIME WARPS STRING ED