FAMILY OF SIMILARITY MEASURES BETWEEN 2 STRINGS

Cited by: 13
Authors
FINDLER, NV [1]
VANLEEUWEN, J [1]
Affiliations
[1] STATE UNIV UTRECHT, DEPT COMP SCI, UTRECHT, NETHERLANDS
Keywords
Classification problems; pattern recognition; search processes; similarity measures between strings; substrings
DOI
10.1109/TPAMI.1979.4766885
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a class of similarity measures for quantitatively comparing two strings, that is, two linearly ordered sets of elements. The strings can be of different lengths, the elements come from a single alphabet, and an element may appear any number of times. The limiting values of each measure are 0, when two completely different strings are compared, and 1, when the two strings are identical. Applications of similarity measures are numerous in nonnumerical computations, such as in heuristic search processes in associative networks, in pattern recognition and classification, in game playing programs, and in music and text analysis. We offer a number of feasible measures from among which some are discarded on plausibility grounds. One can select the measure most adequate for one's needs on the basis of a few characteristic examples of strings compared and by considering the specific requirements of the application at hand. Copyright © 1979 by The Institute of Electrical and Electronics Engineers, Inc.
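The abstract states the boundary behavior of the measures (0 for completely different strings, 1 for identical strings, strings of different lengths allowed) but does not give their formulas. As an illustration only, the following Python sketch uses a stand-in measure that has the same boundary behavior: twice the length of the longest common subsequence divided by the sum of the two lengths. This is not one of the authors' measures; the function names `lcs_length` and `similarity` and the choice to score two empty strings as 1 are assumptions made for this example.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two sequences."""
    prev = [0] * (len(b) + 1)          # DP row for the previous element of a
    for x in a:
        curr = [0]                     # column 0: empty prefix of b
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def similarity(a, b):
    """Stand-in similarity in [0, 1]; NOT a measure from the paper.

    Returns 1.0 for identical strings and 0.0 when the strings share
    no elements. Two empty strings are treated as identical (assumption).
    """
    if not a and not b:
        return 1.0
    return 2.0 * lcs_length(a, b) / (len(a) + len(b))


if __name__ == "__main__":
    for s, t in [("abc", "abc"), ("abc", "xyz"), ("abcd", "abd"), ("ab", "ba")]:
        print(s, t, round(similarity(s, t), 3))
```

Because the inputs are treated as generic sequences, the same function also accepts lists or tuples of hashable elements, which matches the abstract's view of a string as a linearly ordered set of elements from a single alphabet.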
Pages: 116-118
Page count: 3