String taxonomy using learning automata

被引:17
作者
Oommen, BJ
Croix, EVD
机构
[1] School of Computer Science, Carleton University, Ottawa
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 1997年 / 27卷 / 02期
基金
加拿大自然科学与工程研究理事会;
关键词
dictionary partitioning; string clustering; string taxonomy; syntactic pattern recognition;
D O I
10.1109/3477.558849
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A typical syntactic pattern recognition (PR) problem involves comparing a noisy string with every element of a dictionary, H. The problem of classification can be greatly simplified if the dictionary is partitioned into a set of subdictionaries. In this case, the classification can be hierarchical-the noisy string is first compared to a representative element of each subdictionary and the closest match within the subdictionary is subsequently located, Indeed, the entire problem of subdividing a set of strings into subsets where each subset contains ''similar'' strings has been referred to as the ''String Taxonomy Problem.'' To our knowledge there is no reported solution to this problem (see footnote 2). In this paper we present a learning-automaton based solution to string taxonomy, The solution utilizes the Object Migrating Automaton (OMA) the power of which in clustering objects and images [33], [35] has been reported, The power of the scheme for string taxonomy has been demonstrated using random strings and garbled versions of string representations of fragments of macromolecules.
引用
收藏
页码:354 / 365
页数:12
相关论文
共 46 条
[1]  
AHO AV, 1976, J ACM, V23, P1, DOI 10.1145/321921.321922
[2]   DECODING FOR CHANNELS WITH INSERTIONS, DELETIONS, AND SUBSTITUTIONS WITH APPLICATIONS TO SPEECH RECOGNITION [J].
BAHL, LR ;
JELINEK, F .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1975, 21 (04) :404-411
[3]  
DAYHOFF MO, 1971, ATLAS PROTEIN SEQUEN
[4]   A NOTE ON THE HEIGHT OF SUFFIX TREES [J].
DEVROYE, L ;
SZPANKOWSKI, W ;
RAIS, B .
SIAM JOURNAL ON COMPUTING, 1992, 21 (01) :48-53
[5]  
Dewey Godfrey, 1923, RELATIVE FREQUENCY E
[6]  
DUBES R, 1976, PATTERN RECOGN, V8, P235
[7]  
EHRENFEUCHT A, 1992, 3 INT S COMB PATT MA
[8]  
EVROYE L, 1986, NONUNIFORM RANDOM VA
[9]   APPROXIMATE STRING MATCHING [J].
HALL, PAV ;
DOWLING, GR .
COMPUTING SURVEYS, 1980, 12 (04) :381-402
[10]  
Hartigan J. A., 1975, CLUSTERING ALGORITHM