AUTOMATICALLY INFERRED MARKOV NETWORK MODELS FOR CLASSIFICATION OF CHROMOSOMAL BAND PATTERN STRUCTURES

Cited by: 27
Authors
GRANUM, E [1]
THOMASON, MG [1]
Affiliation
[1] UNIV TENNESSEE, DEPT COMP SCI, KNOXVILLE, TN 37996
Source
CYTOMETRY | 1990 / Vol. 11 / Issue 01
Keywords
Chromosome band patterns; dynamic programming; grammatical inference; local sequential features; string representations; structural pattern recognition
DOI
10.1002/cyto.990110105
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
A structural pattern recognition approach to the analysis and classification of metaphase chromosome band patterns is presented. An operational method of representing band pattern profiles as sharp-edged idealized profiles is outlined. These profiles are nonlinearly scaled to a small, fixed number of “density” levels. Previous experience has shown that profiles of six levels are appropriate and that the differences between successive bands in these profiles are suitable for classification. String representations, which focus on the sequences of transitions between local band pattern levels, are derived from such “difference profiles.” A method of syntactic analysis of the band transition sequences by dynamic programming for optimal (maximal probability) string-to-network alignments is described. It develops automatic data-driven inference of band pattern models (Markov networks) per class and uses these models for classification. The method does not use centromere information but assumes the p-q orientation of the band pattern profiles to be known a priori. It is experimentally established that the method can build Markov network models which, when used for classification, show a recognition rate of about 92% on test data. The experiments used 200 samples (chromosome profiles) for each of the 22 autosome chromosome types and were designed to also investigate various classifier design problems. It is found that the use of a priori knowledge of Denver Group assignment improved classification by only 1 or 2%. A scheme for typewise normalization of the class relationship measures proves useful, partly through improvements in average results and partly through a more evenly distributed error pattern. The choice of reference for the p-q orientation of the band patterns is found to be unimportant, and timing results show that recent, efficient implementations can process one cell in less than 1 min on current standard hardware. A measure of divergence between data sets and Markov network models is shown to provide usable estimates of experimental classification performance. Copyright © 1990 Wiley-Liss, Inc.
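The pipeline summarized in the abstract (quantized profile, difference string, per-class probabilistic model, maximum-probability classification) can be illustrated with a deliberately simplified sketch. This is not the authors' method: the paper infers Markov networks through dynamic-programming string-to-network alignment, whereas the code below substitutes a plain first-order Markov chain over difference symbols with add-one smoothing. All function names, the start marker "^", the smoothing constant, and the toy profiles are assumptions made for illustration only.

```python
import math
from collections import defaultdict

# Differences between successive bands of a six-level profile lie in -5..5.
ALPHABET = list(range(-5, 6))
START = "^"  # hypothetical start-of-string marker


def difference_string(levels):
    """Turn a quantized band profile into its sequence of level differences."""
    return [b - a for a, b in zip(levels, levels[1:])]


def train_markov_chain(strings, alphabet=ALPHABET, smoothing=1.0):
    """Estimate first-order transition log-probabilities (add-one smoothed)."""
    contexts = [START] + list(alphabet)
    counts = {c: defaultdict(float) for c in contexts}
    for string in strings:
        prev = START
        for sym in string:
            counts[prev][sym] += 1.0
            prev = sym
    model = {}
    for prev in contexts:
        total = sum(counts[prev].values()) + smoothing * len(alphabet)
        model[prev] = {
            sym: math.log((counts[prev][sym] + smoothing) / total)
            for sym in alphabet
        }
    return model


def log_probability(model, string):
    """Log-probability of a difference string under one class model."""
    prev, total = START, 0.0
    for sym in string:
        total += model[prev][sym]
        prev = sym
    return total


def classify(models, string):
    """Pick the class whose model assigns the string the highest probability."""
    return max(models, key=lambda label: log_probability(models[label], string))


# Toy training data: two made-up classes with distinct band patterns.
models = {
    "A": train_markov_chain([difference_string([0, 3, 1, 4, 2])]),
    "B": train_markov_chain([difference_string([5, 4, 5, 4, 5])]),
}
predicted = classify(models, difference_string([1, 4, 2, 5]))
```

In this toy setup the unseen profile shares its transition pattern with class "A", so that model scores it higher. A faithful implementation would replace the fixed chain with an inferred network and score strings by optimal alignment, which tolerates inserted or missing bands.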
Pages: 26-39
Page count: 14