A COMPARISON OF SEVERAL SIMILARITY INDEXES USED IN THE CLASSIFICATION OF PROTEIN SEQUENCES - A MULTIVARIATE-ANALYSIS

被引:15
作者
LANDES, C
HENAUT, A
RISLER, JL
机构
[1] Centre de Génétique Moléculaire du CNRS
关键词
D O I
10.1093/nar/20.14.3631
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The present work describes an attempt to identify reliable criteria which could be used as distance indices between protein sequences. Seven different criteria have been tested: i and ii) the scores of the alignments as given by the BESTFIT and the FASTA programs; iii) the ratio parametrer, i.e. the BESTFIT score divided by the length of the aligned peptides; iv and v) the statistical significance (Z-scores) of the scores calculated by BESTFIT and FASTA, as obtained by comparison with shuffled sequences; vi) the Z-scores provided by the program RELATE which performs a segment-by-segment comparison of 2 sequences, and vii) an original distance index calculated by the program DOCMA from all the pairwise dotplots between the sequences. These 7 criteria have been tested against the aminoacid sequences of 39 globins and those of the 20 aminoacyl-tRNA synthetases from E. coli. The distances between the sequences were analyzed by the multivariate analysis techniques. The results show that the distances calculated from the scores of the pairwise alignments are not adequately sensitive. The Z-score from RELATE is not selective enough and too demanding in computer time. Three criteria gave a classification consistent with the known similarities between the sequences in the sets, namely the Z-scores from BESTFIT and FASTA and the multiple dotplot comparison distance index from DOCMA.
引用
收藏
页码:3631 / 3637
页数:7
相关论文
共 35 条
[1]   MULTIPLE SEQUENCE ALIGNMENT [J].
BACON, DJ ;
ANDERSON, WF .
JOURNAL OF MOLECULAR BIOLOGY, 1986, 191 (02) :153-161
[2]   A STRATEGY FOR THE RAPID MULTIPLE ALIGNMENT OF PROTEIN SEQUENCES - CONFIDENCE LEVELS FROM TERTIARY STRUCTURE COMPARISONS [J].
BARTON, GJ ;
STERNBERG, MJE .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 198 (02) :327-337
[3]  
Benzecri J.-P, 1982, ANAL DONNEES
[4]   3-DIMENSIONAL STRUCTURE, SPECIFICITY AND CATALYTIC MECHANISM OF RENIN [J].
BLUNDELL, T ;
SIBANDA, BL ;
PEARL, L .
NATURE, 1983, 304 (5923) :273-275
[5]  
BRETON R, 1990, J BIOL CHEM, V265, P18248
[6]   STRUCTURE OF TYROSYL TRANSFER-RNA SYNTHETASE REFINED AT 2.3-A RESOLUTION - INTERACTION OF THE ENZYME WITH THE TYROSYL ADENYLATE INTERMEDIATE [J].
BRICK, P ;
BHAT, TN ;
BLOW, DM .
JOURNAL OF MOLECULAR BIOLOGY, 1989, 208 (01) :83-98
[7]   A POSSIBLE 3-DIMENSIONAL STRUCTURE OF BOVINE ALPHA-LACTALBUMIN BASED ON THAT OF HENS EGG-WHITE LYSOZYME [J].
BROWNE, WJ ;
NORTH, ACT ;
PHILLIPS, DC .
JOURNAL OF MOLECULAR BIOLOGY, 1969, 42 (01) :65-&
[8]   CRYSTALLOGRAPHIC STUDY AT 2.5A RESOLUTION OF THE INTERACTION OF METHIONYL-TRANSFER-RNA SYNTHETASE FROM ESCHERICHIA-COLI WITH ATP [J].
BRUNIE, S ;
ZELWER, C ;
RISLER, JL .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 216 (02) :411-424
[9]   MULTIPLE SEQUENCE ALIGNMENT WITH HIERARCHICAL-CLUSTERING [J].
CORPET, F .
NUCLEIC ACIDS RESEARCH, 1988, 16 (22) :10881-10890
[10]  
DAYHOFF MO, 1983, METHOD ENZYMOL, V91, P524