WEIGHTING IN SEQUENCE SPACE - A COMPARISON OF METHODS IN TERMS OF GENERALIZED SEQUENCES

被引:61
作者
VINGRON, M [1 ]
SIBBALD, PR [1 ]
机构
[1] EUROPEAN MOLEC BIOL LAB, DATA LIB, W-6900 HEIDELBERG, GERMANY
关键词
ALIGNMENT; PROFILES; CORRECTING FOR CORRELATION; SEQUENCE WEIGHTING;
D O I
10.1073/pnas.90.19.8777
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Four methods for weighting aligned biological sequences have recently appeared that differ mathematically, philosophically, and in their results. Thus, while there is consensus about the need to weight sequences, the method to use is contentious. A geometric analysis based on a continuous sequence space is presented that provides a common framework in which to compare the methods. It is concluded that there are two ''best'' methods. When the sequences are known to be phylogenetically related and a tree can be generated without introducing excessive stress into the data, the method of Altschul et al. [Altschul, S. F., Carroll, R. J. & Lipman, D. J. (1989) J. Mol. Biol. 207, 647-653] is appropriate. When the sequences are not known to be phylogenetically related or a tree cannot be produced without unduly distorting the distances between the sequences, a modification of the method of Sibbald and Argos [Sibbald, P. R. & Argos, P. (1990) J. Mol. Biol. 216, 813-818] is preferable.
引用
收藏
页码:8777 / 8781
页数:5
相关论文
共 29 条
[1]   EQUAL ANIMALS [J].
ALTSCHUL, SF ;
LIPMAN, DJ .
NATURE, 1990, 348 (6301) :493-494
[2]   WEIGHTS FOR DATA RELATED BY A TREE [J].
ALTSCHUL, SF ;
CARROLL, RJ ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1989, 207 (04) :647-653
[3]  
BANDELT HJ, 1989, B MATH BIOL, V51, P133, DOI 10.1016/S0092-8240(89)80053-9
[4]  
Barthelemy JP, 1991, TREES PROXIMITY REPR
[5]   A STRATEGY FOR THE RAPID MULTIPLE ALIGNMENT OF PROTEIN SEQUENCES - CONFIDENCE LEVELS FROM TERTIARY STRUCTURE COMPARISONS [J].
BARTON, GJ ;
STERNBERG, MJE .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 198 (02) :327-337
[6]  
BICKEL PJ, 1977, MATH STATISTICS
[7]   MULTIPLE SEQUENCE ALIGNMENT WITH HIERARCHICAL-CLUSTERING [J].
CORPET, F .
NUCLEIC ACIDS RESEARCH, 1988, 16 (22) :10881-10890
[8]  
DRESS A, 1990, TREES HIERARCHICAL S
[9]   STATISTICAL GEOMETRY IN SEQUENCE SPACE - A METHOD OF QUANTITATIVE COMPARATIVE SEQUENCE-ANALYSIS [J].
EIGEN, M ;
WINKLEROSWATITSCH, R ;
DRESS, A .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1988, 85 (16) :5913-5917
[10]  
FELSENSTEIN J, 1973, AM J HUM GENET, V25, P471