SIMULTANEOUS AND MULTIVARIATE ALIGNMENT OF PROTEIN SEQUENCES - CORRESPONDENCE BETWEEN PHYSICOCHEMICAL PROFILES AND STRUCTURALLY CONSERVED REGIONS (SCR)

被引:27
作者
DEPIEREUX, E
FEYTMANS, E
机构
[1] Department of Biology, Facultes Universitatres Notre Dame de la Paix, Namur, B-5000
来源
PROTEIN ENGINEERING | 1991年 / 4卷 / 06期
关键词
CLIQUES; MULTIPLE ALIGNMENT; MULTIVARIATE PROFILES; SEQUENCES CLUSTERING; STRUCTURE PREDICTION;
D O I
10.1093/protein/4.6.603
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A general protein sequence alignment methodology for detecting a priori unknown common structural and functional regions is described. The method proposed in this paper is based on two basic requirements for a meaningful alignment. First, each sequence or segment of a sequence is characterized by a multivariate physicochemical profile. Second, the alignment is performed by considering all the sequences simultaneously, and the algorithm detects those regions that form a set of similar profiles. In order to test the structural meaning of the alignment obtained from the sequences, quantitative comparisons are performed with structurally conserved regions (SCR) determined from the X-ray structures of three serine proteases. Results suggest that the limits of the SCR may be predicted from the similarities between the physicochemical profiles of the sequences. The procedures are not completely automated. The final step requires a visual screening of alternative pathways in order to determine an optimal alignment.
引用
收藏
页码:603 / 613
页数:11
相关论文
共 38 条
  • [1] ANDERBERG MR, 1973, CLUSTER ANAL APPLICA
  • [2] MULTIPLE SEQUENCE ALIGNMENT
    BACON, DJ
    ANDERSON, WF
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1986, 191 (02) : 153 - 161
  • [3] FLEXIBLE PROTEIN-SEQUENCE PATTERNS - A SENSITIVE METHOD TO DETECT WEAK STRUCTURAL SIMILARITIES
    BARTON, GJ
    STERNBERG, MJE
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 212 (02) : 389 - 402
  • [4] PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES
    BERNSTEIN, FC
    KOETZLE, TF
    WILLIAMS, GJB
    MEYER, EF
    BRICE, MD
    RODGERS, JR
    KENNARD, O
    SHIMANOUCHI, T
    TASUMI, M
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) : 535 - 542
  • [5] KNOWLEDGE-BASED PREDICTION OF PROTEIN STRUCTURES AND THE DESIGN OF NOVEL MOLECULES
    BLUNDELL, TL
    SIBANDA, BL
    STERNBERG, MJE
    THORNTON, JM
    [J]. NATURE, 1987, 326 (6111) : 347 - 352
  • [6] Creighton T. E., 1984, PROTEINS
  • [7] Dayhoff M. O., 1972, ATLAS PROTEIN SEQUEN, V5, P89
  • [8] DAYHOFF MO, 1983, METHOD ENZYMOL, V91, P524
  • [9] DRAPER NR, 1980, APPLIED REGRESSION A
  • [10] Everitt B., 1974, CLUSTER ANAL