Non-intertwined binary patterns of hydrophobic/nonhydrophobic amino acids are considerably better markers of regular secondary structures than nonconstrained patterns

Cited by: 22
Authors
Hennetin, J
Le Tuan, K
Canard, L
Colloc'h, N
Mornon, JP
Callebaut, I
Affiliations
[1] Univ Paris 06, CNRS UMR 7590, LMCP, F-75252 Paris 05, France
[2] Univ Paris 07, CNRS UMR 7590, LMCP, F-75252 Paris 05, France
Keywords
hydrophobic cluster analysis; secondary structure; randomness
DOI
10.1002/prot.10355
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Patterns of hydrophobic and hydrophilic residues (binary patterns) play an important role in protein architecture and can be roughly categorized into two classes according to their preferential participation in alpha-helices or beta-strands. However, a single binary pattern can be embedded in different longer patterns carrying opposite structural information, and thus cannot be as informative as expected. Here, we consider conditional binary patterns, or hydrophobic clusters, whose existence is conditioned by the presence of a minimum number of nonhydrophobic residues, called the connectivity distance, that separate two hydrophobic amino acids assumed to belong to two distinct patterns. Conditional binary patterns differ from simple ones in that they are not intertwined, i.e., they cannot include or be included in other conditional patterns; they therefore carry much more differentiated information and, in particular, are dramatically better correlated with regular secondary structures (especially beta structures). The distribution of these nonintertwined binary patterns in natural proteins was assessed relative to randomness, revealing the structural bricks that are favored and disfavored by evolutionary selection. Several connectivity distances and several hydrophobic alphabets were tested, showing the clear superiority of a connectivity distance of 4, which mimics the minimum current length of loops in globular domains, and of the VILFMYW alphabet, selected from structural data (secondary structure propensity and Voronoi tessellation), in highlighting fundamental properties of protein folds. © 2003 Wiley-Liss, Inc.
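The definition of a conditional binary pattern given in the abstract can be illustrated with a minimal sketch. It assumes that two hydrophobic residues fall into distinct clusters as soon as at least `connectivity_distance` nonhydrophobic residues separate them, and that a cluster runs from its first to its last hydrophobic residue. The function name `hydrophobic_clusters` and its return format are hypothetical and are not taken from the paper; any further rule the authors may apply (for example, special handling of particular residues) is not modeled here.

```python
# Hydrophobic alphabet reported in the abstract as performing best.
HYDROPHOBIC = frozenset("VILFMYW")


def hydrophobic_clusters(sequence, connectivity_distance=4, alphabet=HYDROPHOBIC):
    """Split a protein sequence into nonintertwined binary patterns.

    Returns a list of (start_index, pattern) tuples, where pattern is a
    string of '1' (hydrophobic) and '0' (nonhydrophobic) that begins and
    ends with '1'. This is an illustrative sketch, not the authors' code.
    """
    seq = sequence.upper()
    binary = ["1" if aa in alphabet else "0" for aa in seq]

    clusters = []
    start = None  # index of the first hydrophobic residue of the open cluster
    last = None   # index of the most recent hydrophobic residue seen

    for i, bit in enumerate(binary):
        if bit == "0":
            continue
        if last is not None and i - last - 1 >= connectivity_distance:
            # Enough nonhydrophobic residues in between: close the cluster.
            clusters.append((start, "".join(binary[start:last + 1])))
            start = i
        elif start is None:
            start = i
        last = i

    if start is not None:
        clusters.append((start, "".join(binary[start:last + 1])))
    return clusters


if __name__ == "__main__":
    # Toy sequence chosen for illustration only.
    print(hydrophobic_clusters("AVLGAIKSGDEVFW"))
```

Because a cluster is closed only when the gap reaches the connectivity distance, no cluster produced this way can contain, or be contained in, another one, which is the nonintertwined property the abstract describes. Lowering `connectivity_distance` splits the same sequence into more, shorter patterns.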
Pages: 236-244
Number of pages: 9