BETAWRAP:: Successful prediction of parallel β-helices from primary sequence reveals an association with many microbial pathogens

被引:94
作者
Bradley, P
Cowen, L
Menke, M
King, J
Berger, B
机构
[1] MIT, Dept Math, Cambridge, MA 02139 USA
[2] MIT, Comp Sci Lab, Cambridge, MA 02139 USA
[3] MIT, Dept Biol, Cambridge, MA 02139 USA
[4] Tufts Univ, Dept Elect Engn & Comp Sci, Medford, MA 02155 USA
关键词
D O I
10.1073/pnas.251267298
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The amino acid sequence rules that specify beta -sheet structure in proteins remain obscure. A subclass of beta -sheet proteins, parallel beta -helices, represent a processive folding of the chain into an elongated topologically simpler fold than globular beta -sheets. In this paper, we present a computational approach that predicts the right-handed parallel beta -helix supersecondary structural motif in primary amino acid sequences by using beta -strand interactions learned from non-beta -helix structures. A program called BETAWRAP (http://theory.Ics.mit.edu/betawrap) implements this method and recognizes each of the seven known parallel beta -helix families, when trained on the known parallel beta -helices from outside that family. BETAWRAP identifies 2,448 sequences among 595,890 screened from the National Center for Biotechnology Information (NCBI; http://www.ncbi.nlm.nih.gov/) nonredundant protein database as likely parallel beta -helices. It identifies surprisingly many bacterial and fungal protein sequences that play a role in human infectious disease; these include toxins, virulence factors, adhesins, and surface proteins of Chlamydia, Helicobacteria, Bordetella, Leishmania, Borrelia, Rickettsia, Neisseria, and Bacillus anthracis. Also unexpected was the rarity of the parallel beta -helix fold and its predicted sequences among higher eukaryotes. The computational method introduced here can be called a three-dimensional dynamic profile method because it generates interstrand pairwise correlations from a processive sequence wrap. Such methods may be applicable to recognizing other beta structures for which strand topology and profiles of residue accessibility are well conserved.
引用
收藏
页码:14819 / 14824
页数:6
相关论文
共 29 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
  • [3] Berger B, 1995, J Comput Biol, V2, P125, DOI 10.1089/cmb.1995.2.125
  • [4] PREDICTING COILED COILS BY USE SF PAIRWISE RESIDUE CORRELATIONS
    BERGER, B
    WILSON, DB
    WOLF, E
    TONCHEV, T
    MILLA, M
    KIM, PS
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1995, 92 (18) : 8259 - 8263
  • [5] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [6] Profile hidden Markov models
    Eddy, SR
    [J]. BIOINFORMATICS, 1998, 14 (09) : 755 - 763
  • [7] ENGELMAN D, 1999, ANNU REV BIOPHYS BIO, V15, P321
  • [8] Knowledge-based protein secondary structure assignment
    Frishman, D
    Argos, P
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 23 (04): : 566 - 579
  • [9] Garnier J, 1996, METHOD ENZYMOL, V266, P540
  • [10] β-helix structure and ice-binding properties of a hyperactive antifreeze protein from an insect
    Graether, SP
    Kuiper, MJ
    Gagné, SM
    Walker, VK
    Jia, ZC
    Sykes, BD
    Davies, PL
    [J]. NATURE, 2000, 406 (6793) : 325 - 328