A Hidden Markov Model method, capable of predicting and discriminating β-barrel outer membrane proteins -: art. no. 29

被引:118
作者
Bagos, PG [1 ]
Liakopoulos, TD [1 ]
Spyropoulos, IC [1 ]
Hamodrakas, SJ [1 ]
机构
[1] Univ Athens, Fac Biol, Dept Cell Biol & Biophys, Athens 15701, Greece
关键词
D O I
10.1186/1471-2105-5-29
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Integral membrane proteins constitute about 20-30% of all proteins in the fully sequenced genomes. They come in two structural classes, the alpha-helical and the beta-barrel membrane proteins, demonstrating different physicochemical characteristics, structure and localization. While transmembrane segment prediction for the alpha-helical integral membrane proteins appears to be an easy task nowadays, the same is much more difficult for the beta-barrel membrane proteins. We developed a method, based on a Hidden Markov Model, capable of predicting the transmembrane beta-strands of the outer membrane proteins of gram-negative bacteria, and discriminating those from water-soluble proteins in large datasets. The model is trained in a discriminative manner, aiming at maximizing the probability of correct predictions rather than the likelihood of the sequences. Results: The training has been performed on a non-redundant database of 14 outer membrane proteins with structures known at atomic resolution; it has been tested with a jacknife procedure, yielding a per residue accuracy of 84.2% and a correlation coefficient of 0.72, whereas for the self-consistency test the per residue accuracy was 88.1% and the correlation coefficient 0.824. The total number of correctly predicted topologies is 10 out of 14 in the self-consistency test, and 9 out of 14 in the jacknife. Furthermore, the model is capable of discriminating outer membrane from water-soluble proteins in large-scale applications, with a success rate of 88.8% and 89.2% for the correct classification of outer membrane and water-soluble proteins respectively, the highest rates obtained in the literature. That test has been performed independently on a set of known outer membrane proteins with low sequence identity with each other and also with the proteins of the training set. Conclusion: Based on the above, we developed a strategy, that enabled us to screen the entire proteome of E. coli for outer membrane proteins. The results were satisfactory, thus the method presented here appears to be suitable for screening entire proteomes for the discovery of novel outer membrane proteins. A web interface available for non-commercial users is located at: http://bioinformatics.biol.uoa.gr/PRED-TMBB, and it is the only freely available HMM-based predictor for beta-barrel outer membrane protein topology.
引用
收藏
页数:13
相关论文
共 36 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] ANDERS K, 1994, P 12 IAPR INT C PATT, P140
  • [3] Assessing the accuracy of prediction algorithms for classification: an overview
    Baldi, P
    Brunak, S
    Chauvin, Y
    Andersen, CAF
    Nielsen, H
    [J]. BIOINFORMATICS, 2000, 16 (05) : 412 - 424
  • [4] Baum L.E., 1972, Inequalities III: Proceedings of the Third Symposium on Inequalities, page, V3, P1
  • [5] The Protein Data Bank
    Berman, HM
    Battistuz, T
    Bhat, TN
    Bluhm, WF
    Bourne, PE
    Burkhardt, K
    Iype, L
    Jain, S
    Fagan, P
    Marvin, J
    Padilla, D
    Ravichandran, V
    Schneider, B
    Thanki, N
    Weissig, H
    Westbrook, JD
    Zardecki, C
    [J]. ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2002, 58 : 899 - 907
  • [6] Substrate-induced transmembrane signaling in the cobalamin transporter BtuB
    Chimento, DP
    Mohanty, AK
    Kadner, RJ
    Wiener, MC
    [J]. NATURE STRUCTURAL BIOLOGY, 2003, 10 (05) : 394 - 401
  • [7] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [8] Prediction by a neural network of outer membrane β-strand protein topology
    Diederichs, K
    Freigang, J
    Umhau, S
    Zeth, K
    Breed, J
    [J]. PROTEIN SCIENCE, 1998, 7 (11) : 2413 - 2420
  • [9] Durbin R., 1998, Biological sequence analysis: Probabilistic models of proteins and nucleic acids
  • [10] Profile hidden Markov models
    Eddy, SR
    [J]. BIOINFORMATICS, 1998, 14 (09) : 755 - 763