Prediction of signal peptides in archaea

被引:69
作者
Bagos, P. G. [1 ,2 ]
Tsirigos, K. D. [2 ]
Plessas, S. K. [2 ]
Liakopoulos, T. D. [2 ]
Hamodrakas, S. J. [2 ]
机构
[1] Univ Cent Greece, Dept Informat Applicat Biomed, Lamia 35100, Greece
[2] Univ Athens, Fac Biol, Dept Cell Biol & Biophys, Athens 15701, Greece
关键词
HYPERTHERMOPHILE AEROPYRUM-PERNIX; COMBINED TRANSMEMBRANE TOPOLOGY; ARGININE TRANSLOCATION PATHWAY; SULFOLOBUS-SOLFATARICUS P2; CELL-SURFACE GLYCOPROTEIN; COMPLETE GENOME SEQUENCE; SUBTILISIN-LIKE PROTEASE; PYROCOCCUS-FURIOSUS; ALPHA-AMYLASE; BACTERIAL LIPOPROTEINS;
D O I
10.1093/protein/gzn064
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Computational prediction of signal peptides (SPs) and their cleavage sites is of great importance in computational biology; however, currently there is no available method capable of predicting reliably the SPs of archaea, due to the limited amount of experimentally verified proteins with SPs. We performed an extensive literature search in order to identify archaeal proteins having experimentally verified SP and managed to find 69 such proteins, the largest number ever reported. A detailed analysis of these sequences revealed some unique features of the SPs of archaea, such as the unique amino acid composition of the hydrophobic region with a higher than expected occurrence of isoleucine, and a cleavage site resembling more the sequences of gram-positives with almost equal amounts of alanine and valine at the position-3 before the cleavage site and a dominant alanine at position-1, followed in abundance by serine and glycine. Using these proteins as a training set, we trained a hidden Markov model method that predicts the presence of the SPs and their cleavage sites and also discriminates such proteins from cytoplasmic and transmembrane ones. The method performs satisfactorily, yielding a 35-fold cross-validation procedure, a sensitivity of 100% and specificity 98.41% with the Matthews' correlation coefficient being equal to 0.964. This particular method is currently the only available method for the prediction of secretory SPs in archaea, and performs consistently and significantly better compared with other available predictors that were trained on sequences of eukaryotic or bacterial origin. Searching 48 completely sequenced archaeal genomes we identified 9437 putative SPs. The method, PRED-SIGNAL, and the results are freely available for academic users at http://bioinformatics.biol.uoa.gr/PRED-SIGNAL/ and we anticipate that it will be a valuable tool for the computational analysis of archaeal genomes.
引用
收藏
页码:27 / 35
页数:9
相关论文
共 100 条
[71]   Effect of organic solvents on the activity and stability of an extracellular protease secreted by the haloalkaliphilic archaeon Natrialba magadii [J].
Ruiz, Diego M. ;
De Castro, Rosana E. .
JOURNAL OF INDUSTRIAL MICROBIOLOGY & BIOTECHNOLOGY, 2007, 34 (02) :111-115
[72]   An extremely heat-stable extracellular proteinase (aeropyrolysin) from the hyperthermophilic archaeon Aeropyrum pernix K1 [J].
Sako, Y ;
Croocker, PC ;
Ishida, Y .
FEBS LETTERS, 1997, 415 (03) :329-334
[73]  
SANKARAN K, 1995, METHOD ENZYMOL, V250, P683
[74]  
SANKARAN K, 1995, METHOD ENZYMOL, V248, P169
[75]  
SANKARAN K, 1994, J BIOL CHEM, V269, P19701
[76]   Proteomic and computational analysis of secreted proteins with type 1 signal peptides from the antarctic archaeon Methanococcoides burtonii [J].
Saunders, Neil F. W. ;
Ng, Charmaine ;
Raftery, Mark ;
Guilhaus, Michael ;
Goodchild, Amber ;
Cavicchioli, Ricardo .
JOURNAL OF PROTEOME RESEARCH, 2006, 5 (09) :2457-2464
[77]   SEQUENCE LOGOS - A NEW WAY TO DISPLAY CONSENSUS SEQUENCES [J].
SCHNEIDER, TD ;
STEPHENS, RM .
NUCLEIC ACIDS RESEARCH, 1990, 18 (20) :6097-6100
[78]   Novel thermoactive glucoamylases from the thermoacidophilic Archaea Thermoplasma acidophilum, Picrophilus torridus and Picrophilus oshimae [J].
Serour, E ;
Antranikian, G .
ANTONIE VAN LEEUWENHOEK INTERNATIONAL JOURNAL OF GENERAL AND MOLECULAR MICROBIOLOGY, 2002, 81 (1-4) :73-83
[79]   Lipoprotein computational prediction in spirochaetal genomes 10.1099/mic.0.28317-0 [J].
Setubal, JC ;
Reis, M ;
Matsunaga, J ;
Haake, DA .
MICROBIOLOGY-SGM, 2006, 152 :113-121
[80]   The complete genome of the crenarchaeon Sulfolobus solfataricus P2 [J].
She, Q ;
Singh, RK ;
Confalonieri, F ;
Zivanovic, Y ;
Allard, G ;
Awayez, MJ ;
Chan-Weiher, CCY ;
Clausen, IG ;
Curtis, BA ;
De Moors, A ;
Erauso, G ;
Fletcher, C ;
Gordon, PMK ;
Heikamp-de Jong, I ;
Jeffries, AC ;
Kozera, CJ ;
Medina, N ;
Peng, X ;
Thi-Ngoc, HP ;
Redder, P ;
Schenk, ME ;
Theriault, C ;
Tolstrup, N ;
Charlebois, RL ;
Doolittle, WF ;
Duguet, M ;
Gaasterland, T ;
Garrett, RA ;
Ragan, MA ;
Sensen, CW ;
Van der Oost, J .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (14) :7835-7840