Secreted protein prediction system combining CJ-SPHMM, TMHMM, and PSORT

被引:180
作者
Chen, YJ
Yu, P
Luo, JC
Jiang, Y [1 ]
机构
[1] Peking Univ, Coll Life Sci, Natl Lab Prot Engn & Plant Genet Engn, Beijing 100871, Peoples R China
[2] Peking Univ, Ctr Bioinformat, Beijing 100871, Peoples R China
关键词
D O I
10.1007/s00335-003-2296-6
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
To increase the coverage of secreted protein prediction, we describe a combination strategy. Instead of using a single method, we combine Hidden Markov Model (HMM)-based methods CJ-SPHMM and TMHMM with PSORT in secreted protein prediction. CJ-SPHMM is an HMM-based signal peptide prediction method, while TMHMM is an HMM-based transmembrane (TM) protein prediction algorithm. With CJ-SPHMM and TMHMM, proteins with predicted signal peptide and without predicted TM regions are taken as putative secreted proteins. This HMM-based approach predicts secreted protein with Ac (Accuracy) at 0.82 and Cc (Correlation coefficient) at 0.75, which are similar to PSORT with Ac at 0.82 and Cc at 0.76. When we further complement the HMM-based method, i.e., CJ-SPHMM + TMHMM with PSORT in secreted protein prediction, the Ac value is increased to 0.86 and the Cc value is increased to 0.81. Taking this combination strategy to search putative secreted proteins from the International Protein Index (IPI) maintained at the European Bioinformatics Institute (FBI), we constructed a putative human secretome with 5235 proteins. The prediction system described here can also be applied to predicting secreted proteins from other vertebrate proteomes.
引用
收藏
页码:859 / 865
页数:7
相关论文
共 41 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[3]   The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :45-48
[4]   Human secretory signal peptide description by hidden Markov model and generation of a strong artificial signal peptide for secreted protein expression [J].
Barash, S ;
Wang, W ;
Shi, YG .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2002, 294 (04) :835-842
[5]  
BISHOP MJ, 1998, GUIDE HUMAN GENOME C
[6]  
Blobel G, 2000, CHEMBIOCHEM, V1, P87
[7]   Prediction of complete gene structures in human genomic DNA [J].
Burge, C ;
Karlin, S .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (01) :78-94
[8]   Evaluation of gene structure prediction programs [J].
Burset, M ;
Guigo, R .
GENOMICS, 1996, 34 (03) :353-367
[9]  
Chen Chien Peter, 2002, Appl Bioinformatics, V1, P21
[10]   Cytokines as new treatment targets in chronic heart failure [J].
Damås, JK ;
Gullestad, L ;
Aukrust, P .
CURRENT CONTROLLED TRIALS IN CARDIOVASCULAR MEDICINE, 2001, 2 (06) :271-277