Human secretory signal peptide description by hidden Markov model and generation of a strong artificial signal peptide for secreted protein expression

Cited by: 107
Authors
Barash, S
Wang, W
Shi, YG
Affiliations
[1] Human Genome Sci Inc, Dept Preclin Discovery, Rockville, MD 20850 USA
[2] Human Genome Sci Inc, Dept Informat Technol, Rockville, MD 20850 USA
DOI
10.1016/S0006-291X(02)00566-1
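Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 [Biochemistry and Molecular Biology]; 081704 [Applied Chemistry];
Abstract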
A hidden Markov model (HMM) has been used to describe, predict, identify, and generate secretory signal peptide sequences. The relative strengths of artificial secretory signals emitted from the human signal peptide HMM (SP-HMM) correlate with their HMM bit scores as determined by their effectiveness to direct alkaline phosphatase secretion. The nature of the signal strength is in effect the closeness to the consensus. The HMM bit score of 8 is experimentally determined to be the threshold for discriminating signal sequences from non-secretory ones. An artificial SP-HMM generated signal sequence of the maximum model bit score (HMM +38) was selected as an ideal human signal sequence. This signal peptide (secrecon) directs strong protein secretion and expression. We further ranked the signal strengths of the signal peptides of the known human secretory proteins by SP-HMM bit scores. The applications of high-bit scoring HMM signals in recombinant protein production and protein engineering are discussed. © 2002 Elsevier Science (USA). All rights reserved.
Pages: 835-842
Page count: 8