Support vector machines for prediction of protein signal sequences and their cleavage sites

Cited by: 86
Authors
Cai, YD
Lin, SL
Chou, KC
Affiliations
[1] Chinese Acad Sci, Shanghai Res Ctr Biotechnol, Shanghai 200033, Peoples R China
[2] Wyeth Ayerst Res, Pearl River, NY 10965 USA
[3] Pharmacia & Upjohn Inc, Upjohn Labs, Comp Aided Drug Discovery, Kalamazoo, MI 49001 USA
Keywords
support vector machine; protein signal sequence; jackknife test; bench-mark window;
DOI
10.1016/S0196-9781(02)00289-9
Chinese Library Classification (CLC) codes
Q5 [Biochemistry]; Q7 [Molecular biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
Given a nascent protein sequence, how can one predict its signal peptide or "Zipcode" sequence? This is an important problem for scientists who use signal peptides as a vehicle to find new drugs or to reprogram cells for gene therapy (see, e.g., [7] K.C. Chou, Current Protein and Peptide Science 2002;3:615-22). In this paper, support vector machines (SVMs), a new machine learning method, are applied to this problem. The overall rate of correct prediction for 1939 secretory proteins and 1440 nonsecretory proteins was over 91%. It has not escaped our attention that the new method may also serve as a useful tool for further investigating many unclear details regarding the molecular mechanism of the ZIP code protein-sorting system in cells. (C) 2002 Elsevier Science Inc. All rights reserved.
Pages: 159-161
Number of pages: 3
Related papers
21 in total
  • [1] [Anonymous], 1999, INT C MACH LEARN ICM
  • [2] Is it a paradox or misinterpretation?
    Cai, YD
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2001, 43 (03) : 336 - 338
  • [3] Prediction of protein signal sequences
    Chou, KC
    [J]. CURRENT PROTEIN & PEPTIDE SCIENCE, 2002, 3 (06) : 615 - 622
  • [4] Prediction of signal peptides using scaled window
    Chou, KC
    [J]. PEPTIDES, 2001, 22 (12) : 1973 - 1979
  • [5] Protein subcellular location prediction
    Chou, KC
    Elrod, DW
    [J]. PROTEIN ENGINEERING, 1999, 12 (02) : 107 - 118
  • [6] Prediction of protein structural classes
    Chou, KC
    Zhang, CT
    [J]. CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) : 275 - 349
  • [7] Prediction of protein cellular attributes using pseudo-amino acid composition (vol 43, pg 246, 2001)
    Chou, KC
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2001, 44 (01) : 60 - 60
  • [8] Chou KC, 2001, PROTEINS, V42, P136, DOI 10.1002/1097-0134(20010101)42:1<136::AID-PROT130>3.0.CO;2-F
  • [10] Using subsite coupling to predict signal peptides
    Chou, KC
    [J]. PROTEIN ENGINEERING, 2001, 14 (02) : 75 - 79