Predicting protein localization in budding yeast

被引:99
作者
Chou, KC [1 ]
Cai, YD
机构
[1] Gordon Life Sci Inst, San Diego, CA 92130 USA
[2] Shanghai Jiao Tong Univ, Shanghai 200030, Peoples R China
[3] Tianjin Inst Bioinformat & Drug Discovery, Tianjin, Peoples R China
[4] UMIST, Biomol Sci Dept, Manchester M60 1QD, Lancs, England
关键词
D O I
10.1093/bioinformatics/bti104
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Most of the existing methods in predicting protein subcellular location were used to deal with the cases limited within the scope from two to five localizations, and only a few of them can be effectively extended to cover the cases of 12-14 localizations. This is because the more the locations involved are, the poorer the success rate would be. Besides, some proteins may occur in several different subcellular locations, i.e. bear the feature of 'multiplex locations'. So far there is no method that can be used to effectively treat the difficult multiplex location problem. The present study was initiated in an attempt to address (1) how to efficiently identify the localization of a query protein among many possible subcellular locations, and (2) how to deal with the case of multiplex locations. Results: By hybridizing gene ontology, functional domain and pseudo amino acid composition approaches, a new method has been developed that can be used to predict subcellular localization of proteins with multiplex location feature. A global analysis of the proteins in budding yeast classified into 22 locations was performed by jack-knife cross-validation with the new method. The overall success identification rate thus obtained is 70%. In contrast to this, the corresponding rates obtained by some other existing methods were only 13-14%, indicating that the new method is very powerful and promising. Furthermore, predictions were made for the four proteins whose localizations could not be determined by experiments, as well as for the 236 proteins whose localizations in budding yeast were ambiguous according to experimental observations. However, according to our predicted results, many of these 'ambiguous proteins' were found to have the same score and ranking for several different subcellular locations, implying that they may simultaneously exist, or move around, in these locations. This finding is intriguing because it reflects the dynamic feature of these proteins in a cell that may be associated with some special biological functions.
引用
收藏
页码:944 / 950
页数:7
相关论文
共 35 条
  • [1] ALBERTS B, 1994, MOL BIOL CELL, pCH1
  • [2] The InterPro database, an integrated documentation resource for protein families, domains and functional sites
    Apweiler, R
    Attwood, TK
    Bairoch, A
    Bateman, A
    Birney, E
    Biswas, M
    Bucher, P
    Cerutti, T
    Corpet, F
    Croning, MDR
    Durbin, R
    Falquet, L
    Fleischmann, W
    Gouzy, J
    Hermjakob, H
    Hulo, N
    Jonassen, I
    Kahn, D
    Kanapin, A
    Karavidopoulou, Y
    Lopez, R
    Marx, B
    Mulder, NJ
    Oinn, TM
    Pagni, M
    Servant, F
    Sigrist, CJA
    Zdobnov, EM
    [J]. NUCLEIC ACIDS RESEARCH, 2001, 29 (01) : 37 - 40
  • [3] Gene Ontology: tool for the unification of biology
    Ashburner, M
    Ball, CA
    Blake, JA
    Botstein, D
    Butler, H
    Cherry, JM
    Davis, AP
    Dolinski, K
    Dwight, SS
    Eppig, JT
    Harris, MA
    Hill, DP
    Issel-Tarver, L
    Kasarskis, A
    Lewis, S
    Matese, JC
    Richardson, JE
    Ringwald, M
    Rubin, GM
    Sherlock, G
    [J]. NATURE GENETICS, 2000, 25 (01) : 25 - 29
  • [4] Support vector machines for predicting membrane protein types by using functional domain composition
    Cai, YD
    Zhou, GP
    Chou, KC
    [J]. BIOPHYSICAL JOURNAL, 2003, 84 (05) : 3257 - 3263
  • [5] Relation between amino acid composition and cellular location of proteins
    Cedano, J
    Aloy, P
    PerezPons, JA
    Querol, E
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1997, 266 (03) : 594 - 600
  • [6] A JOINT PREDICTION OF THE FOLDING TYPES OF 1490 HUMAN PROTEINS FROM THEIR GENETIC CODONS
    CHOU, JJW
    ZHANG, CT
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 1993, 161 (02) : 251 - 262
  • [7] Structural bioinformatics and its impact to biomedical science
    Chou, KC
    [J]. CURRENT MEDICINAL CHEMISTRY, 2004, 11 (16) : 2105 - 2134
  • [8] Prediction and classification of protein subcellular location-sequence-order effect and pseudo amino acid composition. (vol 90, pg1250, 2003)
    Chou, KC
    Cai, YD
    [J]. JOURNAL OF CELLULAR BIOCHEMISTRY, 2004, 91 (05) : 1085 - 1085
  • [9] Prediction and classification of protein subcellular location - Sequence-order effect and pseudo amino acid composition
    Chou, KC
    Cai, YD
    [J]. JOURNAL OF CELLULAR BIOCHEMISTRY, 2003, 90 (06) : 1250 - 1260
  • [10] A new hybrid approach to predict subcellular localization of proteins by incorporating gene ontology
    Chou, KC
    Cai, YD
    [J]. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2003, 311 (03) : 743 - 747