Incorporating Support Vector Machine for Identifying Protein Tyrosine Sulfation Sites

被引:77
作者
Chang, Wen-Chi [1 ,2 ]
Lee, Tzong-Yi [2 ]
Shien, Dray-Ming [4 ,6 ]
Hsu, Justin Bo-Kai [2 ]
Horng, Jorng-Tzong [4 ,5 ]
Hsu, Po-Chiang [2 ]
Wang, Ting-Yuan [2 ]
Huang, Hsien-Da [1 ,2 ]
Pan, Rong-Long [3 ]
机构
[1] Natl Chiao Tung Univ, Dept Biol Sci & Technol, Hsinchu, Taiwan
[2] Natl Chiao Tung Univ, Inst Bioinformat & Syst Biol, Hsinchu, Taiwan
[3] Natl Tsing Hua Univ, Coll Life Sci, Inst Bioinformat & Struct Biol, Hsinchu, Taiwan
[4] Natl Cent Univ, Dept Comp Sci & Informat Engn, Chungli 320, Taiwan
[5] Asia Univ, Dept Bioinformat, Taichung, Taiwan
[6] Chin Min Inst Technol, Dept Elect Engn, Miaoli, Taiwan
关键词
protein; sulfation; prediction; O-SULFATION; PREDICTION; PHOSPHORYLATION; BINDING; RECOGNITION; MUTANTS; MODELS; 3D;
D O I
10.1002/jcc.21258
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Tyrosine sulfation is a post-translational modification of many secreted and membrane-bound proteins. It governs protein-protein interactions that are involved in leukocyte adhesion, hemostasis, and chemokine signaling. However, the intrinsic feature of sulfated protein remains elusive and remains to be delineated. This investigation presents SulfoSite, which is a computational method based on a support vector machine (SVM) for predicting protein sulfotyrosine sites. The approach was developed to consider structural information such as concerning the secondary structure and solvent accessibility of amino acids that surround the sulfotyrosine sites. One hundred sixty-two experimentally verified tyrosine sulfation sites were identified using UniProtKB/SwissProt release 53.0. The results of a five-fold cross-validation evaluation suggest that the accessibility of the solvent around the sulfotyrosine sites contributes substantially to predictive accuracy. The SVM classifier can achieve an accuracy of 94.2% in fivefold cross validation when sequence positional weighted matrix (PWM) is coupled with values of the accessible surface area (ASA). The proposed method significantly outperforms previous methods for accurately predicting the location of tyrosine sulfation sites. (C) 2009 Wiley Periodicals, Inc. J Comput Chem 30: 2526-2537, 2009
引用
收藏
页码:2526 / 2537
页数:12
相关论文
共 39 条
  • [1] RVP-net: online prediction of real valued accessible surface area of proteins from single sequences
    Ahmad, S
    Gromiha, MM
    Sarai, A
    [J]. BIOINFORMATICS, 2003, 19 (14) : 1849 - 1851
  • [2] Real value prediction of solvent accessibility from amino acid sequence
    Ahmad, S
    Gromiha, MM
    Sarai, A
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 50 (04) : 629 - 635
  • [3] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [4] Existence of distinct tyrosylprotein sulfotransferase genes: Molecular characterization of tyrosylprotein sulfotransferase-2
    Beisswanger, R
    Corbeil, D
    Vannier, C
    Thiele, C
    Dohrmann, U
    Kellner, R
    Ashman, K
    Niehrs, C
    Huttner, WB
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (19) : 11134 - 11139
  • [5] Molecular basis of leukocyte rolling on PSGL-1 -: Predominant role of core-2 O-glycans and of tyrosine sulfate residue 51
    Bernimoulin, MP
    Zeng, XL
    Abbal, C
    Giraud, S
    Martinez, M
    Michielin, O
    Schapira, M
    Spertini, O
    [J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 2003, 278 (01) : 37 - 47
  • [6] The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003
    Boeckmann, B
    Bairoch, A
    Apweiler, R
    Blatter, MC
    Estreicher, A
    Gasteiger, E
    Martin, MJ
    Michoud, K
    O'Donovan, C
    Phan, I
    Pilbout, S
    Schneider, M
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 365 - 370
  • [7] Protein structure prediction servers at university college london
    Bryson, K
    McGuffin, LJ
    Marsden, RL
    Ward, JJ
    Sodhi, JS
    Jones, DT
    [J]. NUCLEIC ACIDS RESEARCH, 2005, 33 : W36 - W38
  • [8] New consensus features for tyrosine O-sulfation determined by mutational analysis
    Bundgaard, JR
    Vuust, J
    Rehfeld, JF
    [J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 1997, 272 (35) : 21700 - 21705
  • [9] A tutorial on Support Vector Machines for pattern recognition
    Burges, CJC
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) : 121 - 167
  • [10] Tyrosine sulfation of human antibodies contributes to recognition of the CCR5 binding region of HIV-1 gp120
    Choe, H
    Li, WH
    Wright, PL
    Vasilieva, N
    Venturi, M
    Huang, CC
    Grundner, C
    Dorfman, T
    Zwick, MB
    Wang, LP
    Rosenberg, ES
    Kwong, PD
    Burton, DR
    Robinson, JE
    Sodroski, JG
    Farzan, M
    [J]. CELL, 2003, 114 (02) : 161 - 170