Isoelectric point optimization using peptide descriptors and support vector machines

Cited by: 30
Authors
Perez-Riverol, Yasset [1, 4]
Audain, Enrique [2]
Millan, Aleli [1]
Ramos, Yassel [1]
Sanchez, Aniel [1]
Vizcaino, Juan Antonio [4]
Wang, Rui [4]
Mueller, Markus [3]
Machado, Yoan J. [2]
Betancourt, Lazaro H. [1]
Gonzalez, Luis J. [1]
Padron, Gabriel [1]
Besada, Vladimir [1]
Affiliations
[1] Ctr Genet Engn & Biotechnol, Dept Prote, Havana, Cuba
[2] Ctr Mol Immunol, Dept Prote, Havana, Cuba
[3] Swiss Inst Bioinformat, Proteome Informat Grp, CH-1211 Geneva, Switzerland
[4] European Bioinformat Inst, EMBL Outstn, Cambridge, England
Keywords
Isoelectric point; Support vector machine; Peptide descriptors; TANDEM MASS-SPECTROMETRY; IMMOBILIZED PH GRADIENTS; AMINO-ACID-SEQUENCES; SHOTGUN PROTEOMICS; PREDICTION; ACCURACY; PROTEINS; IDENTIFICATION; DATABASE
DOI
10.1016/j.jprot.2012.01.029
Chinese Library Classification number
Q5 [Biochemistry]
Subject classification code
071010; 081704
Abstract
Immobilized pH gradient (IPG) based separations are frequently used as the first step in shotgun proteomics methods; they increase both the dynamic range and the resolution of peptide separation prior to LC-MS analysis. Experimental isoelectric point (pI) values can improve peptide identifications in conjunction with MS/MS information. Thus, accurate estimation of the pI value from the amino acid sequence becomes critical for these kinds of experiments. Currently, pI is commonly predicted using the charge-state model [1] and/or the cofactor algorithm [2]. However, neither of these methods is capable of calculating the pI value for basic peptides accurately. In this manuscript, we present a new approach that can significantly improve pI estimation by using Support Vector Machines (SVM) [3], an experimental amino acid descriptor taken from the AAIndex database [4], and the isoelectric point predicted by the charge-state model. Our results show a strong correlation (R² = 0.98) between the predicted and observed values, with a standard deviation of 0.32 pH units across the complete pH range. © 2012 Elsevier B.V. All rights reserved.
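The abstract names the charge-state model as one of the inputs to the SVM. As a rough illustration of what a charge-state style estimate involves, the sketch below sums Henderson-Hasselbalch charges over ionizable groups and bisects for the pH at which the net charge is zero. The pKa values, function names, and tolerance are assumptions made for this example only; they are not the parameter set or implementation used in the paper.

```python
# Illustrative side-chain and terminal pKa values. These are assumptions
# chosen for the sketch, not the parameter set used in the paper.
POSITIVE_PKA = {"N_TERM": 9.0, "K": 10.5, "R": 12.5, "H": 6.0}
NEGATIVE_PKA = {"C_TERM": 3.1, "D": 3.9, "E": 4.1, "C": 8.3, "Y": 10.1}


def net_charge(sequence: str, ph: float) -> float:
    """Henderson-Hasselbalch net charge of a peptide at a given pH."""
    # Every peptide has one free N-terminus and one free C-terminus.
    positive_groups = ["N_TERM"] + [aa for aa in sequence if aa in POSITIVE_PKA]
    negative_groups = ["C_TERM"] + [aa for aa in sequence if aa in NEGATIVE_PKA]
    positive = sum(1.0 / (1.0 + 10 ** (ph - POSITIVE_PKA[g])) for g in positive_groups)
    negative = sum(1.0 / (1.0 + 10 ** (NEGATIVE_PKA[g] - ph)) for g in negative_groups)
    return positive - negative


def isoelectric_point(sequence: str, low: float = 0.0, high: float = 14.0,
                      tolerance: float = 1e-4) -> float:
    """Bisect for the pH where the net charge crosses zero.

    Net charge decreases monotonically with pH, so bisection is sufficient.
    """
    while high - low > tolerance:
        mid = (low + high) / 2.0
        if net_charge(sequence, mid) > 0:
            low = mid   # still positively charged: pI lies at higher pH
        else:
            high = mid  # negatively charged or neutral: pI lies at lower pH
    return (low + high) / 2.0


if __name__ == "__main__":
    for peptide in ("DDDEEE", "KKKRRR"):  # hypothetical example sequences
        print(peptide, round(isoelectric_point(peptide), 2))
```

In an approach like the one described, a value of this kind could serve as one input feature to a regression model alongside a per-residue descriptor; how the paper combines them is not specified in this record.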
Pages: 2269-2274
Number of pages: 6
Related papers
26 in total
[1]   Machine learning methods for predictive proteomics [J].
Barla, Annalisa ;
Jurman, Giuseppe ;
Riccadonna, Samantha ;
Merler, Stefano ;
Chierici, Marco ;
Furlanello, Cesare .
BRIEFINGS IN BIOINFORMATICS, 2008, 9 (02) :119-128
[2]   THE FOCUSING POSITIONS OF POLYPEPTIDES IN IMMOBILIZED PH GRADIENTS CAN BE PREDICTED FROM THEIR AMINO-ACID-SEQUENCES [J].
BJELLQVIST, B ;
HUGHES, GJ ;
PASQUALI, C ;
PAQUET, N ;
RAVIER, F ;
SANCHEZ, JC ;
FRUTIGER, S ;
HOCHSTRASSER, D .
ELECTROPHORESIS, 1993, 14 (10) :1023-1031
[3]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[4]   Calculation of the isoelectric point of tryptic peptides in the pH 3.5-4.5 range based on adjacent amino acid effects [J].
Cargile, Benjamin J. ;
Sevinsky, Joel R. ;
Essader, Amal S. ;
Eu, Jerry P. ;
Stephenson, James L., Jr. .
ELECTROPHORESIS, 2008, 29 (13) :2768-2778
[5]   An alternative to tandem mass spectrometry: Isoelectric point and accurate mass for the identification of peptides [J].
Cargile, BJ ;
Stephenson, JL .
ANALYTICAL CHEMISTRY, 2004, 76 (02) :267-275
[6]   Immobilized pH gradients as a first dimension in shotgun proteomics and analysis of the accuracy of pI predictability of peptides [J].
Cargile, BJ ;
Talley, DL ;
Stephenson, JL .
ELECTROPHORESIS, 2004, 25 (06) :936-945
[7]   TANDEM: matching proteins with tandem mass spectra [J].
Craig, R ;
Beavis, RC .
BIOINFORMATICS, 2004, 20 (09) :1466-1467
[8]   ExPASy: the proteomics server for in-depth protein knowledge and analysis [J].
Gasteiger, E ;
Gattiker, A ;
Hoogland, C ;
Ivanyi, I ;
Appel, RD ;
Bairoch, A .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3784-3788
[9]   A versatile peptide pI calculator for phosphorylated and N-terminal acetylated peptides experimentally tested using peptide isoelectric focusing [J].
Gauci, Sharon ;
Van Breukelen, Bas ;
Lemeer, Simone M. ;
Krijgsveld, Jeroen ;
Heck, Albert J. R. .
PROTEOMICS, 2008, 8 (23-24) :4898-4906
[10]   Added value for tandem mass spectrometry shotgun proteomics data validation through isoelectric focusing of peptides [J].
Heller, M ;
Ye, ML ;
Michel, PE ;
Morier, P ;
Stalder, D ;
Jünger, MA ;
Aebersold, R ;
Reymond, FR ;
Rossier, JS .
JOURNAL OF PROTEOME RESEARCH, 2005, 4 (06) :2273-2282