Learning a peptide-protein binding affinity predictor with kernel ridge regression

被引:32
作者
Giguere, Sebastien [1 ]
Marchand, Mario [1 ]
Laviolette, Francois [1 ]
Drouin, Alexandre [1 ]
Corbeil, Jacques [2 ]
机构
[1] Univ Laval, Dept Comp Sci & Software Engn, Quebec City, PQ, Canada
[2] Univ Laval, Dept Mol Med, Quebec City, PQ, Canada
来源
BMC BIOINFORMATICS | 2013年 / 14卷
基金
加拿大自然科学与工程研究理事会; 加拿大创新基金会;
关键词
STRING KERNELS; SYSTEMS; MODEL;
D O I
10.1186/1471-2105-14-82
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The cellular function of a vast majority of proteins is performed through physical interactions with other biomolecules, which, most of the time, are other proteins. Peptides represent templates of choice for mimicking a secondary structure in order to modulate protein-protein interaction. They are thus an interesting class of therapeutics since they also display strong activity, high selectivity, low toxicity and few drug-drug interactions. Furthermore, predicting peptides that would bind to a specific MHC alleles would be of tremendous benefit to improve vaccine based therapy and possibly generate antibodies with greater affinity. Modern computational methods have the potential to accelerate and lower the cost of drug and vaccine discovery by selecting potential compounds for testing in silico prior to biological validation. Results: We propose a specialized string kernel for small bio-molecules, peptides and pseudo-sequences of binding interfaces. The kernel incorporates physico-chemical properties of amino acids and elegantly generalizes eight kernels, comprised of the Oligo, the Weighted Degree, the Blended Spectrum, and the Radial Basis Function. We provide a low complexity dynamic programming algorithm for the exact computation of the kernel and a linear time algorithm for it's approximation. Combined with kernel ridge regression and SupCK, a novel binding pocket kernel, the proposed kernel yields biologically relevant and good prediction accuracy on the PepX database. For the first time, a machine learning predictor is capable of predicting the binding affinity of any peptide to any protein with reasonable accuracy. The method was also applied to both single-target and pan-specific Major Histocompatibility Complex class II benchmark datasets and three Quantitative Structure Affinity Model benchmark datasets. Conclusion: On all benchmarks, our method significantly (p-value <= 0.057) outperforms the current state-of-the-art methods at predicting peptide-protein binding affinities. The proposed approach is flexible and can be applied to predict any quantitative biological activity. Moreover, generating reliable peptide-protein binding affinities will also improve system biology modelling of interaction pathways. Lastly, the method should be of value to a large segment of the research community with the potential to accelerate the discovery of peptide-based drugs and facilitate vaccine development. The proposed kernel is freely available at http://graal.ift.ulaval.ca/downloads/gs-kernel/.
引用
收藏
页数:16
相关论文
共 36 条
[1]   Scale-free networks in cell biology [J].
Albert, R .
JOURNAL OF CELL SCIENCE, 2005, 118 (21) :4947-4957
[2]  
[Anonymous], 2004, KERNEL METHODS PATTE
[3]   MultiRTA: A simple yet reliable method for predicting peptide binding affinities for multiple class II MHC allotypes [J].
Bordner, Andrew J. ;
Mittelmann, Hans D. .
BMC BIOINFORMATICS, 2010, 11
[4]   Prediction of the binding affinities of peptides to class II MHC using a regularized thermodynamic model [J].
Bordner, Andrew J. ;
Mittelmann, Hans D. .
BMC BIOINFORMATICS, 2010, 11
[5]   Sol-gel chemistry in medicinal science [J].
Coradin, T ;
Boissière, M ;
Livage, J .
CURRENT MEDICINAL CHEMISTRY, 2006, 13 (01) :99-108
[6]  
Dana-Farber Cancer Institute, 2012, 2 MACH LEARN COMP IM
[7]   Small molecular weight protein-protein interaction antagonists -: an insurmountable challenge? [J].
Doemling, Alexander .
CURRENT OPINION IN CHEMICAL BIOLOGY, 2008, 12 (03) :281-291
[8]   Genome scale enzyme-metabolite and drug-target interaction predictions using the signature molecular descriptor [J].
Faulon, Jean-Loup ;
Misra, Milind ;
Martin, Shawn ;
Sale, Ken ;
Sapra, Rajat .
BIOINFORMATICS, 2008, 24 (02) :225-233
[9]   A new protein binding pocket similarity measure based on comparison of clouds of atoms in 3D: application to ligand prediction [J].
Hoffmann, Brice ;
Zaslavskiy, Mikhail ;
Vert, Jean-Philippe ;
Stoven, Veronique .
BMC BIOINFORMATICS, 2010, 11
[10]   Large-scale prediction of protein-protein interactions from structures [J].
Hue, Martial ;
Riffle, Michael ;
Vert, Jean-Philippe ;
Noble, William S. .
BMC BIOINFORMATICS, 2010, 11