Fast protein fragment similarity scoring using a Binet-Cauchy kernel

被引:13
作者
Guyon, Frederic [1 ]
Tuffery, Pierre [1 ,2 ,3 ]
机构
[1] Univ Paris Diderot, Sorbonne Paris Cite, Mol Therapeut Silico, UMR 973, F-75205 Paris, France
[2] INSERM, U973, F-75205 Paris, France
[3] Univ Paris Diderot, Ressources Parisiennes Bioinformat Struct, F-75205 Paris, France
关键词
RELATE; 2; SETS; 3-DIMENSIONAL STRUCTURES; STRUCTURE ALIGNMENT; RESOURCE; ROTATION; VECTORS; TIME; TOOL;
D O I
10.1093/bioinformatics/btt618
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Motivation: Meaningful scores to assess protein structure similarity are essential to decipher protein structure and sequence evolution. The mining of the increasing number of protein structures requires fast and accurate similarity measures with statistical significance. Whereas numerous approaches have been proposed for protein domains as a whole, the focus is progressively moving to a more local level of structure analysis for which similarity measurement still remains without any satisfactory answer. Results: We introduce a new score based on Binet-Cauchy kernel. It is normalized and bounded between 1-maximal similarity that implies exactly the same conformations for protein fragments-and -1-mirror image conformations, the unrelated conformations having a null mean score. This allows for the search of both similar and mirror conformations. In addition, such score addresses two major issue of the widely used root mean square deviation (RMSD). First, it achieves length independent statistics even for short fragments. Second, it shows better performance in the discrimination of medium range RMSD values. Being simpler and faster to compute than the RMSD, it also provides the means for large-scale mining of protein structures.
引用
收藏
页码:784 / 791
页数:8
相关论文
共 34 条
[1]
Reorganizing the protein space at the Universal Protein Resource (UniProt) [J].
Apweiler, Rolf ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Alam-Faruque, Yasmin ;
Antunes, Ricardo ;
Casanova, Elisabet Barrera ;
Bely, Benoit ;
Bingley, Mark ;
Bower, Lawrence ;
Bursteinas, Borisas ;
Chan, Wei Mun ;
Chavali, Gayatri ;
Da Silva, Alan ;
Dimmer, Emily ;
Eberhardt, Ruth ;
Fazzini, Francesco ;
Fedotov, Alexander ;
Garavelli, John ;
Castro, Leyla Garcia ;
Gardner, Michael ;
Hieta, Reija ;
Huntley, Rachael ;
Jacobsen, Julius ;
Legge, Duncan ;
Liu, Wudong ;
Luo, Jie ;
Orchard, Sandra ;
Patient, Samuel ;
Pichler, Klemens ;
Poggioli, Diego ;
Pontikos, Nikolas ;
Pundir, Sangya ;
Rosanoff, Steven ;
Sawford, Tony ;
Sehra, Harminder ;
Turner, Edward ;
Wardell, Tony ;
Watkins, Xavier ;
Corbett, Matt ;
Donnelly, Mike ;
van Rensburg, Pieter ;
Goujon, Mickael ;
McWilliam, Hamish ;
Lopez, Rodrigo ;
Xenarios, Ioannis ;
Bougueleret, Lydie ;
Bridge, Alan ;
Poux, Sylvain ;
Redaschi, Nicole .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D71-D75
[2]
RESIDUAL LIFE TIME AT GREAT AGE [J].
BALKEMA, AA ;
DEHAAN, L .
ANNALS OF PROBABILITY, 1974, 2 (05) :792-804
[3]
Betancourt MR, 2001, BIOPOLYMERS, V59, P305, DOI 10.1002/1097-0282(20011015)59:5<305::AID-BIP1027>3.0.CO
[4]
2-6
[5]
Local sequence-structure correlations in proteins [J].
Bystroff, C ;
Simons, KT ;
Han, KF ;
Baker, D .
CURRENT OPINION IN BIOTECHNOLOGY, 1996, 7 (04) :417-421
[6]
A normalized root-mean-square distance for comparing protein three-dimensional structures [J].
Carugo, O ;
Pongor, S .
PROTEIN SCIENCE, 2001, 10 (07) :1470-1473
[7]
Fast detection of common geometric substructure in proteins [J].
Chew, LP ;
Huttenlocher, D ;
Kedem, K ;
Kleinberg, J .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (3-4) :313-325
[8]
Using quaternions to calculate RMSD [J].
Coutsias, EA ;
Seok, C ;
Dill, KA .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2004, 25 (15) :1849-1857
[9]
De Gennes P.-G, 1979, Scaling concepts in polymer physics
[10]
Connecting the protein structure universe by using sparse recurring fragments [J].
Friedberg, I ;
Godzik, A .
STRUCTURE, 2005, 13 (08) :1213-1224