Markov entropy backbone electrostatic descriptors for predicting proteins biological activity

被引:52
作者
González-Díaz, H
Molina, R
Uriarte, E
机构
[1] Univ Santiago de Compostela, Fac Pharm, Dept Organ Chem, Santiago 15782, Spain
[2] Univ Rostock, FB Chem, D-18059 Rostock, Germany
关键词
protein electrostatics; 3D-QSAR; Markov chain; electrostatic field;
D O I
10.1016/j.bmcl.2004.06.100
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
The spherical truncation of electrostatic interactions between aminoacids makes it possible to break down long-range spatial electrostatic interactions, resulting in short-range interactions. As a result, a Markov Chain model may be used to calculate the probabilities with which the effect of a given interaction reaches aminoacids at different distances within the backbone. The entropies of a Markov Chain model of this type may then be used to codify information about the spatial distribution of charges in the protein used in this study exploring the structure-activity relationship. In this paper, a linear discriminant analysis is reported, which correctly classified 92.3% of 26 proteins under investigation and leave-one-out cross validation, purely for illustrative purposes. Classification was carried out for three possible activities: lysozymes, dihydrofolate reductases, and alcohol dehydrogenases. The discriminant analysis equations were contracted into two canonical roots. These simple canonical roots have high regression coefficients (R-c1 = 0.903 and R-c2 = 0.70). Root1 explains the biological activity of alcohol dehydrogenases while Root2 discriminates between lysozymes and dihydrofolate reductases. It was possible to profile the effect of core, middle, and surface aminoacids on biological activity. In contrast, a model considering classic physicochemical parameters such as: polarizability, refractivity, and partition coefficient classify correctly only the 80.8% of the proteins. (C) 2004 Elsevier Ltd. All rights reserved.
引用
收藏
页码:4691 / 4695
页数:5
相关论文
共 30 条
[1]  
[Anonymous], CHEMOMETRIC METHODS
[2]   A METHOD FOR THE CHARACTERIZATION OF FOLDINGS IN PROTEIN RIBBON MODELS [J].
ARTECA, GA ;
MEZEY, PG .
JOURNAL OF MOLECULAR GRAPHICS, 1990, 8 (02) :66-80
[3]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[4]   Support vector machines for predicting rRNA-, RNA-, and DNA-binding proteins from amino acid sequence [J].
Cai, YD ;
Lin, SL .
BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2003, 1648 (1-2) :127-133
[5]   AMINO-ACID SIDE-CHAIN DESCRIPTORS FOR QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIP STUDIES OF PEPTIDE ANALOGS [J].
COLLANTES, ER ;
DUNN, WJ .
JOURNAL OF MEDICINAL CHEMISTRY, 1995, 38 (14) :2705-2713
[6]  
DERAMOS AR, 2004, PROTEINS STRUCT FUNC
[7]   Markovian negentropies in bioinformatics.: 1.: A picture of footprints after the interaction of the HIV-1 Ψ-RNA packaging region with drugs [J].
Díaz, HG ;
de Armas, RR ;
Molina, R .
BIOINFORMATICS, 2003, 19 (16) :2079-2087
[8]   Markovian chemicals "in silico" design (MARCH-INSIDE), a promising approach for computer aided molecular design II:: Experimental and theoretical assessment of a novel method for virtual screening of fasciolicides [J].
Díaz, HG ;
Olazabal, E ;
Castañedo, N ;
Sánchez, IH ;
Morales, A ;
Serrano, HS ;
González, J ;
de Armas, RR .
JOURNAL OF MOLECULAR MODELING, 2002, 8 (08) :237-245
[9]   Characterization of 3D molecular structure [J].
Estrada, E .
CHEMICAL PHYSICS LETTERS, 2000, 319 (5-6) :713-718
[10]   Characterization of the folding degree of proteins [J].
Estrada, E .
BIOINFORMATICS, 2002, 18 (05) :697-704