The relationship between the sequence identities of alpha helical proteins in the PDB and the molecular similarities of their ligands

被引:40
作者
Mitchell, JBO [1 ]
机构
[1] Univ Cambridge, Dept Chem, Unilever Ctr Mol Informat, Cambridge CB2 1EW, England
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2001年 / 41卷 / 06期
关键词
D O I
10.1021/ci010364q
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper considers the relationship between the percentage sequence identities of protein chains and the molecular similarities of the ligands they bind. Among a set of alpha helical proteins from the PDB, it is found that related proteins tend to bind similar ligands. Furthermore, the property of binding similar ligands can be used to define the categories of "like" and "unlike" pairs of protein chains, separated by an approximate cutoff at a sequence identity of, or somewhat above, 45%. Similarly, the property of binding related protein chains can be used to define "low" and "high" similarity pairs of ligand residues, with a cutoff at a Tanimoto score of 0.70. The ligands bound to two "like" protein chains are five times more likely to be of high similarity than would be expected if protein sequence identity and ligand molecular similarity were independent variables. Nonetheless, the nature of the PDB means that it is unclear whether the same conclusions would be reached with a data set representing an unbiased sample of all protein-ligand complexes in a living cell. The construction of an appropriate data set for such a study represents a significant challenge.
引用
收藏
页码:1617 / 1622
页数:6
相关论文
共 19 条
[1]  
BARNARD JM, 2000, VIRTUAL SCREENING BI
[2]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[3]   On the properties of bit string-based measures of chemical similarity [J].
Flower, DR .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1998, 38 (03) :379-386
[4]   Mechanistically diverse enzyme superfamilies: the importance of chemistry in the evolution of catalysis [J].
Gerlt, JA ;
Babbitt, PC .
CURRENT OPINION IN CHEMICAL BIOLOGY, 1998, 2 (05) :607-612
[5]   The relationship between protein structure and function: a comprehensive survey with application to the yeast genome [J].
Hegyi, H ;
Gerstein, M .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 288 (01) :147-164
[6]   PDBsum: a Web-based database of summaries and analyses of all PDB structures [J].
Laskowski, RA ;
Hutchinson, EG ;
Michie, AD ;
Wallace, AC ;
Jones, ML ;
Thornton, JM .
TRENDS IN BIOCHEMICAL SCIENCES, 1997, 22 (12) :488-490
[7]   Protein folds and functions [J].
Martin, AC ;
Orengo, CA ;
Hutchinson, EG ;
Jones, S ;
Karmirantzou, M ;
Laskowski, RA ;
Mitchell, JB ;
Taroni, C ;
Thornton, JM .
STRUCTURE, 1998, 6 (07) :875-884
[8]   Selecting optimally diverse compounds from structure databases: A validation study of two-dimensional and three-dimensional molecular descriptors [J].
Matter, H .
JOURNAL OF MEDICINAL CHEMISTRY, 1997, 40 (08) :1219-1229
[9]   Sequences annotated by structure: a tool to facilitate the use of structural information in sequence analysis [J].
Milburn, D ;
Laskowski, RA ;
Thornton, JM .
PROTEIN ENGINEERING, 1998, 11 (10) :855-859
[10]  
Mitchell JBO, 1999, J COMPUT CHEM, V20, P1165, DOI 10.1002/(SICI)1096-987X(199908)20:11<1165::AID-JCC7>3.0.CO