A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery

被引:80
作者
Xie, Lei [1 ]
Xie, Li [2 ]
Bourne, Philip E. [1 ,2 ]
机构
[1] Univ Calif San Diego, San Diego Supercomp Ctr, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Skaggs Sch Pharm & Pharmaceut Sci, La Jolla, CA 92093 USA
关键词
PROTEIN FUNCTION; FUNCTIONAL SITES; FOLD SPACE; EVOLUTION; IDENTIFICATION; RESISTANCE; PREDICTION; TEMPLATES; PATTERNS; CLASSIFICATION;
D O I
10.1093/bioinformatics/btp220
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Functional relationships between proteins that do not share global structure similarity can be established by detecting their ligand-binding-site similarity. For a large-scale comparison, it is critical to accurately and efficiently assess the statistical significance of this similarity. Here, we report an efficient statistical model that supports local sequence order independent ligand-binding-site similarity searching. Most existing statistical models only take into account the matching vertices between two sites that are defined by a fixed number of points. In reality, the boundary of the binding site is not known or is dependent on the bound ligand making these approaches limited. To address these shortcomings and to perform binding-site mapping on a genome-wide scale, we developed a sequence-order independent profile-profile alignment (SOIPPA) algorithm that is able to detect local similarity between unknown binding sites a priori. The SOIPPA scoring integrates geometric, evolutionary and physical information into a unified framework. However, this imposes a significant challenge in assessing the statistical significance of the similarity because the conventional probability model that is based on fixed-point matching cannot be applied. Here we find that scores for binding-site matching by SOIPPA follow an extreme value distribution (EVD). Benchmark studies show that the EVD model performs at least two-orders faster and is more accurate than the non-parametric statistical method in the previous SOIPPA version. Efficient statistical analysis makes it possible to apply SOIPPA to genome-based drug discovery. Consequently, we have applied the approach to the structural genome of Mycobacterium tuberculosis to construct a protein-ligand interaction network. The network reveals highly connected proteins, which represent suitable targets for promiscuous drugs.
引用
收藏
页码:I305 / I312
页数:8
相关论文
共 81 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Evolution of protein fold in the presence of functional constraints [J].
Andreeva, Antonina ;
Murzin, Alexey G. .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2006, 16 (03) :399-408
[3]   A GRAPH-THEORETIC APPROACH TO THE IDENTIFICATION OF 3-DIMENSIONAL PATTERNS OF AMINO-ACID SIDE-CHAINS IN PROTEIN STRUCTURES [J].
ARTYMIUK, PJ ;
POIRRETTE, AR ;
GRINDLEY, HM ;
RICE, DW ;
WILLETT, P .
JOURNAL OF MOLECULAR BIOLOGY, 1994, 243 (02) :327-344
[4]   An algorithm for constraint-based structural template matching: application to 3D templates with statistical analysis [J].
Barker, JA ;
Thornton, JM .
BIOINFORMATICS, 2003, 19 (13) :1644-1649
[5]   The generation of new protein functions by the combination of domains [J].
Bashton, Matthew ;
Chothia, Cyrus .
STRUCTURE, 2007, 15 (01) :85-99
[6]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[7]   Protein Functional Surfaces: Global Shape Matching and Local Spatial Alignments of Ligand Binding Sites [J].
Binkowski, T. Andrew ;
Joachimiak, Andrzej .
BMC STRUCTURAL BIOLOGY, 2008, 8
[8]   Inferring functional relationships of proteins from local sequence and spatial surface patterns [J].
Binkowski, TA ;
Adamian, L ;
Liang, J .
JOURNAL OF MOLECULAR BIOLOGY, 2003, 332 (02) :505-526
[9]   Probing binding requirements of NAD kinase with modified substrate (NAD) analogues [J].
Bonnac, Laurent ;
Chen, Liqiang ;
Pathak, Rashmi ;
Gao, Guangyao ;
Ming, Qian ;
Bennett, Eric ;
Felczak, Krzysztof ;
Kullberg, Martin ;
Patterson, Steven E. ;
Mazzola, Francesca ;
Magni, Giulio ;
Pankiewicz, Krzysztof W. .
BIOORGANIC & MEDICINAL CHEMISTRY LETTERS, 2007, 17 (06) :1512-1515
[10]   Towards a structural classification of phosphate binding sites in protein-nucleotide complexes: An automated all-against-all structural comparison using geometric matching [J].
Brakoulias, A ;
Jackson, RM .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 56 (02) :250-260