Accelerating screening of 3D protein data with a graph theoretical approach

被引:9
作者
Frömmel, C
Gille, C
Goede, A
Gröpl, C
Hougardy, S
Nierhoff, T
Preissner, R
Thimm, M
机构
[1] Charite, Inst Biochem, D-10117 Berlin, Germany
[2] Humboldt Univ, Inst Informat, D-10099 Berlin, Germany
关键词
D O I
10.1093/bioinformatics/btg343
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The Dictionary of Interfaces in Proteins (DIP) is a database collecting the 3D structure of interacting parts of proteins that are called patches. It serves as a repository, in which patches similar to given query patches can be found. The computation of the similarity of two patches is time consuming and traversing the entire DIP requires some hours. In this work we address the question of how the patches similar to a given query can be identified by scanning only a small part of DIP. The answer to this question requires the investigation of the distribution of the similarity of patches. Results: The score values describing the similarity of two patches can roughly be divided into three ranges that correspond to different levels of spatial similarity. Interestingly, the two iso-score lines separating the three classes can be determined by two different approaches. Applying a concept of the theory of random graphs reveals significant structural properties of the data in DIP. These can be used to accelerate scanning the DIP for patches similar to a given query. Searches for very similar patches could be accelerated by a factor of more than 25. Patches with a medium similarity could be found 10 times faster than by brute-force search.
引用
收藏
页码:2442 / 2447
页数:6
相关论文
共 28 条
[1]   Selected concepts and investigations in compound classification, molecular descriptor analysis, and virtual screening [J].
Bajorath, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (02) :233-245
[2]  
Beroza P, 2000, J MOL GRAPH MODEL, V18, P335
[3]  
Bollobas B., 2001, Random Graphs, V21
[4]   Clustering protein sequences-structure prediction by transitive homology [J].
Bolten, E ;
Schliep, A ;
Schneckener, S ;
Schomburg, D ;
Schrader, R .
BIOINFORMATICS, 2001, 17 (10) :935-941
[5]   QSAR study and VolSurf characterization of anti-HIV quinolone library [J].
Filipponi, E ;
Cruciani, G ;
Tabarrini, O ;
Cecchetti, V ;
Fravolini, A .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2001, 15 (03) :203-217
[6]   Recent advances in structure-based rational drug design [J].
Gane, PJ ;
Dean, PM .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2000, 10 (04) :401-404
[7]  
Good A, 2001, Curr Opin Drug Discov Devel, V4, P301
[8]   THE CRYSTALLOGRAPHIC INFORMATION FILE (CIF) - A NEW STANDARD ARCHIVE FILE FOR CRYSTALLOGRAPHY [J].
HALL, SR ;
ALLEN, FH ;
BROWN, ID .
ACTA CRYSTALLOGRAPHICA SECTION A, 1991, 47 :655-685
[9]   A clustering algorithm based on graph connectivity [J].
Hartuv, E ;
Shamir, R .
INFORMATION PROCESSING LETTERS, 2000, 76 (4-6) :175-181
[10]  
Hartuv Erez., 1999, RECOMB, P188