Comparison of protein structures by growing neighborhood alignments

Cited by: 7
Authors
Bhattacharya, Sourangshu
Bhattacharyya, Chiranjib [1]
Chandra, Nagasuma R.
Affiliations
[1] Indian Inst Sci, Dept Comp Sci & Automat, Bangalore 560012, Karnataka, India
[2] Indian Inst Sci, Bioinformat Ctr, Bangalore 560012, Karnataka, India
Keywords
DOI
10.1186/1471-2105-8-77
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Subject classification code
071010; 081704
Abstract
Background: The design of protein structure comparison algorithms is an important research problem with far-reaching implications. In this article, we describe a protein structure comparison scheme that is capable of detecting correct alignments even in difficult cases, e.g. non-topological similarities. The proposed method computes protein structure alignments by comparing small substructures, called neighborhoods. Two types of neighborhoods, sequence and structure, are defined, and two algorithms arising from the scheme are detailed. A new method for computing equivalences with non-topological similarities from pairwise similarity scores is described. A novel and fast technique for comparing sequence neighborhoods is also developed. Results: Experimental results show that the proposed programs perform better than state-of-the-art programs, e.g. DALI, CE and SSM, on the Fischer and Novotny benchmark datasets. Our programs were also found to calculate correct alignments for proteins with a large number of indels and internal repeats. Finally, the sequence neighborhood based program was used in extensive fold and non-topological similarity detection experiments. The accuracy of the fold detection experiments with the new measure of similarity was found to be similar to or better than that of the standard algorithm CE. Conclusion: A new scheme, resulting in two algorithms, has been developed, implemented and tested. The programs developed are accessible at http://mllab.csa.iisc.ernet.in/mp2/runprog.html.
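The idea of comparing small neighborhoods and then deriving equivalences without enforcing sequential order can be illustrated with a toy sketch. This is not the authors' algorithm, whose details are not given in the abstract. It assumes that residues are represented by 3D coordinate tuples, that a sequence neighborhood is a window of consecutive residues, that neighborhoods are compared through their internal distance profiles, and that equivalences are picked greedily from the pairwise score matrix. All function names, the window size, and the threshold are hypothetical.

```python
import math

def sequence_neighborhoods(coords, size=4):
    """Index windows of `size` consecutive residues (assumed definition)."""
    return [list(range(i, i + size)) for i in range(len(coords) - size + 1)]

def distance_profile(coords, idxs):
    """All pairwise distances inside one neighborhood, in a fixed order."""
    return [math.dist(coords[a], coords[b])
            for k, a in enumerate(idxs) for b in idxs[k + 1:]]

def neighborhood_similarity(p, q):
    """Score in (0, 1]; equals 1.0 when the two distance profiles are identical."""
    rms = math.sqrt(sum((x - y) ** 2 for x, y in zip(p, q)) / len(p))
    return 1.0 / (1.0 + rms)

def greedy_equivalences(coords_a, coords_b, size=4, threshold=0.8):
    """One-to-one neighborhood pairs chosen by descending score.

    No sequential-order constraint is imposed, so segments that appear
    in a different order in the two chains can still be paired.
    """
    pa = [distance_profile(coords_a, n) for n in sequence_neighborhoods(coords_a, size)]
    pb = [distance_profile(coords_b, n) for n in sequence_neighborhoods(coords_b, size)]
    scored = sorted(
        ((neighborhood_similarity(p, q), i, j)
         for i, p in enumerate(pa) for j, q in enumerate(pb)),
        key=lambda t: (-t[0], t[1], t[2]))
    used_a, used_b, pairs = set(), set(), []
    for score, i, j in scored:
        if score < threshold:
            break
        if i in used_a or j in used_b:
            continue
        used_a.add(i)
        used_b.add(j)
        pairs.append((i, j, score))
    return sorted(pairs)

def shift(segment, delta):
    """Translate every point of a segment by the vector `delta`."""
    return [tuple(c + d for c, d in zip(point, delta)) for point in segment]

# Toy data: chain B contains the same two segments as chain A, in swapped order.
line = [(0, 0, 0), (1, 0, 0), (2, 0, 0), (3, 0, 0)]
square = [(0, 0, 0), (1, 0, 0), (1, 1, 0), (0, 1, 0)]
chain_a = line + shift(square, (3, 2, 1))
chain_b = square + shift(line, (0, 3, 2))

pairs = greedy_equivalences(chain_a, chain_b)
for i, j, score in pairs:
    print(f"A window {i} <-> B window {j}  score={score:.3f}")
```

In this toy case the first window of chain A pairs with the last window of chain B and vice versa, which an order-preserving alignment could not report together. A greedy selection is only the simplest way to read equivalences off a score matrix; it makes no claim to optimality.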
Pages: 14