Comparison of protein structures by growing neighborhood alignments

被引:7
作者
Bhattacharya, Sourangshu
Bhattacharyya, Chiranjib [1 ]
Chandra, Nagasuma R.
机构
[1] Indian Inst Sci, Dept Comp Sci & Automat, Bangalore 560012, Karnataka, India
[2] Indian Inst Sci, Bioinformat Ctr, Bangalore 560012, Karnataka, India
关键词
D O I
10.1186/1471-2105-8-77
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Design of protein structure comparison algorithm is an important research issue, having far reaching implications. In this article, we describe a protein structure comparison scheme, which is capable of detecting correct alignments even in difficult cases, e. g. non-topological similarities. The proposed method computes protein structure alignments by comparing, small substructures, called neighborhoods. Two different types of neighborhoods, sequence and structure, are defined, and two algorithms arising out of the scheme are detailed. A new method for computing equivalences having non-topological similarities from pairwise similarity score is described. A novel and fast technique for comparing sequence neighborhoods is also developed. Results: The experimental results show that the current programs show better performance on Fischer and Novotny's benchmark datasets, than state of the art programs, e. g. DALI, CE and SSM. Our programs were also found to calculate correct alignments for proteins with huge amount of indels and internal repeats. Finally, the sequence neighborhood based program was used in extensive fold and non- topological similarity detection experiments. The accuracy of the fold detection experiments with the new measure of similarity was found to be similar or better than that of the standard algorithm CE. Conclusion: A new scheme, resulting in two algorithms, have been developed, implemented and tested. The programs developed are accessible at http://mllab.csa.iisc.ernet.in/mp2/runprog. html.
引用
收藏
页数:14
相关论文
共 22 条
[1]   A COMPUTER VISION-BASED TECHNIQUE FOR 3-D SEQUENCE-INDEPENDENT STRUCTURAL COMPARISON OF PROTEINS [J].
BACHAR, O ;
FISCHER, D ;
NUSSINOV, R ;
WOLFSON, H .
PROTEIN ENGINEERING, 1993, 6 (03) :279-288
[2]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[3]   The ASTRAL Compendium in 2004 [J].
Chandonia, JM ;
Hon, G ;
Walker, NS ;
Lo Conte, L ;
Koehl, P ;
Levitt, M ;
Brenner, SE .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D189-D192
[4]  
FISCHER D, 1996, ASSESSING PERFORMANC, P300
[5]  
GOLDMAN D, 1999, FOCS 99 P 40 ANN S F, P512, DOI DOI 10.1109/SFFCS.1999.814624
[6]   Mapping the protein universe [J].
Holm, L ;
Sander, C .
SCIENCE, 1996, 273 (5275) :595-602
[7]   PROTEIN-STRUCTURE COMPARISON BY ALIGNMENT OF DISTANCE MATRICES [J].
HOLM, L ;
SANDER, C .
JOURNAL OF MOLECULAR BIOLOGY, 1993, 233 (01) :123-138
[8]   CLOSED-FORM SOLUTION OF ABSOLUTE ORIENTATION USING UNIT QUATERNIONS [J].
HORN, BKP .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1987, 4 (04) :629-642
[9]   Comprehensive evaluation of protein structure alignment methods: Scoring by geometric measures [J].
Kolodny, R ;
Koehl, P ;
Levitt, M .
JOURNAL OF MOLECULAR BIOLOGY, 2005, 346 (04) :1173-1188
[10]   Approximate protein structural alignment in polynomial time [J].
Kolodny, R ;
Linial, N .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (33) :12201-12206