Fast overlapping of protein contact maps by alignment of eigenvectors

被引:22
作者
Di Lena, Pietro [1 ]
Fariselli, Piero [2 ]
Margara, Luciano [1 ]
Vassura, Marco [1 ]
Casadio, Rita [2 ]
机构
[1] Univ Bologna, Dept Comp Sci, I-40127 Bologna, Italy
[2] Univ Bologna, Biocomp Grp, I-40127 Bologna, Italy
关键词
RECONSTRUCTION; ALGORITHM; SEARCH; SIMILARITY; DATABASE;
D O I
10.1093/bioinformatics/btq402
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Searching for structural similarity is a key issue of protein functional annotation. The maximum contact map overlap (CMO) is one of the possible measures of protein structure similarity. Exact and approximate methods known to optimize the CMO are computationally expensive and this hampers their applicability to large-scale comparison of protein structures. Results: In this article, we describe a heuristic algorithm (Al-Eigen) for finding a solution to the CMO problem. Our approach relies on the approximation of contact maps by eigendecomposition. We obtain good overlaps of two contact maps by computing the optimal global alignment of few principal eigenvectors. Our algorithm is simple, fast and its running time is independent of the amount of contacts in the map. Experimental testing indicates that the algorithm is comparable to exact CMO methods in terms of the overlap quality, to structural alignment methods in terms of structure similarity detection and it is fast enough to be suited for large-scale comparison of protein structures. Furthermore, our preliminary tests indicates that it is quite robust to noise, which makes it suitable for structural similarity detection also for noisy and incomplete contact maps.
引用
收藏
页码:2250 / 2258
页数:9
相关论文
共 40 条
[1]   Fast molecular shape matching using contact maps [J].
Agarwal, Pankaj K. ;
Mustafa, Nabil H. ;
Wang, Yusu .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2007, 14 (02) :131-143
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]  
Andonov R, 2008, LECT N BIOINFORMAT, V5251, P162, DOI 10.1007/978-3-540-87361-7_14
[4]   Data growth and its impact on the SCOP database: new developments [J].
Andreeva, Antonina ;
Howorth, Dave ;
Chandonia, John-Marc ;
Brenner, Steven E. ;
Hubbard, Tim J. P. ;
Chothia, Cyrus ;
Murzin, Alexey G. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D419-D425
[5]   SINGULAR VALUE DECOMPOSITION (SVD) IMAGE-CODING [J].
ANDREWS, HC ;
PATTERSON, CL .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1976, 24 (04) :425-432
[6]  
[Anonymous], P 40 ANN IEEE S FDN
[7]   The effect of backbone on the small-world properties of protein contact maps [J].
Bartoli, L. ;
Fariselli, P. ;
Casadio, R. .
PHYSICAL BIOLOGY, 2007, 4 (04) :L1-L5
[8]   1001 optimal PDB structure alignments: Integer programming methods for finding the maximum contact map overlap [J].
Caprara, A ;
Carr, R ;
Istrail, S ;
Lancia, G ;
Walenz, B .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2004, 11 (01) :27-52
[9]  
Caprara A.A., 2002, Proceedings of the sixth annual international conference on Computational biology, P100, DOI [DOI 10.1145/565196.565209, 10.1145/565196.565209.]
[10]   Optimal contact definition for reconstruction of Contact Maps [J].
Duarte, Jose M. ;
Sathyapriya, Rajagopal ;
Stehr, Henning ;
Filippis, Ioannis ;
Lappe, Michael .
BMC BIOINFORMATICS, 2010, 11