A new progressive-iterative algorithm for multiple structure alignment

被引:105
作者
Lupyan, D
Leo-Macias, A
Ortiz, AR [1 ]
机构
[1] Univ Autonoma Madrid, CSIC, Ctr Mol Biol Severo Ochoa, Bioinformat Unit, E-28049 Madrid, Spain
[2] Mt Sinai Sch Med, Dept Physiol & Biophys, New York, NY 10029 USA
关键词
D O I
10.1093/bioinformatics/bti527
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Multiple structure alignments are becoming important tools in many aspects of structural bioinformatics. The current explosion in the number of available protein structures demands multiple structural alignment algorithms with an adequate balance of accuracy and speed, for large scale applications in structural genomics, protein structure prediction and protein classification. Results: A new multiple structural alignment program, MAMMOTH-mult, is described. It is demonstrated that the alignments obtained with the new method are an improvement over previous manual or automatic alignments available in several widely used databases at all structural levels. Detailed analysis of the structural alignments for a few representative cases indicates that MAMMOTH-mult delivers biologically meaningful trees and conservation at the sequence and structural levels of functional motifs in the alignments. An important improvement over previous methods is the reduction in computational cost. Typical alignments take only a median time of 5 CPU seconds in a single R12000 processor. MAMMOTH-mult is particularly useful for large scale applications. Availability: http://ub.cbm.uam.es/mammoth/mult Contact: aro@cbm.uam.es
引用
收藏
页码:3255 / 3263
页数:9
相关论文
共 44 条
[1]   Combining multiple structure and sequence alignments to improve sequence detection and alignment: Application to the SH2 domains of Janus kinases [J].
Al-Lazikani, B ;
Sheinerman, FB ;
Honig, B .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (26) :14796-14801
[2]   SCOP database in 2004: refinements integrate structure and sequence family data [J].
Andreeva, A ;
Howorth, D ;
Brenner, SE ;
Hubbard, TJP ;
Chothia, C ;
Murzin, AG .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D226-D229
[3]  
[Anonymous], 1999, APPL MULTIVARIATE AN
[4]   Use of a database of structural alignments and phylogenetic trees in investigating the relationship between sequence and structural variability among homologous proteins [J].
Balaji, S ;
Srinivasan, N .
PROTEIN ENGINEERING, 2001, 14 (04) :219-226
[5]   A STRATEGY FOR THE RAPID MULTIPLE ALIGNMENT OF PROTEIN SEQUENCES - CONFIDENCE LEVELS FROM TERTIARY STRUCTURE COMPARISONS [J].
BARTON, GJ ;
STERNBERG, MJE .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 198 (02) :327-337
[6]  
BORK P, 1994, J MOL BIOL, V242, P309, DOI 10.1006/jmbi.1994.1582
[7]   The ASTRAL compendium for protein structure and sequence analysis [J].
Brenner, SE ;
Koehl, P ;
Levitt, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :254-256
[8]  
Burke DF, 1999, PROTEINS, P55
[9]   The ASTRAL Compendium in 2004 [J].
Chandonia, JM ;
Hon, G ;
Walker, NS ;
Lo Conte, L ;
Koehl, P ;
Levitt, M ;
Brenner, SE .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D189-D192
[10]  
GODZIK A, 1994, COMPUT APPL BIOSCI, V10, P587