Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis

Cited by: 8502
Authors
Castresana, J [1]
Affiliations
[1] European Mol Biol Lab, D-69117 Heidelberg, Germany
Keywords
multiple alignments; conserved blocks; amino acid composition; mitochondrial proteins; eukaryotes;
DOI
10.1093/oxfordjournals.molbev.a026334
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
The use of some multiple-sequence alignments in phylogenetic analysis, particularly those that are not very well conserved, requires the elimination of poorly aligned positions and divergent regions, since they may not be homologous or may have been saturated by multiple substitutions. A computerized method that eliminates such positions and at the same time tries to minimize the loss of informative sites is presented here. The method is based on the selection of blocks of positions that fulfill a simple set of requirements with respect to the number of contiguous conserved positions, lack of gaps, and high conservation of flanking positions, making the final alignment more suitable for phylogenetic analysis. To illustrate the efficiency of this method, alignments of 10 mitochondrial proteins from several completely sequenced mitochondrial genomes belonging to diverse eukaryotes were used as examples. The percentages of removed positions were higher in the most divergent alignments. After removing divergent segments, the amino acid composition of the different sequences was more uniform, and pairwise distances became much smaller. Phylogenetic trees show that topologies can be different after removing conserved blocks, particularly when there are several poorly resolved nodes. Strong support was found for the grouping of animals and fungi but not for the position of more basal eukaryotes. The use of a computerized method such as the one presented here reduces to a certain extent the necessity of manually editing multiple alignments, makes the automation of phylogenetic analysis of large data sets feasible, and facilitates the reproduction of the final alignment by other researchers.
Pages: 540-552
Number of pages: 13
Related papers
57 in total
[11]   Phylogenetic analysis of the Hsp70 sequences reveals the monophyly of metazoa and specific phylogenetic relationships between animals and fungi [J].
Borchiellini, C ;
Boury-Esnault, N ;
Vacelet, J ;
Le Parco, Y .
MOLECULAR BIOLOGY AND EVOLUTION, 1998, 15 (06) :647-655
[12]   Complete sequence of the mitochondrial DNA of the red alga Porphyra purpurea: Cyanobacterial introns and shared ancestry of red and green algae [J].
Burger, G ;
Saint-Louis, D ;
Gray, MW ;
Lang, BF .
PLANT CELL, 1999, 11 (09) :1675-1694
[13]   THE MITOCHONDRIAL-DNA OF THE AMEBOID PROTOZOAN, ACANTHAMOEBA-CASTELLANII - COMPLETE SEQUENCE, GENE CONTENT AND GENOME ORGANIZATION [J].
BURGER, G ;
PLANTE, I ;
LONERGAN, KM ;
GRAY, MW .
JOURNAL OF MOLECULAR BIOLOGY, 1995, 245 (05) :522-537
[14]  
Castresana J, 1998, GENETICS, V150, P1115
[15]   Codon reassignment and amino acid composition in hemichordate mitochondria [J].
Castresana, J ;
Feldmaier-Fuchs, G ;
Pääbo, S .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (07) :3703-3707
[16]   KINGDOM PROTOZOA AND ITS 18 PHYLA [J].
CAVALIERSMITH, T .
MICROBIOLOGICAL REVIEWS, 1993, 57 (04) :953-994
[17]   THE COMPLETE DNA-SEQUENCE OF THE MITOCHONDRIAL GENOME OF PODOSPORA-ANSERINA [J].
CUMMINGS, DJ ;
MCNALLY, KL ;
DOMENICO, JM ;
MATSUURA, ET .
CURRENT GENETICS, 1990, 17 (05) :375-402
[18]  
Dayhoff M.O., 1978, Atlas of Protein Sequence and Structure, P345
[19]   Complete sequence of the mitochondrial DNA of Chlamydomonas eugametos [J].
Denovan-Wright, EM ;
Nedelcu, AM ;
Lee, RW .
PLANT MOLECULAR BIOLOGY, 1998, 36 (02) :285-295
[20]   EVOLUTION OF NUCLEAR RIBOSOMAL-RNAS IN KINETOPLASTID PROTOZOA - PERSPECTIVES ON THE AGE AND ORIGINS OF PARASITISM [J].
FERNANDES, AP ;
NELSON, K ;
BEVERLEY, SM .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1993, 90 (24) :11608-11612