Integrating ambiguously aligned regions of DNA sequences in phylogenetic analyses without violating positional homology

被引:319
作者
Lutzoni, F
Wagner, P
Reeb, V
Zoller, S
机构
[1] Field Museum Nat Hist, Dept Bot, Chicago, IL 60605 USA
[2] Field Museum Nat Hist, Dept Geol, Chicago, IL 60605 USA
关键词
character coding; character-state weighting; crocodile 12S mtrDNA; indel-rich region; insect 16S mtrDNA; intron; large-scale phylogenies; molecular phylogenies; multiple sequence alignment; non-protein-coding DNA sequences; ribosomal RNA genes;
D O I
10.1080/106351500750049743
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Phylogenetic analyses of non-protein-coding nucleotide sequences such as ribosomal RNA genes, internal transcribed spacers, and introns are often impeded by regions of the alignments that are ambiguously aligned. These regions are characterized by the presence of gaps and their uncertain positions, no matter which optimization criteria are used. This problem is particularly acute in large-scale phylogenetic studies and when aligning highly diverged sequences. Accommodating these regions, where positional homology is likely to be violated, in phylogenetic analyses has been dealt with very differently by molecular systematists and evolutionists, ranging from the total exclusion of these regions to the inclusion of every position regardless of ambiguity in the alignment. We present a new method that allows the inclusion of ambiguously aligned regions without violating homology. In this three-step procedure, first homologous regions of the alignment containing ambiguously aligned sequences are delimited. Second, each ambiguously aligned region is unequivocally coded as a new character, replacing its respective ambiguous region. Third, each of the coded characters is subjected to a specific step matrix to account for the differential number of changes (summing substitutions and indels) needed to transform one sequence to another. The optimal number of steps included in the step matrix is the one derived from the pairwise alignment with the greatest similarity and the least number of steps. In addition to potentially enhancing phylogenetic resolution and support, by integrating previously nonaccessible characters without violating positional homology, this new approach can improve branch length estimations when using parsimony.
引用
收藏
页码:628 / 651
页数:24
相关论文
共 63 条
[1]  
[Anonymous], 1983, SIAM REV
[2]  
[Anonymous], 1990, GENETIC DATA ANAL
[3]  
[Anonymous], MOL SYSTEMATICS PLAN
[4]  
[Anonymous], 1992, MacClade: Analysis of phylogeny and character evolution
[5]  
BALDWIN BG, 1995, ANN MO BOT GARD, V82, P257
[6]  
BARRIEL V, 1994, CR ACAD SCI III-VIE, V317, P693
[7]   A PHYLOGENETIC ANALYSIS OF EPILOBIUM (ONAGRACEAE) BASED ON NUCLEAR RIBOSOMAL DNA-SEQUENCES [J].
BAUM, DA ;
SYTSMA, KJ ;
HOCH, PC .
SYSTEMATIC BOTANY, 1994, 19 (03) :363-388
[8]   Loculoascomycete origins and evolution of filamentous ascomycete morphology based on 18S rRNA gene sequence data [J].
Berbee, ML .
MOLECULAR BIOLOGY AND EVOLUTION, 1996, 13 (03) :462-470
[9]   DATING THE EVOLUTIONARY RADIATIONS OF THE TRUE FUNGI [J].
BERBEE, ML ;
TAYLOR, JW .
CANADIAN JOURNAL OF BOTANY-REVUE CANADIENNE DE BOTANIQUE, 1993, 71 (08) :1114-1127
[10]  
BREMER K, 1988, EVOLUTION, V42, P795, DOI [10.2307/2408870, 10.1111/j.1558-5646.1988.tb02497.x]