δ plots:: A tool for analyzing phylogenetic distance data

被引:118
作者
Holland, BR
Huber, KT
Dress, A
Moulton, V
机构
[1] Ruhr Univ Bochum, Lehrstuhl Spezielle Zool, D-44780 Bochum, Germany
[2] BMC, Linnaeus Ctr Bioinformat, Uppsala, Sweden
[3] Univ Bielefeld, FSPM Strukturbildungsprozesse, D-4800 Bielefeld, Germany
[4] Swedish Univ Agr Sci, Dept Biometry & Informat, Uppsala, Sweden
关键词
genetic distance; statistical geometry; phylogenetic analysis; tree reconstruction; assessment of data quality; recombination;
D O I
10.1093/oxfordjournals.molbev.a004030
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A method is described that allows the assessment of treelikeness of phylogenetic distance data before tree estimation. This method is related to statistical geometry as introduced by Eigen, Winkler-Oswatitsch, and Dress (1988 [Proc. Natl. Acad. Sci. USA. 85:5913-5917]), and in essence, displays a measure for treelikeness of quartets in terms of a histogram that we call a 8 plot. This allows identification of nontreelike data and analysis of noisy data sets arising from processes such as, for example, parallel evolution, recombination, or lateral gene transfer. In addition to an overall assessment of treelikeness, individual taxa can be ranked by reference to the treelikeness of the quartets to which they belong. Removal of taxa on the basis of this ranking results in an increase in accuracy of tree estimation. Recombinant data sets are simulated, and the method is shown to be capable of identifying single recombinant taxa on the basis of distance information alone, provided the parents of the recombinant sequence are sufficiently divergent and the mixture of tree histories is not strongly skewed toward a single tree. 8 Plots and taxon rankings are applied to three biological data sets using distances derived from sequence alignment, gene order, and fragment length polymorphism.
引用
收藏
页码:2051 / 2059
页数:9
相关论文
共 32 条
[1]   Split Decomposition: A New and Useful Approach to Phylogenetic Analysis of Distance Data [J].
Bandelt, Hans-Juergen ;
Dress, Andreas W. M. .
MOLECULAR PHYLOGENETICS AND EVOLUTION, 1992, 1 (03) :242-252
[2]   Recombination between sequences of hepatitis B virus from different genotypes [J].
Bollyky, PL ;
Rambaut, A ;
Harvey, PH ;
Holmes, EC .
JOURNAL OF MOLECULAR EVOLUTION, 1996, 42 (02) :97-102
[3]  
BUNEMAN P, 1971, MATH ARCHEOLOGICAL H, P287
[4]  
DRESS A, 1988, BEDENTUNG BERLIN AUS
[5]   STATISTICAL GEOMETRY IN SEQUENCE SPACE - A METHOD OF QUANTITATIVE COMPARATIVE SEQUENCE-ANALYSIS [J].
EIGEN, M ;
WINKLEROSWATITSCH, R ;
DRESS, A .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1988, 85 (16) :5913-5917
[6]  
EIGEN M, 1990, METHOD ENZYMOL, V183, P505
[7]  
EIGEN M, 1985, EVOLUTIN, V39, P783
[8]  
FELSENSTEIN J, 1993, PHYLIP VERSION 3 5C
[9]   Molecular markers reveal that population structure of the human pathogen Candida albicans exhibits both clonality and recombination [J].
Graser, Y ;
Volovsek, M ;
Arrington, J ;
Schonian, G ;
Presber, W ;
Mitchell, TG ;
Vilgalys, R .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (22) :12473-12477
[10]   A likelihood method for the detection of selection and recombination using nucleotide sequences [J].
Grassly, NC ;
Holmes, EC .
MOLECULAR BIOLOGY AND EVOLUTION, 1997, 14 (03) :239-247