The fine structure of galls in phylogenetic networks

被引:53
作者
Gusfield, D [1 ]
Eddhu, S
Langley, C
机构
[1] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA
[2] Univ Calif Davis, Div Evolut & Ecol, Davis, CA 95616 USA
关键词
molecular evolution; phylogenetic networks; ancestral recombination graph; recombination SNP; bi-convex graph;
D O I
10.1287/ijoc.1040.0099
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A phylogenetic network is a generalization of a phylogenetic tree, allowing properties that are not tree-like. With the growth of genomic data, much of which does not fit ideal tree models, there is greater need to understand the algorithmics and combinatorics of phylogenetic networks (Posada and Crandall 2001, Schierup and Hein 2000). Wang et al. (2001) studied the problem of constructing a phylogenetic network for a set of n binary sequences derived from the all-zero ancestral sequence, when each site in the sequence can mutate from zero to one at most once in the network, and recombination between sequences is allowed. They showed that the problem of minimizing the number of recombinations in such networks is NP-hard, but introduced a special case of the problem, i.e., to determine whether the sequences could be derived on a phylogenetic network where the recombination cycles are node-disjoint. Wang et al. (2001) provide a sufficient, but not a necessary test, for such solutions. Gusfield et al. (2003, 2004) gave a polynomial-time algorithm that is both a necessary and sufficient test. In this paper, we study in much more detail the fine combinatorial structure of node-disjoint cycles in phylogenetic networks, both for purposes of insight into phylogenetic networks and to speed up parts of the previous algorithm. We explicitly characterize all the ways in which mutations can be arranged on a disjoint cycle, and prove a strong necessary condition for a set of mutations to be on a disjoint cycle. The main contribution here is to show how structure in the phylogenetic network is reflected in the structure of an efficiently-computable graph, called the conflict graph. The success of this approach suggests that additional insight into the structure of phylogenetic networks can be obtained by exploring structural properties of the conflict graph.
引用
收藏
页码:459 / 469
页数:11
相关论文
共 18 条
[1]  
[Anonymous], 1997, ALGORITHMS STRINGS T, DOI DOI 10.1017/CBO9780511574931
[2]   COMPUTATIONAL-COMPLEXITY OF INFERRING PHYLOGENIES BY COMPATIBILITY [J].
DAY, WHE ;
SANKOFF, D .
SYSTEMATIC ZOOLOGY, 1986, 35 (02) :224-229
[3]   Strongly orderable graphs - A common generalization of strongly chordal and chordal bipartite graphs [J].
Dragan, FF .
DISCRETE APPLIED MATHEMATICS, 2000, 99 (1-3) :427-442
[4]  
Felsenstein Joseph, 2004, Inferring_phylogenies, V2
[5]   EFFICIENT ALGORITHMS FOR INFERRING EVOLUTIONARY TREES [J].
GUSFIELD, D .
NETWORKS, 1991, 21 (01) :19-28
[6]  
GUSFIELD D, 2004, OPTIMAL EFFICIENT RE
[7]  
GUSFIELD D, 2003, P 2 CBS BIOINF C
[8]  
Gusfield Dan, 2004, J Bioinform Comput Biol, V2, P173, DOI 10.1142/S0219720004000521
[9]   RECONSTRUCTING EVOLUTION OF SEQUENCES SUBJECT TO RECOMBINATION USING PARSIMONY [J].
HEIN, J .
MATHEMATICAL BIOSCIENCES, 1990, 98 (02) :185-200
[10]  
HEIN J, 1993, J MOL EVOL, V36, P396, DOI 10.1007/BF00182187