A Unified Approach for Reconstructing Ancient Gene Clusters

被引:18
作者
Stoye, Jens [1 ]
Wittler, Roland [1 ]
机构
[1] Univ Bielefeld, Fac Technol, Genome Informat Grp, D-33594 Bielefeld, Germany
关键词
Comparative genomics; gene order; gene cluster; gene cluster reconstruction; phylogeny; parsimony; consistency; COMMON INTERVALS; EVOLUTION; TREE; PERMUTATIONS; CONSERVATION; ALGORITHMS; PROTEINS; FAMILIES; DATABASE; OPERONS;
D O I
10.1109/TCBB.2008.135
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The order of genes in genomes provides extensive information. In comparative genomics, differences or similarities of gene orders are determined to predict functional relations of genes or phylogenetic relations of genomes. For this purpose, various combinatorial models can be used to identify gene clusters-groups of genes that are colocated in a set of genomes. We introduce a unified approach to model gene clusters and define the problem of labeling the inner nodes of a given phylogenetic tree with sets of gene clusters. Our optimization criterion in this context combines two properties: parsimony, i.e., the number of gains and losses of gene clusters has to be minimal, and consistency, i.e., for each ancestral node, there must exist at least one potential gene order that contains all the reconstructed clusters. We present and evaluate an exact algorithm to solve this problem. Despite its exponential worst-case time complexity, our method is suitable even for large-scale data. We show the effectiveness and efficiency on both simulated and real data.
引用
收藏
页码:387 / 400
页数:14
相关论文
共 26 条
[1]   Common intervals and symmetric difference in a model-free phylogenomics, with an application to streptophyte evolution [J].
Adam, Zaky ;
Turmel, Monique ;
Lemieux, Claude ;
Sankoff, David .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2007, 14 (04) :436-445
[2]  
[Anonymous], 2007, Pattern Discovery in Bioinformatics: Theory and Algorithms
[3]  
[Anonymous], 1997, ACM SIGACT NEWS
[4]   GenBank [J].
Benson, DA ;
Karsch-Mizrachi, I ;
Lipman, DJ ;
Ostell, J ;
Rapp, BA ;
Wheeler, DL .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :15-18
[5]  
BERGERON A, 2007, BIOINFORMATICS ALGOR
[6]  
BERGERON A, 2004, P 4 INT WORKSH ALG B, P14
[7]   On the similarity of sets of permutations and its applications to genome comparison [J].
Bergeron, Anne ;
Stoye, Jens .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2006, 13 (07) :1340-1354
[8]  
BOCKER S, 2008, P 12 ANN INT C RES C, P331
[9]   TESTING FOR CONSECUTIVE ONES PROPERTY, INTERVAL GRAPHS, AND GRAPH PLANARITY USING PQ-TREE ALGORITHMS [J].
BOOTH, KS ;
LUEKER, GS .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1976, 13 (03) :335-379
[10]   Conservation of gene order: a fingerprint of proteins that physically interact [J].
Dandekar, T ;
Snel, B ;
Huynen, M ;
Bork, P .
TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (09) :324-328