The reconstruction of doubled genomes

被引:64
作者
El-Marbrouk, N
Sankoff, D
机构
[1] Univ Montreal, Dept Informat & Rech Operat, Montreal, PQ H3C 3J7, Canada
[2] Univ Montreal, Ctr Rech Math, Montreal, PQ H3C 3J7, Canada
关键词
genome duplication; genome rearrangement; signed genes; reversal; translocation; Hannenhalli-Pevzner graph; exact polynomial algorithms;
D O I
10.1137/S0097539700377177
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The genome can be modeled as a set of strings (chromosomes) of distinguished elements called genes. Genome duplication is an important source of new gene functions and novel physiological pathways. Originally (ancestrally), a duplicated genome contains two identical copies of each chromosome, but through the genomic rearrangement mutational processes of reciprocal translocation (prefix and/or suffix exchanges between chromosomes) and substring reversals, this simple doubled structure is disrupted. At the time of observation, each of the chromosomes resulting from the accumulation of rearrangements can be decomposed into a succession of conserved segments, such that each segment appears exactly twice in the genome. We present exact algorithms for reconstructing the ancestral doubled genome in linear time, minimizing the number of rearrangement mutations required to derive the observed order of genes along the present-day chromosomes. Somewhat different techniques are required for a translocations-only model, a translocations/reversals model, both of these in the multichromosomal context (eukaryotic nuclear genomes), and a reversals-only model for single chromosome prokaryotic and organellar genomes. We apply these methods to the yeast genome, which is thought to have doubled, and to the liverwort mitochondrial genome, whose duplicate genes are unlikely to have arisen by genome doubling.
引用
收藏
页码:754 / 792
页数:39
相关论文
共 40 条
[11]  
ELMABROUK N, 1999, LECT NOTES COMPUT SC, V1645, P78
[12]   Pattern and timing of gene duplication in animal genomes [J].
Friedman, R ;
Hughes, AL .
GENOME RESEARCH, 2001, 11 (11) :1842-1847
[13]   The coevolution of gene family trees [J].
Fryxell, KJ .
TRENDS IN GENETICS, 1996, 12 (09) :364-369
[14]   DNA sequence evidence for the segmental allotetraploid origin of maize [J].
Gaut, BS ;
Doebley, JF .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (13) :6809-6814
[15]  
Hannenhalli S, 1995, AN S FDN CO, P581, DOI 10.1109/SFCS.1995.492588
[16]  
Hannenhalli S, 1995, LECT NOTES COMPUT SC, V937, P162
[17]  
Hannenhalli S., 1995, Proceedings of the Twenty-Seventh Annual ACM Symposium on the Theory of Computing, P178, DOI 10.1145/225058.225112
[18]  
Herdman M., 1985, EVOLUTION GENOME SIZ, P37
[19]   EVOLUTION OF CELLULAR DNA CONTENT IN TELEOST FISHES [J].
HINEGARD.R .
AMERICAN NATURALIST, 1968, 102 (928) :517-&
[20]   Phylogenies of developmentally important proteins do not support the hypothesis of two rounds of genome duplication early in vertebrate history [J].
Hughes, AL .
JOURNAL OF MOLECULAR EVOLUTION, 1999, 48 (05) :565-576