Mauve: Multiple alignment of conserved genomic sequence with rearrangements

被引:3666
作者
Darling, ACE
Mau, B
Blattner, FR
Perna, NT
机构
[1] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
[2] Univ Wisconsin, Dept Anim Hlth & Biomed Sci, Madison, WI 53706 USA
[3] Univ Wisconsin, Dept Oncol, Madison, WI 53706 USA
[4] Univ Wisconsin, Dept Genet, Madison, WI 53706 USA
[5] Univ Wisconsin, Genome Ctr Wisconsin, Madison, WI 53706 USA
关键词
D O I
10.1101/gr.2289704
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
As genomes evolve, they undergo large-scale evolutionary processes that present a challenge to sequence comparison not posed by short sequences. Recombination causes frequent genome rearrangements, horizontal transfer introduces new sequences into bacterial chromosomes, and deletions remove segments of the genome. Consequently, each genome is a mosaic Of unique lineage-specific segments, regions shared with a subset of other genomes and segments conserved among all the genomes under consideration. Furthermore, the linear order of these segments may be shuffled among genomes. We present methods for identification and alignment of conserved genomic DNA in the presence of rearrangements and horizontal transfer. Our methods have been implemented in a software package called Mauve. Mauve has been applied to align nine enterobacterial genomes and to determine global rearrangement structure in three mammalian genomes. We have evaluated the quality of Mauve alignments and drawn comparison to other methods through extensive simulations of genome evolution.
引用
收藏
页码:1394 / 1403
页数:10
相关论文
共 45 条
[41]   A comprehensive comparison of multiple sequence alignment programs [J].
Thompson, JD ;
Plewniak, F ;
Poch, O .
NUCLEIC ACIDS RESEARCH, 1999, 27 (13) :2682-2690
[42]   Genome rearrangement by replication-directed translocation [J].
Tillier, ERM ;
Collins, RA .
NATURE GENETICS, 2000, 26 (02) :195-197
[43]   Comparative genomics: Genome-wide analysis in metazoan eukaryotes [J].
Ureta-Vidal, A ;
Ettwiller, L ;
Birney, E .
NATURE REVIEWS GENETICS, 2003, 4 (04) :251-262
[44]   Complete genome sequence and comparative genomics of Shigella flexneri serotype 2a strain 2457T [J].
Wei, J ;
Goldberg, MB ;
Burland, V ;
Venkatesan, MM ;
Deng, W ;
Fournier, G ;
Mayhew, GF ;
Plunkett, G ;
Rose, DJ ;
Darling, A ;
Mau, B ;
Perna, NT ;
Payne, SM ;
Runyen-Janecky, LJ ;
Zhou, S ;
Schwartz, DC ;
Blattner, FR .
INFECTION AND IMMUNITY, 2003, 71 (05) :2775-2786
[45]   Extensive mosaic structure revealed by the complete genome sequence of uropathogenic Escherichia coli [J].
Welch, RA ;
Burland, V ;
Plunkett, G ;
Redford, P ;
Roesch, P ;
Rasko, D ;
Buckles, EL ;
Liou, SR ;
Boutin, A ;
Hackett, J ;
Stroud, D ;
Mayhew, GF ;
Rose, DJ ;
Zhou, S ;
Schwartz, DC ;
Perna, NT ;
Mobley, HLT ;
Donnenberg, MS ;
Blattner, FR .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (26) :17020-17024