Mauve: Multiple alignment of conserved genomic sequence with rearrangements

Cited by: 3562
Authors
Darling, ACE
Mau, B
Blattner, FR
Perna, NT
Affiliations
[1] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
[2] Univ Wisconsin, Dept Anim Hlth & Biomed Sci, Madison, WI 53706 USA
[3] Univ Wisconsin, Dept Oncol, Madison, WI 53706 USA
[4] Univ Wisconsin, Dept Genet, Madison, WI 53706 USA
[5] Univ Wisconsin, Genome Ctr Wisconsin, Madison, WI 53706 USA
DOI
10.1101/gr.2289704
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
As genomes evolve, they undergo large-scale evolutionary processes that present a challenge to sequence comparison not posed by short sequences. Recombination causes frequent genome rearrangements, horizontal transfer introduces new sequences into bacterial chromosomes, and deletions remove segments of the genome. Consequently, each genome is a mosaic of unique lineage-specific segments, regions shared with a subset of other genomes, and segments conserved among all the genomes under consideration. Furthermore, the linear order of these segments may be shuffled among genomes. We present methods for identification and alignment of conserved genomic DNA in the presence of rearrangements and horizontal transfer. Our methods have been implemented in a software package called Mauve. Mauve has been applied to align nine enterobacterial genomes and to determine global rearrangement structure in three mammalian genomes. We have evaluated the quality of Mauve alignments and drawn comparison to other methods through extensive simulations of genome evolution.
Pages: 1394-1403
Page count: 10
Related papers
45 in total
[1]   A linear-time algorithm for computing inversion distance between signed permutations with an experimental study [J].
Bader, DA ;
Moret, BME ;
Yan, M .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (05) :483-491
[2]   Human and mouse gene structure: Comparative analysis and application to exon prediction [J].
Batzoglou, S ;
Pachter, L ;
Mesirov, JP ;
Berger, B ;
Lander, ES .
GENOME RESEARCH, 2000, 10 (07) :950-958
[3]  
Blanchette, 1997, Genome Inform Ser Workshop Genome Inform, V8, P25
[4]   The complete genome sequence of Escherichia coli K-12 [J].
Blattner, FR ;
Plunkett, G ;
Bloch, CA ;
Perna, NT ;
Burland, V ;
Riley, M ;
ColladoVides, J ;
Glasner, JD ;
Rode, CK ;
Mayhew, GF ;
Gregor, J ;
Davis, NW ;
Kirkpatrick, HA ;
Goeden, MA ;
Rose, DJ ;
Mau, B ;
Shao, Y .
SCIENCE, 1997, 277 (5331) :1453-+
[5]  
Bourque G, 2002, GENOME RES, V12, P26
[6]   MAVID multiple alignment server [J].
Bray, N ;
Pachter, L .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3525-3526
[7]   AVID: A global alignment program [J].
Bray, N ;
Dubchak, I ;
Pachter, L .
GENOME RESEARCH, 2003, 13 (01) :97-102
[8]   LAGAN and Multi-LAGAN: Efficient tools for large-scale multiple alignment of genomic DNA [J].
Brudno, M ;
Do, CB ;
Cooper, GM ;
Kim, MF ;
Davydov, E ;
Green, ED ;
Sidow, A ;
Batzoglou, S .
GENOME RESEARCH, 2003, 13 (04) :721-731
[9]   Glocal alignment: finding rearrangements during alignment [J].
Brudno, Michael ;
Malde, Sanket ;
Poliakov, Alexander ;
Do, Chuong B. ;
Couronne, Olivier ;
Dubchak, Inna ;
Batzoglou, Serafim .
BIOINFORMATICS, 2003, 19 :i54-i62
[10]   Fast identification and statistical evaluation of segmental homologies in comparative maps [J].
Calabrese, Peter P. ;
Chakravarty, Sugata ;
Vision, Todd J. .
BIOINFORMATICS, 2003, 19 :i74-i80