Serendipitous discovery of Wolbachia genomes in multiple Drosophila species (art. no. R23)

Cited by: 106
Authors
Salzberg, SL
Hotopp, JCD
Delcher, AL
Pop, M
Smith, DR
Eisen, MB
Nelson, WC
Affiliations
[1] Inst Genome Res, Rockville, MD 20850 USA
[2] Agencourt Biosci Corp, Beverly, MA 01915 USA
[3] Univ Calif Berkeley, Ctr Integrat Genom, Berkeley, CA 94720 USA
DOI
10.1186/gb-2005-6-3-r23
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
Background: The Trace Archive is a repository for the raw, unanalyzed data generated by large-scale genome sequencing projects. The existence of this data offers scientists the possibility of discovering additional genomic sequences beyond those originally sequenced. In particular, if the source DNA for a sequencing project came from a species that was colonized by another organism, then the project may yield substantial amounts of genomic DNA, including near-complete genomes, from the symbiotic or parasitic organism.

Results: By searching the publicly available repository of DNA sequencing trace data, we discovered three new species of the bacterial endosymbiont Wolbachia pipientis in three different species of fruit fly: Drosophila ananassae, D. simulans, and D. mojavensis. We extracted all sequences with partial matches to a previously sequenced Wolbachia strain and assembled those sequences using customized software. For one of the three new species, the data recovered were sufficient to produce an assembly that covers more than 95% of the genome; for a second species the data produce the equivalent of a 'light shotgun' sampling of the genome, covering an estimated 75-80% of the genome; and for the third species the data cover approximately 6-7% of the genome.

Conclusions: The results of this study reveal an unexpected benefit of depositing raw data in a central genome sequence repository: new species can be discovered within this data. The differences between these three new Wolbachia genomes and the previously sequenced strain revealed numerous rearrangements and insertions within each lineage and hundreds of novel genes. The three new genomes, with annotation, have been deposited in GenBank.
Pages: 8
Related papers
36 entries in total
[1]   Repeated, recent and diverse transfers of a mitochondrial gene to the nucleus in flowering plants [J].
Adams, KL ;
Daley, DO ;
Qiu, YL ;
Whelan, J ;
Palmer, JD .
NATURE, 2000, 408 (6810) :354-357
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   Bacteriophage flux in endosymbionts (Wolbachia): Infection frequency, lateral transfer, and recombination rates [J].
Bordenstein, SR ;
Wernegreen, JJ .
MOLECULAR BIOLOGY AND EVOLUTION, 2004, 21 (10) :1981-1991
[4]  
Bourtzis K, 1996, GENETICS, V144, P1063
[5]   Multiple sequence alignment with the Clustal series of programs [J].
Chenna, R ;
Sugawara, H ;
Koike, T ;
Lopez, R ;
Gibson, TJ ;
Higgins, DG ;
Thompson, JD .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3497-3500
[6]   Genetic definition and sequence analysis of Arabidopsis centromeres [J].
Copenhaver, GP ;
Nickel, K ;
Kuromori, T ;
Benito, MI ;
Kaul, S ;
Lin, XY ;
Bevan, M ;
Murphy, G ;
Harris, B ;
Parnell, LD ;
McCombie, WR ;
Martienssen, RA ;
Marra, M ;
Preuss, D .
SCIENCE, 1999, 286 (5449) :2468-2474
[7]   Alignment of whole genomes [J].
Delcher, AL ;
Kasif, S ;
Fleischmann, RD ;
Peterson, J ;
White, O ;
Salzberg, SL .
NUCLEIC ACIDS RESEARCH, 1999, 27 (11) :2369-2376
[8]   Improved microbial gene identification with GLIMMER [J].
Delcher, AL ;
Harmon, D ;
Kasif, S ;
White, O ;
Salzberg, SL .
NUCLEIC ACIDS RESEARCH, 1999, 27 (23) :4636-4641
[9]   Fast algorithms for large-scale genome alignment and comparison [J].
Delcher, AL ;
Phillippy, A ;
Carlton, J ;
Salzberg, SL .
NUCLEIC ACIDS RESEARCH, 2002, 30 (11) :2478-2483
[10]   Wolbachia infections are distributed throughout insect somatic and germ line tissues [J].
Dobson, SL ;
Bourtzis, K ;
Braig, HR ;
Jones, BF ;
Zhou, WG ;
Rousset, F ;
O'Neill, SL .
INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1999, 29 (2) :153-160