Non-LTR retrotransposons in the African malaria mosquito, Anopheles gambiae:: Unprecedented diversity and evidence of recent activity

被引:58
作者
Biedler, J [1 ]
Tu, ZJ [1 ]
机构
[1] Virginia Polytech Inst & State Univ, Dept Biochem, Blacksburg, VA 24061 USA
关键词
genome; molecular evolution; non-LTR retrotransposon; polyadenylation; retrotransposition; reverse transcriptase;
D O I
10.1093/molbev/msg189
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Over a hundred families of non-long terminal repeat retrotransposons (non-LTRs) were found in the newly released Anopheles gambiae genome assembly during a reiterative and comprehensive search using the conserved reverse transcriptase (RT) domains of known non-LTRs as the starting queries. These families, which are defined by at least 20% amino acid sequence divergence in their RT domains, range from a few to approximately 2,000 copies and occupy at least 3% of the genome. In addition to having an unprecedented number of diverse families, A. gambiae non-LTRs represent 8 of the 15 previously defined clades plus two novel clades, Loner and Outcast, more than what has been reported for any genome. Five families were found belonging to the L1 clade, which had no invertebrate representatives to date. One unique family named Sponge contains only a complete open reading frame (ORF) for the Gag-like protein and appears to have been mobilized by a family of the CRI clade. Although most families appear to be inactive as expected, all clades except R4 have families with characteristics suggesting recent activity. At least 21 families have multiple full-length copies with over 99% nucleotide identity and some or all of the following characteristics: target site duplications (TSDs), intact ORFs, and corresponding expressed sequence tags (ESTs). The incredible diversity and the maintenance of multiple recently active lineages within different clades indicate a complex evolutionary scenario. A. gambiae non-LTRs have the potential to be developed as tools for population genetic studies and genetic manipulations of this primary vector of the devastating disease malaria. The semi-automated reiterative search approach described here may be used with any genome assembly to systematically survey and characterize non-LTRs as well as other transposable elements that encode a conserved protein.
引用
收藏
页码:1811 / 1825
页数:15
相关论文
共 51 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes [J].
Aparicio, S ;
Chapman, J ;
Stupka, E ;
Putnam, N ;
Chia, J ;
Dehal, P ;
Christoffels, A ;
Rash, S ;
Hoon, S ;
Smit, A ;
Gelpke, MDS ;
Roach, J ;
Oh, T ;
Ho, IY ;
Wong, M ;
Detter, C ;
Verhoef, F ;
Predki, P ;
Tay, A ;
Lucas, S ;
Richardson, P ;
Smith, SF ;
Clark, MS ;
Edwards, YJK ;
Doggett, N ;
Zharkikh, A ;
Tavtigian, SV ;
Pruss, D ;
Barnstead, M ;
Evans, C ;
Baden, H ;
Powell, J ;
Glusman, G ;
Rowen, L ;
Hood, L ;
Tan, YH ;
Elgar, G ;
Hawkins, T ;
Venkatesh, B ;
Rokhsar, D ;
Brenner, S .
SCIENCE, 2002, 297 (5585) :1301-1310
[3]   Automated de novo identification of repeat sequence families in sequenced genomes [J].
Bao, ZR ;
Eddy, SR .
GENOME RESEARCH, 2002, 12 (08) :1269-1276
[4]   AFRICAN ORIGIN OF HUMAN-SPECIFIC POLYMORPHIC ALU INSERTIONS [J].
BATZER, MA ;
STONEKING, M ;
ALEGRIAHARTMAN, M ;
BAZAN, H ;
KASS, DH ;
SHAIKH, TH ;
NOVICK, GE ;
IOANNOU, PA ;
SCHEER, WD ;
HERRERA, RJ ;
DEININGER, PL .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1994, 91 (25) :12288-12292
[5]   CM-gag, a transposable-like element reiterated in the genome of Culex pipiens mosquitoes, contains only a gag gene [J].
Bensaadi-Merchermek, N ;
Cagnon, C ;
Desmons, I ;
Salvado, JC ;
Karama, S ;
D'Amico, F ;
Mouchès, C .
GENETICA, 1997, 100 (1-3) :141-148
[6]  
Berezikov E, 2000, GENOME BIOL, V1
[7]  
Besansky N. J., 1994, Insect Molecular Biology, V3, P49, DOI 10.1111/j.1365-2583.1994.tb00150.x
[8]   DISTINCT FAMILIES OF SITE-SPECIFIC RETROTRANSPOSONS OCCUPY IDENTICAL POSITIONS IN THE RIBOSOMAL-RNA GENES OF ANOPHELES-GAMBIAE [J].
BESANSKY, NJ ;
PASKEWITZ, SM ;
HAMM, DM ;
COLLINS, FH .
MOLECULAR AND CELLULAR BIOLOGY, 1992, 12 (11) :5102-5110
[9]  
BESANSKY NJ, 1990, MOL BIOL EVOL, V7, P229
[10]  
Besansky NJ, 1999, PARASSITOLOGIA, VOL 41, NOS 1-3, SEPTEMBER 1999, P97