A random sequencing approach for the analysis of the Trypanosoma cruzi genome: General structure, large gene and repetitive DNA families, and gene discovery

Cited by: 45
Authors
Agüero, F [1]
Verdún, RE [1]
Frasch, ACC [1]
Sánchez, DO [1]
Affiliations
[1] Univ Nacl Gen San Martin, Consejo Nacl Invest Cient & Tecn, Inst Invest Biotecnol, Inst Tecnol Chascomus, RA-1650 San Martin, Buenos Aires, Argentina
DOI
10.1101/gr.GR-1463R
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
A random sequence survey of the genome of Trypanosoma cruzi, the agent of Chagas disease, was performed and 11,459 genomic sequences were obtained, resulting in ~4.3 Mb of readable sequences or ~10% of the parasite haploid genome. The estimated total GC content was 50.9%, with a high representation of A and T di- and trinucleotide repeats. Out of the estimated 5000 parasite genes, 947 putative new genes were identified. Another 1723 sequences corresponded to genes detected previously in T. cruzi through expression sequence tag analysis. 7735 sequences had no matches in the database, but the presence of open reading frames that passed Fickett's test suggests that some might contain coding DNA. The survey was highly redundant, with ~35% of the sequences included in a few large sequence families. Some of them code for protein families present in dozens of copies, including proteins essential for parasite survival and retrotransposons. Other sequence families include repetitive DNA present in thousands of copies per haploid genome. Some families in the latter group are new, parasite-specific, repetitive DNAs. These results suggest that T. cruzi could constitute an interesting model to analyze gene and genome evolution due to its plasticity in terms of sequence amplification and divergence. Additional information can be found at http://www.iib.unsam.edu.ar/tcruzi.gss.html.
Pages: 1996-2005
Page count: 10