Detection of transposable elements by their compositional bias -: art. no. 94

被引:18
作者
Andrieu, O [1 ]
Fiston, AS [1 ]
Anxolabéhère, D [1 ]
Quesneville, H [1 ]
机构
[1] Inst Jacques Monod, Lab Dynam Genome & Evolut, F-75251 Paris 05, France
关键词
D O I
10.1186/1471-2105-5-94
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Transposable elements (TE) are mobile genetic entities present in nearly all genomes. Previous work has shown that TEs tend to have a different nucleotide composition than the host genes, either considering codon usage bias or dinucleotide frequencies. We show here how these compositional differences can be used as a tool for detection and analysis of TE sequences. Results: We compared the composition of TE sequences and host gene sequences using probabilistic models of nucleotide sequences. We used hidden Markov models (HMM), which take into account the base composition of the sequences (occurrences of words n nucleotides long, with n ranging here from 1 to 4) and the heterogeneity between coding and non-coding parts of sequences. We analyzed three sets of sequences containing class I TEs, class II TEs and genes respectively in three species: Drosophila melanogaster, Caenorhabditis elegans and Arabidopsis thaliana. Each of these sets had a distinct, homogeneous composition, enabling us to distinguish between the two classes of TE and the genes. However the particular base composition of the TEs differed in the three species studied. Conclusions: This approach can be used to detect and annotate TEs in genomic sequences and complements the current homology-based TE detection methods. Furthermore, the HMM method is able to identify the parts of a sequence in which the nucleotide composition resembles that of a coding region of a TE. This is useful for the detailed annotation of TE sequences, which may contain an ancient, highly diverged coding region that is no longer fully functional.
引用
收藏
页数:13
相关论文
共 20 条
[1]  
[Anonymous], GENOME BIOL
[2]  
ASHBURNER M, 1989, DROSOPHILA LAB HDB, P76
[3]   DNA synthesis fidelity by the reverse transcriptase of the yeast retrotransposon Ty1 [J].
Boutabout, M ;
Wilhelm, M ;
Wilhelm, FX .
NUCLEIC ACIDS RESEARCH, 2001, 29 (11) :2217-2222
[4]   Prediction of complete gene structures in human genomic DNA [J].
Burge, C ;
Karlin, S .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (01) :78-94
[5]  
CHURCHILL GA, 1989, B MATH BIOL, V51, P79
[6]   Structural features of the mdg1 lineage of the Ty3/gypsy group of LTR retrotransposons inferred from the phylogenetic analyses of its open reading frames [J].
Costas, J ;
Valadé, E ;
Naveira, H .
JOURNAL OF MOLECULAR EVOLUTION, 2001, 53 (03) :165-171
[7]   Genome size reduction through illegitimate recombination counteracts genome expansion in Arabidopsis [J].
Devos, KM ;
Brown, JKM ;
Bennetzen, JL .
GENOME RESEARCH, 2002, 12 (07) :1075-1079
[8]  
DURBIN R, 1998, BIOL SEQUENCE ANAL P, P46
[9]   Repbase Update - a database and an electronic journal of repetitive elements [J].
Jurka, J .
TRENDS IN GENETICS, 2000, 16 (09) :418-420
[10]  
KARLIN S, 1995, TRENDS GENET, V11, P283