PlantSat:: a specialized database for plant satellite repeats

被引:102
作者
Macas, J [1 ]
Mészáros, T [1 ]
Nouzová, M [1 ]
机构
[1] Inst Plant Mol Biol, Lab Mol Cytogenet, CZ-37005 Ceske Budejovice, Czech Republic
关键词
D O I
10.1093/bioinformatics/18.1.28
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Tandemly organized repetitive sequences (satellite DNA) are widespread in complex eukaryotic genomes. In plants, satellite repeats often represent a substantial part of nuclear DNA but only a little is known about the molecular mechanisms of their amplification and their possible role(s) in genome evolution and function. Unfortunately, addressing these questions via characterization of general sequence properties of known satellite repeats has been hindered by a difficulty in obtaining a complete and unbiased set of sequence data for this analysis. This is mainly due to the presence of multiple entries of homologous sequences and of single entries that contain more than one repeated unit (monomer) in the public databases. Results: We have established a computer database specialized for plant satellite repeats (PlantSat) that integrates sequence data available from various resources with supplementary information including repeat consensus sequences, abundances, and chromosomal localizations. The sequences are stored as individual repeat monomers grouped into families, which simplifies their computer analysis and makes it more accurate. Using this feature, we have performed a basic sequence analysis of the whole set of plant satellite repeats with respect to their monomer length and nucleotide composition. The analysis revealed several preferred length ranges of the monomers (similar to165 bp and its multiples) and an over-representation of the AA/TT dinucleotide in the repeats. We have also detected an enrichment of satellite DNA sequences for the motif CAAAA that is supposed to be involved in breakage-reunion of repeated sequences.
引用
收藏
页码:28 / 35
页数:8
相关论文
共 34 条
[11]   RISSC:: a novel database for ribosomal 16S-23S RNA genes spacer regions [J].
García-Martínez, J ;
Bescós, I ;
Rodríguez-Sala, JJ ;
Rodríguez-Valera, F .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :178-180
[12]   CHARACTERIZATION OF A NEW FAMILY OF TOBACCO HIGHLY REPETITIVE DNA, GRS, SPECIFIC FOR THE NICOTIANA-TOMENTOSIFORMIS GENOMIC COMPONENT [J].
GAZDOVA, B ;
SIROKY, J ;
FAJKUS, J ;
BRZOBOHATY, B ;
KENTON, A ;
PAROKONNY, A ;
HESLOPHARRISON, JS ;
PALME, K ;
BEZDEK, M .
CHROMOSOME RESEARCH, 1995, 3 (04) :245-254
[13]   ORIGIN OF THE MAIN CLASS OF REPETITIVE DNA WITHIN SELECTED PENNISETUM SPECIES [J].
INGHAM, LD ;
HANNA, WW ;
BAIER, JW ;
HANNAH, LC .
MOLECULAR & GENERAL GENETICS, 1993, 238 (03) :350-356
[14]   NUCLEOTIDE-SEQUENCE OF A HIGHLY REPEATED DNA-SEQUENCE AND ITS CHROMOSOMAL LOCALIZATION IN ALLIUM-FISTULOSUM [J].
IRIFUNE, K ;
HIRAI, K ;
ZHENG, J ;
TANAKA, R ;
MORIKAWA, H .
THEORETICAL AND APPLIED GENETICS, 1995, 90 (3-4) :312-316
[15]   Comparative DNA analysis across diverse genomes [J].
Karlin, S ;
Campbell, AM ;
Mrázek, J .
ANNUAL REVIEW OF GENETICS, 1998, 32 :185-225
[16]  
KARLIN S, 1995, TRENDS GENET, V11, P283
[17]   SEQUENCE-ANALYSIS OF VICIA-FABA REPEATED DNA, THE FOKI REPEAT ELEMENT [J].
KATO, A ;
YAKURA, K ;
TANIFUJI, S .
NUCLEIC ACIDS RESEARCH, 1984, 12 (16) :6415-6426
[18]  
Katsiotis A, 1998, GENOME, V41, P527, DOI 10.1139/gen-41-4-527
[19]   DNA BENDING AT ADENINE . THYMINE TRACTS [J].
KOO, HS ;
WU, HM ;
CROTHERS, DM .
NATURE, 1986, 320 (6062) :501-506
[20]   Repetitive DNA elements as a major component of plant genomes [J].
Kubis, S ;
Schmidt, T ;
Heslop-Harrison, JS .
ANNALS OF BOTANY, 1998, 82 :45-55