The repetitive sequence database and mining putative regulatory elements in gene promoter regions

被引:12
作者
Horng, JT
Huang, HD
Jin, MH
Wu, LC
Huang, SL
机构
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Chungli 320, Taiwan
[2] Natl Cent Univ, Dept Life Sci, Chungli 320, Taiwan
关键词
database; DNA; data mining; repetitive elements; genes;
D O I
10.1089/106652702760277354
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
At least 43% of the human genome is occupied by repetitive elements. Moreover, around 51% of the rice genome is occupied by repetitive elements. The analysis of repetitive elements reveals that repetitive elements in our genome may have been very important in the evolutionary genomics. The first part of this study is to describe a database of repetitive elements-RSDB. The RSDB database contains repetitive elements, which are classified into the following categories: exact, tandem, and similar. The interfaces needed to query and show the results and statistical data, such as the relationship between repetitive elements and genes, cross-references of repetitive elements among different organisms, and so on, are provided. The second part of this study then attempts to mine the putative binding site for information on how combinations of the known regulatory sites and overrepresented repetitive elements in RSDB are distributed in the promoter regions of groups of functionally related genes. The overrepresented repetitive elements appearing in the associations are possible transcription factor binding sites. Our proposed approach is applied to Saccharomyces cerevisiae and the promoter regions of Yeast ORFs. The complete contents of RSDB and partial putative binding sites are available to the public at www.rsdb.csie.ncu.edu.tw. The readers may download partial query results.
引用
收藏
页码:621 / 640
页数:20
相关论文
共 17 条
  • [1] Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
  • [2] Agrawal R., 1994, P 20 INT C VER LARG, V1215, P487
  • [3] ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
  • [4] GenBank
    Benson, DA
    Boguski, MS
    Lipman, DJ
    Ostell, J
    Ouellette, BFF
    Rapp, BA
    Wheeler, DL
    [J]. NUCLEIC ACIDS RESEARCH, 1999, 27 (01) : 12 - 17
  • [5] Predicting gene regulatory elements in silico on a genomic scale
    Brazma, A
    Jonassen, I
    Vilo, J
    Ukkonen, E
    [J]. GENOME RESEARCH, 1998, 8 (11) : 1202 - 1215
  • [6] BRAZMA A, 1997, P 5 INT C INT SYST M, P65
  • [7] Exploring the metabolic and genetic control of gene expression on a genomic scale
    DeRisi, JL
    Iyer, VR
    Brown, PO
    [J]. SCIENCE, 1997, 278 (5338) : 680 - 686
  • [8] Databases on transcriptional regulation: TRANSFAC, TRRD and COMPEL
    Heinemeyer, T
    Wingender, E
    Reuter, I
    Hermjakob, H
    Kel, AE
    Kel, OV
    Ignatieva, EV
    Ananko, EA
    Podkolodnaya, OA
    Kolpakov, FA
    Podkolodny, NL
    Kolchanov, NA
    [J]. NUCLEIC ACIDS RESEARCH, 1998, 26 (01) : 362 - 367
  • [9] Expanding the TRANSFAC database towards an expert system of regulatory molecular mechanisms
    Heinemeyer, T
    Chen, X
    Karas, H
    Kel, AE
    Kel, OV
    Liebich, I
    Meinhardt, T
    Reuter, I
    Schacherer, F
    Wingender, E
    [J]. NUCLEIC ACIDS RESEARCH, 1999, 27 (01) : 318 - 322
  • [10] HORNG JT, 2001, P GERM C BIOINF, P90