Comparative promoter region analysis powered by CORG -: art. no. 24

被引:23
作者
Dieterich, C
Grossmann, S
Tanzer, A
Röpcke, S
Arndt, PF
Stadler, PF
Vingron, M
机构
[1] Max Planck Inst Mol Genet, Computat Mol Biol Dept, D-14195 Berlin, Germany
[2] Univ Vienna, Inst Theoret Chem & Struct Biol, A-1090 Vienna, Austria
[3] Univ Leipzig, Bioinformat Grp, Dept Comp Sci, D-04103 Leipzig, Germany
关键词
D O I
10.1186/1471-2164-6-24
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Promoters are key players in gene regulation. They receive signals from various sources (e.g. cell surface receptors) and control the level of transcription initiation, which largely determines gene expression. In vertebrates, transcription start sites and surrounding regulatory elements are often poorly defined. To support promoter analysis, we present CORG http://corg.molgen.mpg.de, a framework for studying upstream regions including untranslated exons (5' UTR). Description: The automated annotation of promoter regions integrates information of two kinds. First, statistically significant cross-species conservation within upstream regions of orthologous genes is detected. Pairwise as well as multiple sequence comparisons are computed. Second, binding site descriptions (position-weight matrices) are employed to predict conserved regulatory elements with a novel approach. Assembled EST sequences and verified transcription start sites are incorporated to distinguish exonic from other sequences. As of now, we have included 5 species in our analysis pipeline (man, mouse, rat, fugu and zebrafish). We characterized promoter regions of 16,127 groups of orthologous genes. All data are presented in an intuitive way via our web site. Users are free to export data for single genes or access larger data sets via our DAS server http://tomcat.molgen.mpg.de:8080/das. The benefits of our framework are exemplarily shown in the context of phylogenetic profiling of transcription factor binding sites and detection of microRNAs close to transcription start sites of our gene set. Conclusion: The CORG platform is a versatile tool to support analyses of gene regulation in vertebrate promoter regions. Applications for CORG cover a broad range from studying evolution of DNA binding sites and promoter constitution to the discovery of new regulatory sequence elements (e.g. microRNAs and binding sites).
引用
收藏
页数:10
相关论文
共 48 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Distinct changes of genomic biases in nucleotide substitution at the time of mammalian radiation [J].
Arndt, PF ;
Petrov, DA ;
Hwa, T .
MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (11) :1887-1896
[3]   Serum response factor is essential for mesoderm formation during mouse embryogenesis [J].
Arsenian, S ;
Weinhold, B ;
Oelgeschläger, M ;
Rüther, U ;
Nordheim, A .
EMBO JOURNAL, 1998, 17 (21) :6289-6299
[4]   The expanding snoRNA world [J].
Bachellerie, JP ;
Cavaillé, J ;
Hüttenhofer, A .
BIOCHIMIE, 2002, 84 (08) :775-790
[5]   An overview of ensembl [J].
Birney, E ;
Andrews, TD ;
Bevan, P ;
Caccamo, M ;
Chen, Y ;
Clarke, L ;
Coates, G ;
Cuff, J ;
Curwen, V ;
Cutts, T ;
Down, T ;
Eyras, E ;
Fernandez-Suarez, XM ;
Gane, P ;
Gibbins, B ;
Gilbert, J ;
Hammond, M ;
Hotz, HR ;
Iyer, V ;
Jekosch, K ;
Kahari, A ;
Kasprzyk, A ;
Keefe, D ;
Keenan, S ;
Lehvaslaiho, H ;
McVicker, G ;
Melsopp, C ;
Meidl, P ;
Mongin, E ;
Pettett, R ;
Potter, S ;
Proctor, G ;
Rae, M ;
Searle, S ;
Slater, G ;
Smedley, D ;
Smith, J ;
Spooner, W ;
Stabenau, A ;
Stalker, J ;
Storey, R ;
Ureta-Vidal, A ;
Woodwark, KC ;
Cameron, G ;
Durbin, R ;
Cox, A ;
Hubbard, T ;
Clamp, M .
GENOME RESEARCH, 2004, 14 (05) :925-928
[6]   FANTOM DB: Database of functional annotation of RIKEN mouse cDNA clones [J].
Bono, H ;
Kasukawa, T ;
Furuno, M ;
Hayashizaki, Y ;
Okazaki, Y .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :116-118
[7]   FINDING ALL CLIQUES OF AN UNDIRECTED GRAPH [H] [J].
BRON, C ;
KERBOSCH, J .
COMMUNICATIONS OF THE ACM, 1973, 16 (09) :575-577
[8]   Ets ternary complex transcription factors [J].
Buchwalter, G ;
Gross, C ;
Wasylyk, B .
GENE, 2004, 324 :1-14
[9]   Multiple sequence alignment with the Clustal series of programs [J].
Chenna, R ;
Sugawara, H ;
Koike, T ;
Lopez, R ;
Gibson, TJ ;
Higgins, DG ;
Thompson, JD .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3497-3500
[10]   CORG: a database for COmparative Regulatory Genomics [J].
Dieterich, C ;
Wang, H ;
Rateitschak, K ;
Luz, H ;
Vingron, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :55-57