GenomeTrafac: a whole genome resource for the detection of transcription factor binding site clusters associated with conventional and microRNA encoding genes conserved between mouse and human gene orthologs

Cited by: 18
Authors
Jegga, Anil G.
Chen, Jing
Gowrisankar, Sivakumar
Deshmukh, Mrunal A.
Gudivada, RangaChandra
Kong, Sue
Kaimal, Vivek
Aronow, Bruce J.
Affiliations
[1] Childrens Hosp, Med Ctr, Div Biomed Informat, Cincinnati, OH 45229 USA
[2] Univ Cincinnati, Coll Med, Dept Pediat, Cincinnati, OH 45229 USA
[3] Univ Cincinnati, Dept Biomed Engn, Cincinnati, OH 45229 USA
DOI
10.1093/nar/gkl1011
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Transcriptional cis-regulatory control regions are frequently found within non-coding DNA segments conserved across multi-species gene orthologs. Adopting a systematic gene-centric pipeline approach, we report here the development of a web-accessible database resource, GenomeTraFac (http://genometrafac.cchmc.org), that allows genome-wide detection and characterization of compositionally similar cis-clusters that occur in gene orthologs between any two genomes, for both microRNA genes and conventional RNA-encoding genes. Each ortholog gene pair can be scanned to visualize overall conserved sequence regions; within these, the relative density of conserved cis-element motif clusters forms graph peak structures. The results of these analyses can be mined en masse to identify the most frequently represented cis-motifs in a list of genes. The system also provides a method for rapid evaluation and visualization of gene model consistency between orthologs, and facilitates consideration of the potential impact of sequence variation in conserved non-coding regions on complex cis-element structures. Using the mouse and human genomes via the NCBI Reference Sequence database and the Sanger Institute miRBase, the system demonstrated the ability to identify validated transcription factor targets within promoter and distal genomic regulatory regions of both conventional and microRNA genes.
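The two analyses the abstract mentions, density peaks of conserved motif hits and ranking of the most frequently represented cis-motifs across a gene list, can be illustrated with a minimal Python sketch. This is not GenomeTraFac's implementation: the function names, window and step sizes, hit positions, and gene labels below are all hypothetical, and the motif labels are used only as example strings.

```python
from collections import Counter


def motif_density(hit_positions, region_length, window=200, step=50):
    """Count motif hits inside each sliding window across a region.

    Returns a list of (window_start, hit_count) pairs. Window and step
    sizes are illustrative assumptions, not values from the paper.
    """
    densities = []
    last_start = max(region_length - window, 0)
    for start in range(0, last_start + 1, step):
        end = start + window
        count = sum(1 for pos in hit_positions if start <= pos < end)
        densities.append((start, count))
    return densities


def most_frequent_motifs(hits_by_gene, top_n=3):
    """Rank motifs by the number of genes in which they occur at least once."""
    counts = Counter()
    for motifs in hits_by_gene.values():
        counts.update(set(motifs))  # count each motif once per gene
    return counts.most_common(top_n)


# Hypothetical positions of conserved motif hits in a 1000 bp aligned region.
hits = [120, 150, 180, 610, 900]
profile = motif_density(hits, region_length=1000)
peak_start, peak_count = max(profile, key=lambda w: w[1])

# Hypothetical per-gene motif hits for a small gene list.
genes = {
    "geneA": ["SP1", "MYC", "E2F"],
    "geneB": ["MYC", "E2F"],
    "geneC": ["MYC"],
}
ranked = most_frequent_motifs(genes)
```

Here `peak_start` marks the first window with the highest hit count, which is the kind of location a density plot would show as a peak, and `ranked` lists motifs by how many genes in the list contain them.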
Pages: D116-D121
Page count: 6