CORE_TF: a user-friendly interface to identify evolutionary conserved transcription factor binding sites in sets of co-regulated genes

被引:30
作者
Hestand, Matthew S. [1 ]
van Galen, Michiel [1 ]
Villerius, Michel P. [1 ]
van Ommen, Gert-Jan B. [1 ]
den Dunnen, Johan T. [1 ]
't Hoen, Peter A. C. [1 ]
机构
[1] Leiden Univ, Med Ctr, Ctr Human & Clin Genet, NL-2300 RC Leiden, Netherlands
关键词
D O I
10.1186/1471-2105-9-495
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The identification of transcription factor binding sites is difficult since they are only a small number of nucleotides in size, resulting in large numbers of false positives and false negatives in current approaches. Computational methods to reduce false positives are to look for over-representation of transcription factor binding sites in a set of similarly regulated promoters or to look for conservation in orthologous promoter alignments. Results: We have developed a novel tool, "CORE_TF" (Conserved and Over-REpresented Transcription Factor binding sites) that identifies common transcription factor binding sites in promoters of co-regulated genes. To improve upon existing binding site predictions, the tool searches for position weight matrices from the TRANSFACR database that are over-represented in an experimental set compared to a random set of promoters and identifies cross-species conservation of the predicted transcription factor binding sites. The algorithm has been evaluated with expression and chromatin-immunoprecipitation on microarray data. We also implement and demonstrate the importance of matching the random set of promoters to the experimental promoters by GC content, which is a unique feature of our tool. Conclusion: The program CORE_TF is accessible in a user friendly web interface at http://www.LGTC.nl/CORE_TF. It provides a table of over-represented transcription factor binding sites in the users input genes' promoters and a graphical view of evolutionary conserved transcription factor binding sites. In our test data sets it successfully predicts target transcription factors and their binding sites.
引用
收藏
页数:11
相关论文
共 26 条
[1]   IDconverter and IDClight:: Conversion and annotation of gene and protein IDs [J].
Alibes, Andreu ;
Yankilevich, Patricio ;
Canada, Andres ;
Diaz-Uriarte, Ramon .
BMC BIOINFORMATICS, 2007, 8
[2]   CIS-ACTING SEQUENCES OF THE RAT TROPONIN-I SLOW GENE CONFER TISSUE-SPECIFIC AND DEVELOPMENT-SPECIFIC TRANSCRIPTION IN CULTURED MUSCLE-CELLS AS WELL AS FIBER-TYPE SPECIFICITY IN TRANSGENIC MICE [J].
BANERJEEBASU, S ;
BUONANNO, A .
MOLECULAR AND CELLULAR BIOLOGY, 1993, 13 (11) :7019-7028
[3]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[4]  
BRUNETTI A, 1990, J BIOL CHEM, V265, P5960
[5]   Global and gene-specific analyses show distinct roles for Myod and Myog at a common set of promoters [J].
Cao, Y ;
Kumar, RM ;
Penn, BH ;
Berkes, CA ;
Kooperberg, C ;
Boyer, LA ;
Young, RA ;
Tapscott, SJ .
EMBO JOURNAL, 2006, 25 (03) :502-511
[6]  
EDMONDSON DG, 1991, J BIOL CHEM, V266, P21343
[7]   Genome-wide in silico identification of transcriptional regulators controlling the cell cycle in human cells [J].
Elkon, R ;
Linhart, C ;
Sharan, R ;
Shamir, R ;
Shiloh, Y .
GENOME RESEARCH, 2003, 13 (05) :773-780
[8]   Evolutionary strategies for the elucidation of cis and trans factors that regulate the developmental switching programs of the beta-like globin genes [J].
Gumucio, DL ;
Shelton, DA ;
Zhu, W ;
Millinoff, D ;
Gray, T ;
Bock, JH ;
Slightom, JL ;
Goodman, M .
MOLECULAR PHYLOGENETICS AND EVOLUTION, 1996, 5 (01) :18-32
[9]   Long human-mouse sequence alignments reveal novel regulatory elements: A reason to sequence the mouse genome [J].
Hardison, RC ;
Oeltjen, J ;
Miller, W .
GENOME RESEARCH, 1997, 7 (10) :959-966
[10]   ConTra: a promoter alignment analysis tool for identification of transcription factor binding sites across species [J].
Hooghe, Bart ;
Hulpiau, Paco ;
van Roy, Frans ;
De Bleser, Pieter .
NUCLEIC ACIDS RESEARCH, 2008, 36 :W128-W132