Associating transcription factor-binding site motifs with target GO terms and target genes

被引:23
作者
Boden, Mikael [1 ]
Bailey, Timothy L. [1 ]
机构
[1] Univ Queensland, Inst Mol Biosci, Brisbane, QLD 4072, Australia
关键词
D O I
10.1093/nar/gkn374
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 [生物化学与分子生物学]; 081704 [应用化学];
摘要
The roles and target genes of many transcription factors (TFs) are still unknown. To predict the roles of TFs, we present a computational method for associating Gene Ontology ( GO) terms with TF-binding motifs. The method works by ranking all genes as potential targets of the TF, and reporting GO terms that are significantly associated with highly ranked genes. We also present an approach, whereby these predicted GO terms can be used to improve predictions of TF target genes. This uses a novel genescoring function that reflects the insight that genes annotated with GO terms predicted to be associated with the TF are more likely to be its targets. We construct validation sets of GO terms highly associated with known targets of various yeast and human TF. On the yeast reference sets, our prediction method identifies at least one correct GO term for 73% of the TF, 49% of the correct GO terms are predicted and almost one-third of the predicted GO terms are correct. Results on human reference sets are similarly encouraging. Validation of our target gene prediction method shows that its accuracy exceeds that of simple motif scanning.
引用
收藏
页码:4108 / 4117
页数:10
相关论文
共 34 条
[1]
Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[2]
Methods and statistics for combining motif match scores [J].
Bailey, TL ;
Gribskov, M .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1998, 5 (02) :211-221
[3]
Combining evidence using p-values: application to sequence homology searches [J].
Bailey, TL ;
Gribskov, M .
BIOINFORMATICS, 1998, 14 (01) :48-54
[4]
Transcription factor map alignment of promoter regions [J].
Blanco, Enrique ;
Messeguer, Xavier ;
Smith, Temple F. ;
Guigo, Roderic .
PLOS COMPUTATIONAL BIOLOGY, 2006, 2 (05) :403-416
[5]
Ab initio identification of putative human transcription factor binding sites by comparative genomics - art. no. 110 [J].
Corà, D ;
Herrmann, C ;
Dieterich, C ;
Di Cunto, F ;
Provero, P ;
Caselle, M .
BMC BIOINFORMATICS, 2005, 6 (1)
[6]
Computational identification of transcription factor binding sites by functional analysis of sets of genes sharing overrep-resented upstream motifs -: art. no. 57 [J].
Corà, D ;
Di Cunto, F ;
Provero, P ;
Silengo, L ;
Caselle, M .
BMC BIOINFORMATICS, 2004, 5 (1)
[7]
Software - Predicting transcription factor binding sites using local over-representation and comparative genomics [J].
Defrance, Matthieu ;
Touzet, Helene .
BMC BIOINFORMATICS, 2006, 7 (1)
[8]
Detection of functional DNA motifs via statistical over-representation [J].
Frith, MC ;
Fu, YT ;
Yu, LQ ;
Chen, JF ;
Hansen, U ;
Weng, ZP .
NUCLEIC ACIDS RESEARCH, 2004, 32 (04) :1372-1381
[9]
Gribskov M, 1996, METHOD ENZYMOL, V266, P198
[10]
Assessing semantic similarity measures for the characterization of human regulatory pathways [J].
Guo, X ;
Liu, RX ;
Shriver, CD ;
Hu, H ;
Liebman, MN .
BIOINFORMATICS, 2006, 22 (08) :967-973