oPOSSUM: identification of over-represented transcription factor binding sites in co-expressed genes

被引:301
作者
Sui, SJH
Mortimer, JR
Arenillas, DJ
Brumm, J
Walsh, CJ
Kennedy, BP
Wasserman, WW [1 ]
机构
[1] Univ British Columbia, Ctr Mol Med & Therapeut, Vancouver, BC V5Z 1M9, Canada
[2] Univ British Columbia, Genet Grad Program, Vancouver, BC V5Z 1M9, Canada
[3] Merck Frosst Ctr Therapeut Res, Kirkland, PQ, Canada
[4] Univ British Columbia, Dept Med Genet, Vancouver, BC V5Z 1M9, Canada
[5] Univ British Columbia, Dept Stat, Vancouver, BC V5Z 1M9, Canada
基金
加拿大健康研究院;
关键词
D O I
10.1093/nar/gki624
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Targeted transcript profiling studies can identify sets of co-expressed genes; however, identification of the underlying functional mechanism(s) is a significant challenge. Established methods for the analysis of gene annotations, particularly those based on the Gene Ontology, can identify functional linkages between genes. Similar methods for the identification of over-represented transcription factor binding sites (TFBSs) have been successful in yeast, but extension to human genomics has largely proved ineffective. Creation of a system for the efficient identification of common regulatory mechanisms in a subset of co-expressed human genes promises to break a roadblock in functional genomics research. We have developed an integrated system that searches for evidence of co-regulation by one or more transcription factors (TFs). oPOSSUM combines a pre-computed database of conserved TFBSs in human and mouse promoters with statistical methods for identification of sites over-represented in a set of co-expressed genes. The algorithm successfully identified mediating TFs in control sets of tissue-specific genes and in sets of co-expressed genes from three transcript profiling studies. Simulation studies indicate that oPOSSUM produces few false positives using empirically defined thresholds and can tolerate up to 50% noise in a set of co-expressed genes.
引用
收藏
页码:3154 / 3164
页数:11
相关论文
共 43 条
[41]   Human-mouse genome comparisons to locate regulatory sites [J].
Wasserman, WW ;
Palumbo, M ;
Thompson, W ;
Fickett, JW ;
Lawrence, CE .
NATURE GENETICS, 2000, 26 (02) :225-228
[42]   TRANSFAC: A database on transcription factors and their DNA binding sites [J].
Wingender, E ;
Dietze, P ;
Karas, H ;
Knuppel, R .
NUCLEIC ACIDS RESEARCH, 1996, 24 (01) :238-241
[43]   An approach to identify over-represented cis-elements in related sequences [J].
Zheng, JS ;
Wu, JJ ;
Sun, ZR .
NUCLEIC ACIDS RESEARCH, 2003, 31 (07) :1995-2005