oPOSSUM: identification of over-represented transcription factor binding sites in co-expressed genes

被引:301
作者
Sui, SJH
Mortimer, JR
Arenillas, DJ
Brumm, J
Walsh, CJ
Kennedy, BP
Wasserman, WW [1 ]
机构
[1] Univ British Columbia, Ctr Mol Med & Therapeut, Vancouver, BC V5Z 1M9, Canada
[2] Univ British Columbia, Genet Grad Program, Vancouver, BC V5Z 1M9, Canada
[3] Merck Frosst Ctr Therapeut Res, Kirkland, PQ, Canada
[4] Univ British Columbia, Dept Med Genet, Vancouver, BC V5Z 1M9, Canada
[5] Univ British Columbia, Dept Stat, Vancouver, BC V5Z 1M9, Canada
基金
加拿大健康研究院;
关键词
D O I
10.1093/nar/gki624
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Targeted transcript profiling studies can identify sets of co-expressed genes; however, identification of the underlying functional mechanism(s) is a significant challenge. Established methods for the analysis of gene annotations, particularly those based on the Gene Ontology, can identify functional linkages between genes. Similar methods for the identification of over-represented transcription factor binding sites (TFBSs) have been successful in yeast, but extension to human genomics has largely proved ineffective. Creation of a system for the efficient identification of common regulatory mechanisms in a subset of co-expressed human genes promises to break a roadblock in functional genomics research. We have developed an integrated system that searches for evidence of co-regulation by one or more transcription factors (TFs). oPOSSUM combines a pre-computed database of conserved TFBSs in human and mouse promoters with statistical methods for identification of sites over-represented in a set of co-expressed genes. The algorithm successfully identified mediating TFs in control sets of tissue-specific genes and in sets of co-expressed genes from three transcript profiling studies. Simulation studies indicate that oPOSSUM produces few false positives using empirically defined thresholds and can tolerate up to 50% noise in a set of co-expressed genes.
引用
收藏
页码:3154 / 3164
页数:11
相关论文
共 43 条
[21]   A predictive model for regulatory sequences directing liver-specific transcription [J].
Krivan, W ;
Wasserman, WW .
GENOME RESEARCH, 2001, 11 (09) :1559-1566
[22]   TFBS: Computational framework for transcription factor binding site analysis [J].
Lenhard, B ;
Wasserman, WW .
BIOINFORMATICS, 2002, 18 (08) :1135-1136
[23]  
Lenhard Boris, 2003, J Biol, V2, P13, DOI 10.1186/1475-4924-2-13
[24]   NF-κB regulation in the immune system [J].
Li, QT ;
Verma, IM .
NATURE REVIEWS IMMUNOLOGY, 2002, 2 (10) :725-734
[25]   rVista for comparative sequence-based discovery of functional transcription factor binding sites [J].
Loots, GG ;
Ovcharenko, I ;
Pachter, L ;
Dubchak, I ;
Rubin, EM .
GENOME RESEARCH, 2002, 12 (05) :832-839
[26]   PU.1 and multiple IFN regulatory factor proteins synergize to mediate transcriptional activation of the human IL-1β gene [J].
Marecki, S ;
Riendeau, CJ ;
Liang, MD ;
Fenton, MJ .
JOURNAL OF IMMUNOLOGY, 2001, 166 (11) :6829-6838
[27]   Characterization of the c-MYC-regulated transcriptome by SAGE: Identification and analysis of c-MYC target genes [J].
Menssen, A ;
Hermeking, H .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (09) :6274-6279
[28]   IFN-stimulated gene 15 is synergistically activated through interactions between the myelocyte/lymphocyte-specific transcription factors, PU.1, IFN regulatory factor-8/IFN consensus sequence binding protein, and IFN regulatory factor-4: Characterization of a new subtype of IFN-stimulated response element [J].
Meraro, D ;
Gleit-Kielmanowicz, M ;
Hauser, H ;
Levi, BZ .
JOURNAL OF IMMUNOLOGY, 2002, 168 (12) :6224-6231
[29]   A GENERAL METHOD APPLICABLE TO SEARCH FOR SIMILARITIES IN AMINO ACID SEQUENCE OF 2 PROTEINS [J].
NEEDLEMAN, SB ;
WUNSCH, CD .
JOURNAL OF MOLECULAR BIOLOGY, 1970, 48 (03) :443-+
[30]   Transcription repression in oncogenic transformation: common targets of epigenetic repression in cells transformed by Fos, Ras or Dnmt1 [J].
Ordway, JM ;
Williams, K ;
Curran, T .
ONCOGENE, 2004, 23 (21) :3737-3748