oPOSSUM: identification of over-represented transcription factor binding sites in co-expressed genes

Times cited: 301
Authors
Sui, SJH
Mortimer, JR
Arenillas, DJ
Brumm, J
Walsh, CJ
Kennedy, BP
Wasserman, WW [1]
Affiliations
[1] Univ British Columbia, Ctr Mol Med & Therapeut, Vancouver, BC V5Z 1M9, Canada
[2] Univ British Columbia, Genet Grad Program, Vancouver, BC V5Z 1M9, Canada
[3] Merck Frosst Ctr Therapeut Res, Kirkland, PQ, Canada
[4] Univ British Columbia, Dept Med Genet, Vancouver, BC V5Z 1M9, Canada
[5] Univ British Columbia, Dept Stat, Vancouver, BC V5Z 1M9, Canada
Funding
Canadian Institutes of Health Research
DOI
10.1093/nar/gki624
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Targeted transcript profiling studies can identify sets of co-expressed genes; however, identification of the underlying functional mechanism(s) is a significant challenge. Established methods for the analysis of gene annotations, particularly those based on the Gene Ontology, can identify functional linkages between genes. Similar methods for the identification of over-represented transcription factor binding sites (TFBSs) have been successful in yeast, but extension to human genomics has largely proved ineffective. Creation of a system for the efficient identification of common regulatory mechanisms in a subset of co-expressed human genes promises to break a roadblock in functional genomics research. We have developed an integrated system that searches for evidence of co-regulation by one or more transcription factors (TFs). oPOSSUM combines a pre-computed database of conserved TFBSs in human and mouse promoters with statistical methods for identification of sites over-represented in a set of co-expressed genes. The algorithm successfully identified mediating TFs in control sets of tissue-specific genes and in sets of co-expressed genes from three transcript profiling studies. Simulation studies indicate that oPOSSUM produces few false positives using empirically defined thresholds and can tolerate up to 50% noise in a set of co-expressed genes.
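The abstract does not say which statistic oPOSSUM uses to call a site over-represented. As a minimal sketch of the general idea only, the following assumes a one-tailed hypergeometric test that compares how many genes in the co-expressed set carry a conserved site for a given TF against a background gene set. The function names, the TF labels (`TF_A`, `TF_B`), the counts, and the 0.01 threshold are all hypothetical and are not taken from the paper.

```python
from math import comb


def hypergeom_upper_tail(k, n, K, N):
    """Return P(X >= k) for a hypergeometric draw.

    N: genes in the background, K: background genes with the site,
    n: genes in the co-expressed set, k: co-expressed genes with the site.
    """
    total = comb(N, n)
    upper = min(n, K)
    # math.comb returns 0 when the lower index exceeds the upper one,
    # so impossible configurations contribute nothing to the sum.
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return hits / total


def over_represented(target_hits, background_hits, n_target, n_background,
                     alpha=0.01):
    """Return (tf, p_value) pairs with p_value < alpha, smallest first.

    alpha is an arbitrary illustrative cut-off, not an empirically
    defined threshold from the paper.
    """
    results = []
    for tf, k in target_hits.items():
        p = hypergeom_upper_tail(k, n_target, background_hits[tf], n_background)
        if p < alpha:
            results.append((tf, p))
    return sorted(results, key=lambda item: item[1])


# Made-up counts: 20 co-expressed genes drawn from a 1000-gene background.
target_hits = {"TF_A": 15, "TF_B": 10}
background_hits = {"TF_A": 100, "TF_B": 500}

significant = over_represented(target_hits, background_hits, 20, 1000)
for tf, p in significant:
    print(f"{tf}: p = {p:.3g}")
```

With these invented numbers, `TF_A` sites appear in 75% of the co-expressed genes but only 10% of the background, so it passes the threshold; `TF_B` sites appear at the background rate and are not reported. A real analysis would also need a correction for testing many TF profiles at once.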
Pages: 3154-3164
Page count: 11