Genome-wide discovery of transcriptional modules from DNA sequence and gene expression

被引:156
作者
Segal, E. [1 ]
Yelensky, R. [1 ]
Koller, D. [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
probabilistic models; gene expression; transcriptional regulation;
D O I
10.1093/bioinformatics/btg1038
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In this paper, we describe an approach for understanding transcriptional regulation from both gene expression and promoter sequence data. We aim to identify transcriptional modules-sets of genes that are co-regulated in a set of experiments, through a common motif profile. Using the EM algorithm, our approach refines both the module assignment and the motif profile so as to best explain the expression data as a function of transcriptional motifs. It also dynamically adds and deletes motifs, as required to provide a genome-wide explanation of the expression data. We evaluate the method on two Saccharomyces cerevisiae gene expression data sets, showing that our approach is better than a standard one at recovering known motifs and at generating biologically coherent modules. We also combine our results with binding localization data to obtain regulatory relationships with known transcription factors, and show that many of the inferred relationships have support in the literature.
引用
收藏
页码:i273 / i282
页数:10
相关论文
共 21 条
  • [11] LEE T, 2002, SCIENCE, V298, P824
  • [12] Liu X, 2001, Pac Symp Biocomput, P127
  • [13] Pearl P, 1988, PROBABILISTIC REASON, DOI DOI 10.1016/C2009-0-27609-4
  • [14] Identifying regulatory networks by combinatorial analysis of promoter elements
    Pilpel, Y
    Sudarsanam, P
    Church, GM
    [J]. NATURE GENETICS, 2001, 29 (02) : 153 - 159
  • [15] Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation
    Roth, FP
    Hughes, JD
    Estep, PW
    Church, GM
    [J]. NATURE BIOTECHNOLOGY, 1998, 16 (10) : 939 - 945
  • [16] Segal E, 2001, Bioinformatics, V17 Suppl 1, pS243
  • [17] SEGAL E, 2002, P 6 INT C RES COMP M, P263
  • [18] Sinha S, 2000, Proc Int Conf Intell Syst Mol Biol, V8, P344
  • [19] Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization
    Spellman, PT
    Sherlock, G
    Zhang, MQ
    Iyer, VR
    Anders, K
    Eisen, MB
    Brown, PO
    Botstein, D
    Futcher, B
    [J]. MOLECULAR BIOLOGY OF THE CELL, 1998, 9 (12) : 3273 - 3297
  • [20] Systematic determination of genetic network architecture
    Tavazoie, S
    Hughes, JD
    Campbell, MJ
    Cho, RJ
    Church, GM
    [J]. NATURE GENETICS, 1999, 22 (03) : 281 - 285