Context-specific Bayesian clustering for gene expression data

Cited by: 45
Authors
Barash, Y [1]
Friedman, N [1]
Affiliations
[1] Hebrew Univ Jerusalem, Sch Engn & Comp Sci, IL-91904 Jerusalem, Israel
Keywords
gene expression; clustering; Bayesian model selection
DOI
10.1089/10665270252935403
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
The recent growth in genomic data and measurements of genome-wide expression patterns allows us to apply computational tools to examine gene regulation by transcription factors. In this work, we present a class of mathematical models that help in understanding the connections between transcription factors and functional classes of genes based on genetic and genomic data. Such a model represents the joint distribution of transcription factor binding sites and of expression levels of a gene in a unified probabilistic model. Learning a combined probability model of binding sites and expression patterns enables us to improve the clustering of the genes based on the discovery of putative binding sites and to detect which binding sites and experiments best characterize a cluster. To learn such models from data, we introduce a new search method that rapidly learns a model according to a Bayesian score. We evaluate our method on synthetic data as well as on real-life data and analyze the biological insights it provides. Finally, we demonstrate the applicability of the method to other data analysis problems in gene expression data.
Pages: 169-191
Number of pages: 23