Probabilistic discovery of overlapping cellular processes and their regulation

Cited by: 21
Authors
Battle, A
Segal, E
Koller, D
Affiliations
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[2] Rockefeller Univ, Ctr Studies Phys Biol, New York, NY 10021 USA
Keywords
cellular processes; gene regulation; probabilistic relational models
DOI
10.1089/cmb.2005.12.909
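Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010 [Biochemistry and Molecular Biology]; 081704 [Applied Chemistry]
Abstract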
In this paper, we explore modeling overlapping biological processes. We discuss a probabilistic model of overlapping biological processes, gene membership in those processes, and an addition to that model that identifies regulatory mechanisms controlling process activation. A key feature of our approach is that we allow genes to participate in multiple processes, thus providing a more biologically plausible model for the process of gene regulation. We present algorithms to learn each model automatically from data, using only genomewide measurements of gene expression as input. We compare our results to those obtained by other approaches and show that significant benefits can be gained by modeling both the organization of genes into overlapping cellular processes and the regulatory programs of these processes. Moreover, our method successfully grouped genes known to function together, recovered many regulatory relationships that are known in the literature, and suggested novel hypotheses regarding the regulatory role of previously uncharacterized proteins.
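The idea of genes participating in several processes at once can be illustrated with a small numerical sketch. It is not the authors' algorithm: it assumes, purely for illustration, that a gene's expression in each experiment is the sum of the activity levels of the processes it belongs to plus Gaussian noise, and that the membership matrix is already known. The names `simulate_expression` and `fit_activity`, the toy membership matrix, and the noise level are all hypothetical.

```python
import numpy as np


def simulate_expression(membership, activity, noise_sd, rng):
    """Assumed additive model: expression = membership @ activity + noise.

    membership: (genes x processes) 0/1 matrix; a gene may belong to several processes.
    activity:   (processes x experiments) activity level of each process.
    """
    clean = membership @ activity
    return clean + rng.normal(0.0, noise_sd, size=clean.shape)


def fit_activity(membership, expression):
    """Least-squares estimate of process activity for a fixed, known membership."""
    activity, *_ = np.linalg.lstsq(membership, expression, rcond=None)
    return activity


rng = np.random.default_rng(0)

# Toy data: six genes, two processes; the last two genes overlap both processes.
membership = np.array(
    [[1, 0], [1, 0], [0, 1], [0, 1], [1, 1], [1, 1]], dtype=float
)
activity = rng.normal(size=(2, 5))  # two processes, five experiments

expression = simulate_expression(membership, activity, noise_sd=0.01, rng=rng)
estimated = fit_activity(membership, expression)
```

In this toy setting the overlapping genes carry the summed signal of both processes, which a hard clustering that assigns each gene to exactly one group cannot represent. Learning the memberships themselves from expression data, as the abstract describes, is a harder problem that this sketch does not attempt.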
Pages: 909-927
Page count: 19