Connecting quantitative regulatory-network models to the genome

被引:10
作者
Pan, Yue
Durfee, Tim
Bockhorst, Joseph
Craven, Mark
机构
[1] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
[2] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI 53706 USA
[3] Univ Wisconsin, Dept Genet, Madison, WI 53706 USA
[4] Univ Wisconsin, Dept Elect Engn & Comp Sci, Milwaukee, WI 53211 USA
关键词
D O I
10.1093/bioinformatics/btm228
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: An important task in computational biology is to infer, using background knowledge and high-throughput data sources, models of cellular processes such as gene regulation. Nachman et al. have developed an approach to inferring gene-regulatory networks that represents quantitative transcription rates, and simultaneously estimates both the kinetic parameters that govern these rates and the activity levels of unobserved regulators that control them. This approach is appealing in that it provides a more detailed and realistic description of how a gene's regulators influence its level of expression than alternative methods. We have developed an extension to this approach that involves representing and learning the key kinetic parameters as functions of features in the genomic sequence. The primary motivation for our approach is that it provides a more mechanistic representation of the regulatory relationships being modeled. Results: We evaluate our approach using two Escherichia coli gene-expression data sets, with a particular focus on modeling the networks that are involved in controlling how E. coli regulates its response to the carbon source(s) available to it. Our results indicate that our sequence-based models provide predictive accuracy that is better than similar models without sequence-based parameters, and substantially better than a simple baseline. Moreover, our approach results in models that offer more explanatory power and biological insight than models without sequence-based parameters.
引用
收藏
页码:I367 / I376
页数:10
相关论文
共 42 条
[1]  
Abadie J, 1969, OPTIMIZATION, P37
[2]  
[Anonymous], 1988, PROBABILISTIC REASON, DOI DOI 10.1016/C2009-0-27609-4
[3]  
[Anonymous], 2005, Data Mining Pratical Machine Learning Tools and Techniques
[4]  
[Anonymous], BIOINFORMATICS
[5]   ESSENTIAL STRUCTURE OF ESCHERICHIA-COLI PROMOTER - EFFECT OF SPACER LENGTH BETWEEN THE 2 CONSENSUS SEQUENCES ON PROMOTER FUNCTION [J].
AOYAMA, T ;
TAKANAMI, M ;
OHTSUKA, E ;
TANIYAMA, Y ;
MARUMOTO, R ;
SATO, H ;
IKEHARA, M .
NUCLEIC ACIDS RESEARCH, 1983, 11 (17) :5855-5864
[6]   Additivity in protein-DNA interactions: how good an approximation is it? [J].
Benos, PV ;
Bulyk, ML ;
Stormo, GD .
NUCLEIC ACIDS RESEARCH, 2002, 30 (20) :4442-4451
[7]   The complete genome sequence of Escherichia coli K-12 [J].
Blattner, FR ;
Plunkett, G ;
Bloch, CA ;
Perna, NT ;
Burland, V ;
Riley, M ;
ColladoVides, J ;
Glasner, JD ;
Rode, CK ;
Mayhew, GF ;
Gregor, J ;
Davis, NW ;
Kirkpatrick, HA ;
Goeden, MA ;
Rose, DJ ;
Mau, B ;
Shao, Y .
SCIENCE, 1997, 277 (5331) :1453-+
[8]   A Bayesian network approach to operon prediction [J].
Bockhorst, J ;
Craven, M ;
Page, D ;
Shavlik, J ;
Glasner, J .
BIOINFORMATICS, 2003, 19 (10) :1227-1235
[9]   Predicting bacterial transcription units using sequence and expression data [J].
Bockhorst, Joseph ;
Qiu, Yu ;
Glasner, Jeremy ;
Liu, Mingzhu ;
Blattner, Frederick ;
Craven, Mark .
BIOINFORMATICS, 2003, 19 :i34-i43
[10]  
Bremer H., 1996, ESCHERICHIA COLI SAL