A Bayesian network approach to operon prediction

Cited by: 70
Authors
Bockhorst, J
Craven, M
Page, D
Shavlik, J
Glasner, J
Affiliations
[1] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI 53706 USA
[2] Univ Wisconsin, Dept Comp Sci, Madison, WI USA
[3] Univ Wisconsin, Dept Genet, Madison, WI 53706 USA
Funding
National Institutes of Health (USA); National Science Foundation (USA)
DOI
10.1093/bioinformatics/btg147
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010 ; 081704 ;
Abstract
Motivation: To understand transcription regulation in a given prokaryotic genome, it is critical to identify its operons, the fundamental units of transcription. Although the number of organisms whose sequence and gene coordinates are known is growing, by and large their operons are not known. Results: We present a probabilistic approach to predicting operons using Bayesian networks. Our approach exploits diverse evidence sources such as sequence and expression data. We evaluate our approach on the Escherichia coli K-12 genome, where our results indicate that we are able to identify over 78% of its operons at a 10% false positive rate. In addition, empirical evaluation using a reduced set of data sources suggests that our approach may have significant value for organisms whose evidence sources are not as rich as those available for E. coli.
Pages: 1227-1235
Page count: 9