Predicting bacterial transcription units using sequence and expression data

Cited by: 43
Authors
Bockhorst, Joseph [1, 2]
Qiu, Yu [3]
Glasner, Jeremy [3]
Liu, Mingzhu [3]
Blattner, Frederick [3]
Craven, Mark [1, 2]
Affiliations
[1] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
[2] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI 53706 USA
[3] Univ Wisconsin, Genet Lab, Madison, WI 53706 USA
Keywords
DOI
10.1093/bioinformatics/btg1003
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Subject classification codes
071010; 081704;
Abstract
Motivation: A key aspect of elucidating gene regulation in bacterial genomes is identifying the basic units of transcription. We present a method, based on probabilistic language models, that we apply to predict operons, promoters and terminators in the genome of Escherichia coli K-12. Our approach has two key properties: (i) it provides a coherent set of predictions for related regulatory elements of various types and (ii) it takes advantage of both DNA sequence and gene expression data, including expression measurements from inter-genic probes.
Results: Our experimental results show that we are able to predict operons and localize promoters and terminators with high accuracy. Moreover, our models that use both sequence and expression data are more accurate than those that use only one of these two data sources.
Pages: i34-i43
Page count: 10