A comprehensive library of DNA-binding site matrices for 55 proteins applied to the complete Escherichia coli K-12 genome

被引:272
作者
Robison, K
McGuire, AM
Church, GM
机构
[1] Harvard Univ, Sch Med, Dept Genet, Boston, MA 02115 USA
[2] Harvard Univ, Sch Med, Grad Program Biophys, Boston, MA 02115 USA
关键词
bioinformatics; DNA-binding proteins; matrix search; SELEX; footprinting;
D O I
10.1006/jmbi.1998.2160
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A major mode of gene regulation occurs via the binding of specific proteins to specific DNA sequences. The availability of complete bacterial genome sequences offers an unprecedented opportunity to describe networks of such interactions by correlating existing experimental data with computational predictions. Of the 240 candidate Escherichia coli DNA-binding proteins, about 55 have DNA-binding sites identified by DNA footprinting. We used these sites to construct recognition matrices, which we used to search for additional binding sites in the E. coli genomic sequence. Many of these matrices show a strong preference for non-coding DNA. Discrepancies are identified between matrices derived from natural sites and those derived from SELEX (Systematic Evolution of Ligands by Exponential enrichment) experiments. We have constructed a database of these proteins and binding sites, called DPInteract (available at http://arep.med.harvard.edu/dpinteract). (C) 1998 Academic Press.
引用
收藏
页码:241 / 254
页数:14
相关论文
共 72 条