A dynamic Bayesian network for identifying protein-binding footprints from single molecule-based sequencing data

被引:29
作者
Chen, Xiaoyu [2 ]
Hoffman, Michael M. [1 ]
Bilmes, Jeff A. [3 ]
Hesselberth, Jay R. [1 ]
Noble, William S. [1 ,2 ]
机构
[1] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
[2] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
[3] Univ Washington, Dept Elect Engn, Seattle, WA 98195 USA
基金
美国国家科学基金会;
关键词
IN-VIVO; YEAST; TRANSCRIPTION; CHROMATIN; SITES; MODEL; RAP1;
D O I
10.1093/bioinformatics/btq175
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: A global map of transcription factor binding sites (TFBSs) is critical to understanding gene regulation and genome function. DNaseI digestion of chromatin coupled with massively parallel sequencing (digital genomic footprinting) enables the identification of protein-binding footprints with high resolution on a genome-wide scale. However, accurately inferring the locations of these footprints remains a challenging computational problem. Results: We present a dynamic Bayesian network-based approach for the identification and assignment of statistical confidence estimates to protein-binding footprints from digital genomic footprinting data. The method, DBFP, allows footprints to be identified in a probabilistic framework and outperforms our previously described algorithm in terms of precision at a fixed recall. Applied to a digital footprinting data set from Saccharomyces cerevisiae, DBFP identifies 4679 statistically significant footprints within intergenic regions. These footprints are mainly located near transcription start sites and are strongly enriched for known TFBSs. Footprints containing no known motif are preferentially located proximal to other footprints, consistent with cooperative binding of these footprints. DBFP also identifies a set of statistically significant footprints in the yeast coding regions. Many of these footprints coincide with the boundaries of antisense transcripts, and the most significant footprints are enriched for binding sites of the chromatin-associated factors Abf1 and Rap1. Contact: jay.hesselberth@ucdenver.edu; william-noble@u.washington.edu Supplementary information: Supplementary material is available at Bioinformatics online.
引用
收藏
页码:i334 / i342
页数:9
相关论文
共 28 条
[1]  
Bilmes J, 2002, INT CONF ACOUST SPEE, P3916
[2]   Graphical model architectures for speech recognition [J].
Bilmes, JA ;
Bartels, C .
IEEE SIGNAL PROCESSING MAGAZINE, 2005, 22 (05) :89-100
[3]   RAP1 IS REQUIRED FOR BAS1/BAS2-DEPENDENT AND GCN4-DEPENDENT TRANSCRIPTION OF THE YEAST HIS4 GENE [J].
DEVLIN, C ;
TICEBALDWIN, K ;
SHORE, D ;
ARNDT, KT .
MOLECULAR AND CELLULAR BIOLOGY, 1991, 11 (07) :3642-3651
[4]   THE YEAST STE12 PROTEIN BINDS TO THE DNA-SEQUENCE MEDIATING PHEROMONE INDUCTION [J].
DOLAN, JW ;
KIRKMAN, C ;
FIELDS, S .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1989, 86 (15) :5703-5707
[5]   High-throughput localization of functional elements by quantitative chromatin profiting [J].
Dorschner, M ;
Hawrylycz, M ;
Humbert, R ;
Wallace, JC ;
Shafer, A ;
Kawamoto, J ;
Mack, J ;
Hall, R ;
Goldy, J ;
Sabo, PJ ;
Kohli, A ;
Li, QL ;
McArthur, M ;
Stamatoyannopoulos, JA .
NATURE METHODS, 2004, 1 (03) :219-225
[6]  
Hesselberth JR, 2009, NAT METHODS, V6, P283, DOI [10.1038/NMETH.1313, 10.1038/nmeth.1313]
[7]   A ncRNA Modulates Histone Modification and mRNA Induction in the Yeast GAL Gene Cluster [J].
Houseley, Jonathan ;
Rubbi, Liudmilla ;
Grunstein, Michael ;
Tollervey, David ;
Vogelauer, Maria .
MOLECULAR CELL, 2008, 32 (05) :685-695
[8]   THE YEAST TRANSCRIPTION ACTIVATOR PRTF, A HOMOLOG OF THE MAMMALIAN SERUM RESPONSE FACTOR, IS ENCODED BY THE MCM1 GENE [J].
JARVIS, EE ;
CLARK, KL ;
SPRAGUE, GF .
GENES & DEVELOPMENT, 1989, 3 (07) :936-945
[9]   Genome-wide mapping of in vivo protein-DNA interactions [J].
Johnson, David S. ;
Mortazavi, Ali ;
Myers, Richard M. ;
Wold, Barbara .
SCIENCE, 2007, 316 (5830) :1497-1502
[10]   YEAST REPRESSOR ALPHA-2 BINDS TO ITS OPERATOR COOPERATIVELY WITH YEAST PROTEIN MCM1 [J].
KELEHER, CA ;
PASSMORE, S ;
JOHNSON, AD .
MOLECULAR AND CELLULAR BIOLOGY, 1989, 9 (11) :5228-5230