Accurate inference of transcription factor binding from DNA sequence and chromatin accessibility data

被引:382
作者
Pique-Regi, Roger [1 ]
Degner, Jacob F. [1 ,2 ]
Pai, Athma A. [1 ]
Gaffney, Daniel J. [1 ,3 ]
Gilad, Yoav [1 ]
Pritchard, Jonathan K. [1 ,3 ]
机构
[1] Univ Chicago, Dept Human Genet, Chicago, IL 60637 USA
[2] Univ Chicago, Comm Genet Genom & Syst Biol, Chicago, IL 60637 USA
[3] Univ Chicago, Howard Hughes Med Inst, Chicago, IL 60637 USA
基金
美国国家卫生研究院;
关键词
HUMAN GENOME; IN-VIVO; DISCOVERY; SITES; EXPRESSION; ENHANCERS; ELEMENTS; IDENTIFICATION; ASSOCIATION; SIGNATURES;
D O I
10.1101/gr.112623.110
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Accurate functional annotation of regulatory elements is essential for understanding global gene regulation. Here, we report a genome-wide map of 827,000 transcription factor binding sites in human lymphoblastoid cell lines, which is comprised of sites corresponding to 239 position weight matrices of known transcription factor binding motifs, and 49 novel sequence motifs. To generate this map, we developed a probabilistic framework that integrates cell-or tissue-specific experimental data such as histone modifications and DNase I cleavage patterns with genomic information such as gene annotation and evolutionary conservation. Comparison to empirical ChIP-seq data suggests that our method is highly accurate yet has the advantage of targeting many factors in a single assay. We anticipate that this approach will be a valuable tool for genome-wide studies of gene regulation in a wide variety of cell types or tissues under diverse conditions.
引用
收藏
页码:447 / 455
页数:9
相关论文
共 43 条
[1]   Unbiased Reconstruction of a Mammalian Transcriptional Network Mediating Pathogen Responses [J].
Amit, Ido ;
Garber, Manuel ;
Chevrier, Nicolas ;
Leite, Ana Paula ;
Donner, Yoni ;
Eisenhaure, Thomas ;
Guttman, Mitchell ;
Grenier, Jennifer K. ;
Li, Weibo ;
Zuk, Or ;
Schubert, Lisa A. ;
Birditt, Brian ;
Shay, Tal ;
Goren, Alon ;
Zhang, Xiaolan ;
Smith, Zachary ;
Deering, Raquel ;
McDonald, Rebecca C. ;
Cabili, Moran ;
Bernstein, Bradley E. ;
Rinn, John L. ;
Meissner, Alex ;
Root, David E. ;
Hacohen, Nir ;
Regev, Aviv .
SCIENCE, 2009, 326 (5950) :257-263
[2]   Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project [J].
Birney, Ewan ;
Stamatoyannopoulos, John A. ;
Dutta, Anindya ;
Guigo, Roderic ;
Gingeras, Thomas R. ;
Margulies, Elliott H. ;
Weng, Zhiping ;
Snyder, Michael ;
Dermitzakis, Emmanouil T. ;
Stamatoyannopoulos, John A. ;
Thurman, Robert E. ;
Kuehn, Michael S. ;
Taylor, Christopher M. ;
Neph, Shane ;
Koch, Christoph M. ;
Asthana, Saurabh ;
Malhotra, Ankit ;
Adzhubei, Ivan ;
Greenbaum, Jason A. ;
Andrews, Robert M. ;
Flicek, Paul ;
Boyle, Patrick J. ;
Cao, Hua ;
Carter, Nigel P. ;
Clelland, Gayle K. ;
Davis, Sean ;
Day, Nathan ;
Dhami, Pawandeep ;
Dillon, Shane C. ;
Dorschner, Michael O. ;
Fiegler, Heike ;
Giresi, Paul G. ;
Goldy, Jeff ;
Hawrylycz, Michael ;
Haydock, Andrew ;
Humbert, Richard ;
James, Keith D. ;
Johnson, Brett E. ;
Johnson, Ericka M. ;
Frum, Tristan T. ;
Rosenzweig, Elizabeth R. ;
Karnani, Neerja ;
Lee, Kirsten ;
Lefebvre, Gregory C. ;
Navas, Patrick A. ;
Neri, Fidencio ;
Parker, Stephen C. J. ;
Sabo, Peter J. ;
Sandstrom, Richard ;
Shafer, Anthony .
NATURE, 2007, 447 (7146) :799-816
[3]   High-resolution mapping and characterization of open chromatin across the genome [J].
Boyle, Alan P. ;
Davis, Sean ;
Shulha, Hennady P. ;
Meltzer, Paul ;
Margulies, Elliott H. ;
Weng, Zhiping ;
Furey, Terrence S. ;
Crawford, Gregory E. .
CELL, 2008, 132 (02) :311-322
[4]   Binding Site Turnover Produces Pervasive Quantitative Changes in Transcription Factor Binding between Closely Related Drosophila Species [J].
Bradley, Robert K. ;
Li, Xiao-Yong ;
Trapnell, Cole ;
Davidson, Stuart ;
Pachter, Lior ;
Chu, Hou Cheng ;
Tonkin, Leath A. ;
Biggin, Mark D. ;
Eisen, Michael B. .
PLOS BIOLOGY, 2010, 8 (03)
[5]   Exploring the DNA-binding specificities of zinc fingers with DNA microarrays [J].
Bulyk, ML ;
Huang, XH ;
Choo, Y ;
Church, GM .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (13) :7158-7163
[6]   A dynamic Bayesian network for identifying protein-binding footprints from single molecule-based sequencing data [J].
Chen, Xiaoyu ;
Hoffman, Michael M. ;
Bilmes, Jeff A. ;
Hesselberth, Jay R. ;
Noble, William S. .
BIOINFORMATICS, 2010, 26 (12) :i334-i342
[7]   Regulation of B lymphocyte and macrophage development by graded expression of PU.1 [J].
DeKoter, RP ;
Singh, H .
SCIENCE, 2000, 288 (5470) :1439-1441
[8]   Fast and systematic genome-wide discovery of conserved regulatory elements using a non-alignment based approach [J].
Elemento, O ;
Tavazoie, S .
GENOME BIOLOGY, 2005, 6 (02)
[9]   Integrating multiple evidence sources to predict transcription factor binding in the human genome [J].
Ernst, Jason ;
Plasterer, Heather L. ;
Simon, Itamar ;
Bar-Joseph, Ziv .
GENOME RESEARCH, 2010, 20 (04) :526-536
[10]   Using GOstats to test gene lists for GO term association [J].
Falcon, S. ;
Gentleman, R. .
BIOINFORMATICS, 2007, 23 (02) :257-258