Mocap: large-scale inference of transcription factor binding sites from chromatin accessibility

被引:23
作者
Chen, Xi [1 ]
Yu, Bowen [2 ]
Carriero, Nicholas [3 ]
Silva, Claudio [2 ]
Bonneau, Richard [1 ,2 ,3 ]
机构
[1] NYU, Dept Biol, New York, NY 10003 USA
[2] NYU, Dept Comp Sci, 550 1St Ave, New York, NY 10003 USA
[3] Simons Fdn, Flatiron Fdn, Ctr Computat Biol, New York, NY 10010 USA
关键词
DNA-BINDING; REGULATORY ELEMENTS; MOTIF ENVIRONMENT; SUPER-ENHANCERS; CELL IDENTITY; CPG ISLANDS; SEQUENCE; METHYLATION; LINEAGE; DETERMINANTS;
D O I
10.1093/nar/gkx174
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Differential binding of transcription factors (TFs) at cis-regulatory loci drives the differentiation and function of diverse cellular lineages. Understanding the regulatory interactions that underlie cell fate decisions requires characterizing TF binding sites (TFBS) across multiple cell types and conditions. Techniques, e.g. ChIP-Seq can reveal genome-wide patterns of TF binding, but typically requires laborious and costly experiments for each TF-cell-type (TFCT) condition of interest. Chromosomal accessibility assays can connect accessible chromatin in one cell type to many TFs through sequence motif mapping. Such methods, however, rarely take into account that the genomic context preferred by each factor differs from TF to TF, and from cell type to cell type. To address the differences in TF behaviors, we developed Mocap, a method that integrates chromatin accessibility, motif scores, TF footprints, CpG/GC content, evolutionary conservation and other factors in an ensemble of TFCT-specific classifiers. We show that integration of genomic features, such as CpG islands improves TFBS prediction in some TFCT. Further, we describe a method for mapping new TFCT, for which no ChIP-seq data exists, onto our ensemble of classifiers and show that our cross-sample TFBS prediction method outperforms several previously described methods.
引用
收藏
页码:4315 / 4329
页数:15
相关论文
共 104 条
[1]   Deconvolving the Recognition of DNA Shape from Sequence [J].
Abe, Namiko ;
Dror, Iris ;
Yang, Lin ;
Slattery, Matthew ;
Zhou, Tianyin ;
Bussemaker, Harmen J. ;
Rohs, Remo ;
Mann, Richard S. .
CELL, 2015, 161 (02) :307-318
[2]   An integrated map of genetic variation from 1,092 human genomes [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Schmidt, Jeanette P. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Dinh, Huyen ;
Kovar, Christie ;
Lee, Sandra ;
Lewis, Lora ;
Muzny, Donna ;
Reid, Jeff ;
Wang, Min ;
Wang, Jun ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Li, Zhuo ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Su, Zhe ;
Tai, Shuaishuai ;
Tang, Meifang .
NATURE, 2012, 491 (7422) :56-65
[3]   Differential expression analysis for sequence count data [J].
Anders, Simon ;
Huber, Wolfgang .
GENOME BIOLOGY, 2010, 11 (10)
[4]  
ANG SL, 1993, DEVELOPMENT, V119, P1301
[5]  
[Anonymous], 2008, MODERN METHODS ROBUS
[6]   An experimentally supported model of the Bacillus subtilis global transcriptional regulatory network [J].
Arrieta-Ortiz, Mario L. ;
Hafemeister, Christoph ;
Bate, Ashley Rose ;
Chu, Timothy ;
Greenfield, Alex ;
Shuster, Bentley ;
Barry, Samantha N. ;
Gallitto, Matthew ;
Liu, Brian ;
Kacmarczyk, Thadeous ;
Santoriello, Francis ;
Chen, Jie ;
Rodrigues, Christopher D. A. ;
Sato, Tsutomu ;
Rudner, David Z. ;
Driks, Adam ;
Bonneau, Richard ;
Eichenberger, Patrick .
MOLECULAR SYSTEMS BIOLOGY, 2015, 11 (11)
[7]   Sequence and chromatin determinants of cell-type-specific transcription factor binding [J].
Arvey, Aaron ;
Agius, Phaedra ;
Noble, William Stafford ;
Leslie, Christina .
GENOME RESEARCH, 2012, 22 (09) :1723-1734
[8]   Diversity and Complexity in DNA Recognition by Transcription Factors [J].
Badis, Gwenael ;
Berger, Michael F. ;
Philippakis, Anthony A. ;
Talukder, Shaheynoor ;
Gehrke, Andrew R. ;
Jaeger, Savina A. ;
Chan, Esther T. ;
Metzler, Genita ;
Vedenko, Anastasia ;
Chen, Xiaoyu ;
Kuznetsov, Hanna ;
Wang, Chi-Fong ;
Coburn, David ;
Newburger, Daniel E. ;
Morris, Quaid ;
Hughes, Timothy R. ;
Bulyk, Martha L. .
SCIENCE, 2009, 324 (5935) :1720-1723
[9]   Coregulation of Transcription Factor Binding and Nucleosome Occupancy through DNA Features of Mammalian Enhancers [J].
Barozzi, Iros ;
Simonatto, Marta ;
Bonifacio, Silvia ;
Yang, Lin ;
Rohs, Remo ;
Ghisletti, Serena ;
Natoli, Gioacchino .
MOLECULAR CELL, 2014, 54 (05) :844-857
[10]   High-resolution profiling of histone methylations in the human genome [J].
Barski, Artern ;
Cuddapah, Suresh ;
Cui, Kairong ;
Roh, Tae-Young ;
Schones, Dustin E. ;
Wang, Zhibin ;
Wei, Gang ;
Chepelev, Iouri ;
Zhao, Keji .
CELL, 2007, 129 (04) :823-837