Sequence and chromatin determinants of cell-type-specific transcription factor binding

被引:160
作者
Arvey, Aaron [1 ]
Agius, Phaedra [1 ]
Noble, William Stafford [2 ]
Leslie, Christina [1 ]
机构
[1] Mem Sloan Kettering Canc Ctr, Computat Biol Program, New York, NY 10065 USA
[2] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
关键词
DNA-BINDING; REGULATORY ELEMENTS; IN-VIVO; DISCOVERY; SITES; YY1; METHYLATION; PROMOTERS; ENHANCERS; DISTINCT;
D O I
10.1101/gr.127712.111
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Gene regulatory programs in distinct cell types are maintained in large part through the cell-type specific binding of transcription factors (TFs). The determinants of IF binding include direct DNA sequence preferences, DNA sequence preferences of cofactors, and the local cell-dependent chromatin context. To explore the contribution of DNA sequence signal, histone modifications, and DNase accessibility to cell-type-specific binding, we analyzed 286 ChIP-seq experiments performed by the ENCODE Consortium. This analysis included experiments for 67 transcriptional regulators, 15 of which were profiled in both the GMI2878 (lymphoblastoid) and K562 (erythroleukemic) human hematopoietic cell lines. To model TF-bound regions, we trained support vector machines (SVMs) that use flexible k-mer patterns to capture DNA sequence signals more accurately than traditional motif approaches. In addition, we trained SVM spatial chromatin signatures to model local histone modifications and DNase accessibility, obtaining significantly more accurate IF occupancy predictions than simpler approaches. Consistent with previous studies, we find that DNase accessibility can explain cell-line specific binding for many factors. However, we also find that of the 10 factors with prominent cell-type specific binding patterns, four display distinct cell-type-specific DNA sequence preferences according to our models. Moreover, for two factors we identify cell-specific binding sites that are accessible in both cell types but bound only in one. For these sites, cell-type specific sequence models, rather than DNase accessibility, are better able to explain differential binding. Our results suggest that using a single motif for each IF and filtering for chromatin accessible loci is not always sufficient to accurately account for cell-type specific binding profiles.
引用
收藏
页码:1723 / 1734
页数:12
相关论文
共 48 条
[1]   High Resolution Models of Transcription Factor-DNA Affinities Improve In Vitro and In Vivo Binding Predictions [J].
Agius, Phaedra ;
Arvey, Aaron ;
Chang, William ;
Noble, William Stafford ;
Leslie, Christina .
PLOS COMPUTATIONAL BIOLOGY, 2010, 6 (09)
[2]  
[Anonymous], NATURE IN PRESS
[3]  
Bailey T L, 1994, Proc Int Conf Intell Syst Mol Biol, V2, P28
[4]   Searching for statistically significant regulatory modules [J].
Bailey, Timothy L. ;
Noble, William Stafford .
BIOINFORMATICS, 2003, 19 :II16-II25
[5]   Genome-wide mapping and analysis of active promoters in mouse embryonic stem cells and adult organs [J].
Barrera, Leah O. ;
Li, Zirong ;
Smith, Andrew D. ;
Arden, Karen C. ;
Cavenee, Webster K. ;
Zhang, Michael Q. ;
Green, Roland D. ;
Ren, Bing .
GENOME RESEARCH, 2008, 18 (01) :46-59
[6]   High-resolution genome-wide in vivo footprinting of diverse transcription factors in human cells [J].
Boyle, Alan P. ;
Song, Lingyun ;
Lee, Bum-Kyu ;
London, Darin ;
Keefe, Damian ;
Birney, Ewan ;
Iyer, Vishwanath R. ;
Crawford, Gregory E. ;
Furey, Terrence S. .
GENOME RESEARCH, 2011, 21 (03) :456-464
[7]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8]   Pax5: the guardian of B cell identity and function [J].
Cobaleda, Cesar ;
Schebesta, Alexandra ;
Delogu, Alessio ;
Busslinger, Meinrad .
NATURE IMMUNOLOGY, 2007, 8 (05) :463-470
[9]   Cell type-specific DNA methylation at intragenic CpG islands in the immune system [J].
Deaton, Aimee M. ;
Webb, Shaun ;
Kerr, Alastair R. W. ;
Illingworth, Robert S. ;
Guy, Jacky ;
Andrews, Robert ;
Bird, Adrian .
GENOME RESEARCH, 2011, 21 (07) :1074-1086
[10]   YY1's longer DNA-binding motifs [J].
Do Kim, Jeong ;
Kim, Joomyeong .
GENOMICS, 2009, 93 (02) :152-158