Direct ChIP-Seq significance analysis improves target prediction

被引:10
作者
Bansal, Mukesh [1 ,2 ]
Mendiratta, Geetu [3 ]
Anand, Santosh [4 ,5 ]
Kushwaha, Ritu [6 ]
Kim, Ryan Hyunjae [7 ]
Kustagi, Manju [1 ,2 ]
Iyer, Archana [1 ,2 ]
Chaganti, Raju S. K. [7 ]
Califano, Andrea [1 ,2 ,8 ]
Sumazin, Pavel [9 ]
机构
[1] Columbia Univ, Dept Syst Biol, New York, NY 10027 USA
[2] Columbia Univ, Ctr Computat Biol & Bioinformat, New York, NY USA
[3] Canc Genet Inc, Rutherford, NJ USA
[4] Univ Sannio, Dept Sci & Technol, Milan, Italy
[5] CNR, Inst Biomed Technol, I-20133 Milan, Italy
[6] Mem Sloan Kettering Canc Ctr, New York, NY 10021 USA
[7] Rockefeller Univ, Lab Immune Cell Epigenet & Signaling, New York, NY 10065 USA
[8] Columbia Univ, Dept Biochem & Mol Biophys, Columbia Genome Ctr, Inst Canc Genet,High Throughput Screening Facil,D, New York, NY USA
[9] Baylor Coll Med, Texas Childrens Canc Ctr, Houston, TX 77030 USA
关键词
TRANSCRIPTION FACTOR; BINDING-SITES; GENOME; PROFILES;
D O I
10.1186/1471-2164-16-S5-S4
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 [微生物学]; 090105 [作物生产系统与生态工程];
摘要
Background: Chromatin immunoprecipitation followed by sequencing of protein-bound DNA fragments (ChIP-Seq) is an effective high-throughput methodology for the identification of context specific DNA fragments that are bound by specific proteins in vivo. Despite significant progress in the bioinformatics analysis of this genome-scale data, a number of challenges remain as technology-dependent biases, including variable target accessibility and mappability, sequence-dependent variability, and non-specific binding affinity must be accounted for. Results and discussion: We introduce a nonparametric method for scoring consensus regions of aligned immunoprecipitated DNA fragments when appropriate control experiments are available. Our method uses local models for null binding; these are necessary because binding prediction scores based on global models alone fail to properly account for specialized features of genomic regions and chance pull downs of specific DNA fragments, thus disproportionally rewarding some genomic regions and decreasing prediction accuracy. We make no assumptions about the structure or amplitude of bound peaks, yet we show that our method outperforms leading methods developed using either global or local null hypothesis models for random binding. We test prediction performance by comparing analyses of ChIP-seq, ChIP-chip, motif-based binding-site prediction, and shRNA assays, showing high reproducibility, binding-site enrichment in predicted target regions, and functional regulation of predicted targets. Conclusions: Given appropriate controls, a direct nonparametric method for identifying transcription-factor targets from ChIP-Seq assays may lead to both higher sensitivity and higher specificity, and should be preferred or used in conjunction with methods that use parametric models for null binding.
引用
收藏
页数:10
相关论文
共 27 条
[1]
High-resolution profiling of histone methylations in the human genome [J].
Barski, Artern ;
Cuddapah, Suresh ;
Cui, Kairong ;
Roh, Tae-Young ;
Schones, Dustin E. ;
Wang, Zhibin ;
Wei, Gang ;
Chepelev, Iouri ;
Zhao, Keji .
CELL, 2007, 129 (04) :823-837
[2]
Bartlett John M S, 2003, Methods Mol Biol, V226, P3
[3]
Core transcriptional regulatory circuitry in human embryonic stem cells [J].
Boyer, LA ;
Lee, TI ;
Cole, MF ;
Johnstone, SE ;
Levine, SS ;
Zucker, JR ;
Guenther, MG ;
Kumar, RM ;
Murray, HL ;
Jenner, RG ;
Gifford, DK ;
Melton, DA ;
Jaenisch, R ;
Young, RA .
CELL, 2005, 122 (06) :947-956
[4]
INVIVO INTERACTIONS OF RNA POLYMERASE-II WITH GENES OF DROSOPHILA-MELANOGASTER [J].
GILMOUR, DS ;
LIS, JT .
MOLECULAR AND CELLULAR BIOLOGY, 1985, 5 (08) :2009-2018
[5]
Herranz D, 2014, NATURE MED
[6]
DNA Specificity Determinants Associate with Distinct Transcription Factor Functions [J].
Hollenhorst, Peter C. ;
Chandler, Katherine J. ;
Poulsen, Rachel L. ;
Johnson, W. Evan ;
Speck, Nancy A. ;
Graves, Barbara J. .
PLOS GENETICS, 2009, 5 (12)
[7]
Jagadish N, 2014, STEM CELLS
[8]
An integrated software system for analyzing ChIP-chip and ChIP-seq data [J].
Ji, Hongkai ;
Jiang, Hui ;
Ma, Wenxiu ;
Johnson, David S. ;
Myers, Richard M. ;
Wong, Wing H. .
NATURE BIOTECHNOLOGY, 2008, 26 (11) :1293-1300
[9]
Genome-wide mapping of in vivo protein-DNA interactions [J].
Johnson, David S. ;
Mortazavi, Ali ;
Myers, Richard M. ;
Wold, Barbara .
SCIENCE, 2007, 316 (5830) :1497-1502
[10]
Genome-wide identification of in vivo protein-DNA binding sites from ChIP-Seq data [J].
Jothi, Raja ;
Cuddapah, Suresh ;
Barski, Artem ;
Cui, Kairong ;
Zhao, Keji .
NUCLEIC ACIDS RESEARCH, 2008, 36 (16) :5221-5231