A clustering approach for identification of enriched domains from histone modification ChIP-Seq data

被引:773
作者
Zang, Chongzhi [1 ]
Schones, Dustin E. [2 ]
Zeng, Chen [1 ]
Cui, Kairong [2 ]
Zhao, Keji [2 ]
Peng, Weiqun [1 ]
机构
[1] George Washington Univ, Dept Phys, Washington, DC 20052 USA
[2] NHLBI, Lab Mol Immunol, NIH, Bethesda, MD 20892 USA
基金
美国国家科学基金会;
关键词
TRANSCRIPTION FACTOR-BINDING; GENOME-WIDE IDENTIFICATION; EMBRYONIC STEM-CELLS; FALSE DISCOVERY RATE; LYSINE; 9; H3; PROTEINS; SITES; METHYLATIONS; RECOGNITION;
D O I
10.1093/bioinformatics/btp340
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Chromatin states are the key to gene regulation and cell identity. Chromatin immunoprecipitation (ChIP) coupled with high-throughput sequencing (ChIP-Seq) is increasingly being used to map epigenetic states across genomes of diverse species. Chromatin modi. cation profiles are frequently noisy and diffuse, spanning regions ranging from several nucleosomes to large domains of multiple genes. Much of the early work on the identification of ChIP-enriched regions for ChIP-Seq data has focused on identifying localized regions, such as transcription factor binding sites. Bioinformatic tools to identify diffuse domains of ChIP-enriched regions have been lacking. Results: Based on the biological observation that histone modi. cations tend to cluster to form domains, we present a method that identifies spatial clusters of signals unlikely to appear by chance. This method pools together enrichment information from neighboring nucleosomes to increase sensitivity and specificity. By using genomic-scale analysis, as well as the examination of loci with validated epigenetic states, we demonstrate that this method outperforms existing methods in the identification of ChIP-enriched signals for histone modi. cation profiles. We demonstrate the application of this unbiased method in important issues in ChIP-Seq data analysis, such as data normalization for quantitative comparison of levels of epigenetic modi. cations across cell types and growth conditions.
引用
收藏
页码:1952 / 1958
页数:7
相关论文
共 38 条
[1]   Functional mammalian homologues of the Drosophila PEV-modifier Su(var)3-9 encode centromere-associated proteins which complex with the heterochromatin component M31 [J].
Aagaard, L ;
Laible, G ;
Selenko, P ;
Schmid, M ;
Dorn, R ;
Schotta, G ;
Kuhfittig, S ;
Wolf, A ;
Lebersorger, A ;
Singh, PB ;
Reuter, G ;
Jenuwein, T .
EMBO JOURNAL, 1999, 18 (07) :1923-1938
[2]   GeneTrack - a genomic data processing and visualization framework [J].
Albert, Istvan ;
Wachi, Shinichiro ;
Jiang, Cizhong ;
Pugh, Franklin .
BIOINFORMATICS, 2008, 24 (10) :1305-1306
[3]   Selective recognition of methylated lysine 9 on histone H3 by the HP1 chromo domain [J].
Bannister, AJ ;
Zegerman, P ;
Partridge, JF ;
Miska, EA ;
Thomas, JO ;
Allshire, RC ;
Kouzarides, T .
NATURE, 2001, 410 (6824) :120-124
[4]   High-resolution profiling of histone methylations in the human genome [J].
Barski, Artern ;
Cuddapah, Suresh ;
Cui, Kairong ;
Roh, Tae-Young ;
Schones, Dustin E. ;
Wang, Zhibin ;
Wei, Gang ;
Chepelev, Iouri ;
Zhao, Keji .
CELL, 2007, 129 (04) :823-837
[5]  
Benjamini Y, 2001, ANN STAT, V29, P1165
[6]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[7]   The mammalian epigenome [J].
Bernstein, Bradley E. ;
Meissner, Alexander ;
Lander, Eric S. .
CELL, 2007, 128 (04) :669-681
[8]   F-Seq: a feature density estimator for high-throughput sequence tags [J].
Boyle, Alan P. ;
Guinney, Justin ;
Crawford, Gregory E. ;
Furey, Terrence S. .
BIOINFORMATICS, 2008, 24 (21) :2537-2538
[9]   Integration of external signaling pathways with the core transcriptional network in embryonic stem cells [J].
Chen, Xi ;
Xu, Han ;
Yuan, Ping ;
Fang, Fang ;
Huss, Mikael ;
Vega, Vinsensius B. ;
Wong, Eleanor ;
Orlov, Yuriy L. ;
Zhang, Weiwei ;
Jiang, Jianming ;
Loh, Yuin-Han ;
Yeo, Hock Chuan ;
Yeo, Zhen Xuan ;
Narang, Vipin ;
Govindarajan, Kunde Ramamoorthy ;
Leong, Bernard ;
Shahab, Atif ;
Ruan, Yijun ;
Bourque, Guillaume ;
Sung, Wing-Kin ;
Clarke, Neil D. ;
Wei, Chia-Lin ;
Ng, Huck-Hui .
CELL, 2008, 133 (06) :1106-1117
[10]   Theoretical analysis of epigenetic cell memory by nucleosome modification [J].
Dodd, Ian B. ;
Micheelsen, Mille A. ;
Sneppen, Kim ;
Thon, Genevive .
CELL, 2007, 129 (04) :813-822