Impact of artifact removal on ChIP quality metrics in ChIP-seq and ChIP-exo data

被引:160
作者
Carroll, Thomas S. [1 ]
Liang, Ziwei [2 ]
Salama, Rafik [1 ]
Stark, Rory [1 ]
de Santiago, Ines [1 ]
机构
[1] Univ Cambridge, Cambridge Inst CRUK, Cambridge CB2 0RE, England
[2] Univ London Imperial Coll Sci Technol & Med, MRC, Ctr Clin Sci, London, England
关键词
BINDING; DESIGN; ENCODE;
D O I
10.3389/fgene.2014.00075
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
With the advent of ChIP-seq multiplexing technologies and the subsequent increase in ChIP-seq throughput, the development of working standards for the quality assessment of ChIP-seq studies has received significant attention. The ENCODE consortium's large scale analysis of transcription factor binding and epigenetic marks as well as concordant work on ChIP-seq by other laboratories has established a new generation of ChIP-seq quality control measures. The use of these metrics alongside common processing steps has however not been evaluated. In this study, we investigate the effects of blacklisting and removal of duplicated reads on established metrics of ChIP-seq quality and show that the interpretation of these metrics is highly dependent on the ChIP-seq preprocessing steps applied. Further to this we perform the first investigation of the use of these metrics for ChIP-exo data and make recommendations for the adaptation of the NSC statistic to allow for the assessment of ChIP-exo efficiency.
引用
收藏
页数:11
相关论文
共 26 条
[1]   Whole exome capture in solution with 3 Gbp of data [J].
Bainbridge, Matthew N. ;
Wang, Min ;
Burgess, Daniel L. ;
Kovar, Christie ;
Rodesch, Matthew J. ;
D'Ascenzo, Mark ;
Kitzman, Jacob ;
Wu, Yuan-Qing ;
Newsham, Irene ;
Richmond, Todd A. ;
Jeddeloh, Jeffrey A. ;
Muzny, Donna ;
Albert, Thomas J. ;
Gibbs, Richard A. .
GENOME BIOLOGY, 2010, 11 (06)
[2]   High-resolution profiling of histone methylations in the human genome [J].
Barski, Artern ;
Cuddapah, Suresh ;
Cui, Kairong ;
Roh, Tae-Young ;
Schones, Dustin E. ;
Wang, Zhibin ;
Wei, Gang ;
Chepelev, Iouri ;
Zhao, Keji .
CELL, 2007, 129 (04) :823-837
[3]  
Chen YW, 2012, NAT METHODS, V9, P609, DOI [10.1038/NMETH.1985, 10.1038/nmeth.1985]
[4]   An integrated encyclopedia of DNA elements in the human genome [J].
Dunham, Ian ;
Kundaje, Anshul ;
Aldred, Shelley F. ;
Collins, Patrick J. ;
Davis, CarrieA. ;
Doyle, Francis ;
Epstein, Charles B. ;
Frietze, Seth ;
Harrow, Jennifer ;
Kaul, Rajinder ;
Khatun, Jainab ;
Lajoie, Bryan R. ;
Landt, Stephen G. ;
Lee, Bum-Kyu ;
Pauli, Florencia ;
Rosenbloom, Kate R. ;
Sabo, Peter ;
Safi, Alexias ;
Sanyal, Amartya ;
Shoresh, Noam ;
Simon, Jeremy M. ;
Song, Lingyun ;
Trinklein, Nathan D. ;
Altshuler, Robert C. ;
Birney, Ewan ;
Brown, James B. ;
Cheng, Chao ;
Djebali, Sarah ;
Dong, Xianjun ;
Dunham, Ian ;
Ernst, Jason ;
Furey, Terrence S. ;
Gerstein, Mark ;
Giardine, Belinda ;
Greven, Melissa ;
Hardison, Ross C. ;
Harris, Robert S. ;
Herrero, Javier ;
Hoffman, Michael M. ;
Iyer, Sowmya ;
Kellis, Manolis ;
Khatun, Jainab ;
Kheradpour, Pouya ;
Kundaje, Anshul ;
Lassmann, Timo ;
Li, Qunhua ;
Lin, Xinying ;
Marinov, Georgi K. ;
Merkel, Angelika ;
Mortazavi, Ali .
NATURE, 2012, 489 (7414) :57-74
[5]   The UCSC Genome Browser database: update 2011 [J].
Fujita, Pauline A. ;
Rhead, Brooke ;
Zweig, Ann S. ;
Hinrichs, Angie S. ;
Karolchik, Donna ;
Cline, Melissa S. ;
Goldman, Mary ;
Barber, Galt P. ;
Clawson, Hiram ;
Coelho, Antonio ;
Diekhans, Mark ;
Dreszer, Timothy R. ;
Giardine, Belinda M. ;
Harte, Rachel A. ;
Hillman-Jackson, Jennifer ;
Hsu, Fan ;
Kirkup, Vanessa ;
Kuhn, Robert M. ;
Learned, Katrina ;
Li, Chin H. ;
Meyer, Laurence R. ;
Pohl, Andy ;
Raney, Brian J. ;
Rosenbloom, Kate R. ;
Smith, Kayla E. ;
Haussler, David ;
Kent, W. James .
NUCLEIC ACIDS RESEARCH, 2011, 39 :D876-D882
[6]   Bioconductor: open software development for computational biology and bioinformatics [J].
Gentleman, RC ;
Carey, VJ ;
Bates, DM ;
Bolstad, B ;
Dettling, M ;
Dudoit, S ;
Ellis, B ;
Gautier, L ;
Ge, YC ;
Gentry, J ;
Hornik, K ;
Hothorn, T ;
Huber, W ;
Iacus, S ;
Irizarry, R ;
Leisch, F ;
Li, C ;
Maechler, M ;
Rossini, AJ ;
Sawitzki, G ;
Smith, C ;
Smyth, G ;
Tierney, L ;
Yang, JYH ;
Zhang, JH .
GENOME BIOLOGY, 2004, 5 (10)
[7]   ChIP-chip versus ChIP-seq: Lessons for experimental design and data analysis [J].
Ho, Joshua W. K. ;
Bishop, Eric ;
Karchenko, Peter V. ;
Negre, Nicolas ;
White, Kevin P. ;
Park, Peter J. .
BMC GENOMICS, 2011, 12
[8]   Integrative annotation of chromatin elements from ENCODE data [J].
Hoffman, Michael M. ;
Ernst, Jason ;
Wilder, Steven P. ;
Kundaje, Anshul ;
Harris, Robert S. ;
Libbrecht, Max ;
Giardine, Belinda ;
Ellenbogen, Paul M. ;
Bilmes, Jeffrey A. ;
Birney, Ewan ;
Hardison, Ross C. ;
Dunham, Ian ;
Kellis, Manolis ;
Noble, William Stafford .
NUCLEIC ACIDS RESEARCH, 2013, 41 (02) :827-841
[9]   Genome-wide mapping of in vivo protein-DNA interactions [J].
Johnson, David S. ;
Mortazavi, Ali ;
Myers, Richard M. ;
Wold, Barbara .
SCIENCE, 2007, 316 (5830) :1497-1502
[10]   Design and analysis of ChIP-seq experiments for DNA-binding proteins [J].
Kharchenko, Peter V. ;
Tolstorukov, Michael Y. ;
Park, Peter J. .
NATURE BIOTECHNOLOGY, 2008, 26 (12) :1351-1359