Identifying and mitigating bias in next-generation sequencing methods for chromatin biology

被引:239
作者
Meyer, Clifford A. [1 ]
Liu, X. Shirley
机构
[1] Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02115 USA
基金
美国国家卫生研究院;
关键词
DIFFERENTIAL EXPRESSION ANALYSIS; CHIP-SEQ DATA; ALLELE-SPECIFIC EXPRESSION; HI-C DATA; QUALITY-CONTROL; TRANSCRIPTION FACTORS; GENE-EXPRESSION; COPY NUMBER; DNASE-I; TECHNICAL CONSIDERATIONS;
D O I
10.1038/nrg3788
中图分类号
Q3 [遗传学];
学科分类号
071007 [遗传学];
摘要
Next-generation sequencing (NGS) technologies have been used in diverse ways to investigate various aspects of chromatin biology by identifying genomic loci that are bound by transcription factors, occupied by nucleosomes or accessible to nuclease cleavage, or loci that physically interact with remote genomic loci. However, reaching sound biological conclusions from such NGS enrichment profiles requires many potential biases to be taken into account. In this Review, we discuss common ways in which biases may be introduced into NGS chromatin profiling data, approaches to diagnose these biases and analytical techniques to mitigate their effect.
引用
收藏
页码:709 / 721
页数:13
相关论文
共 117 条
[1]
Personalized copy number and segmental duplication maps using next-generation sequencing [J].
Alkan, Can ;
Kidd, Jeffrey M. ;
Marques-Bonet, Tomas ;
Aksay, Gozde ;
Antonacci, Francesca ;
Hormozdiari, Fereydoun ;
Kitzman, Jacob O. ;
Baker, Carl ;
Malig, Maika ;
Mutlu, Onur ;
Sahinalp, S. Cenk ;
Gibbs, Richard A. ;
Eichler, Evan E. .
NATURE GENETICS, 2009, 41 (10) :1061-U29
[2]
Differential expression analysis for sequence count data [J].
Anders, Simon ;
Huber, Wolfgang .
GENOME BIOLOGY, 2010, 11 (10)
[3]
Count-based differential expression analysis of RNA sequencing data using R and Bioconductor [J].
Anders, Simon ;
McCarthy, Davis J. ;
Chen, Yunshun ;
Okoniewski, Michal ;
Smyth, Gordon K. ;
Huber, Wolfgang ;
Robinson, Mark D. .
NATURE PROTOCOLS, 2013, 8 (09) :1765-1786
[4]
High-resolution profiling of histone methylations in the human genome [J].
Barski, Artern ;
Cuddapah, Suresh ;
Cui, Kairong ;
Roh, Tae-Young ;
Schones, Dustin E. ;
Wang, Zhibin ;
Wei, Gang ;
Chepelev, Iouri ;
Zhao, Keji .
CELL, 2007, 129 (04) :823-837
[5]
Summarizing and correcting the GC content bias in high-throughput sequencing [J].
Benjamini, Yuval ;
Speed, Terence P. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (10) :e72
[6]
Quantifying ChIP-seq data: a spiking method providing an internal reference for sample-to-sample normalization [J].
Bonhoure, Nicolas ;
Bounova, Gergana ;
Bernasconi, David ;
Praz, Viviane ;
Lammers, Fabienne ;
Canella, Donatella ;
Willis, Ian M. ;
Herr, Winship ;
Hernandez, Nouria ;
Delorenzi, Mauro .
GENOME RESEARCH, 2014, 24 (07) :1157-1168
[7]
High-resolution genome-wide in vivo footprinting of diverse transcription factors in human cells [J].
Boyle, Alan P. ;
Song, Lingyun ;
Lee, Bum-Kyu ;
London, Darin ;
Keefe, Damian ;
Birney, Ewan ;
Iyer, Vishwanath R. ;
Crawford, Gregory E. ;
Furey, Terrence S. .
GENOME RESEARCH, 2011, 21 (03) :456-464
[8]
Model-Based Quality Assessment and Base-Calling for Second-Generation Sequencing Data [J].
Bravo, Hector Corrada ;
Irizarry, Rafael A. .
BIOMETRICS, 2010, 66 (03) :665-674
[9]
Buenrostro JD, 2013, NAT METHODS, V10, P1213, DOI [10.1038/nmeth.2688, 10.1038/NMETH.2688]
[10]
CAMPBELL VW, 1980, J BIOL CHEM, V255, P3726