The ENCODE Blacklist: Identification of Problematic Regions of the Genome

Cited by: 671
Authors
Amemiya, Haley M. [1, 2]
Kundaje, Anshul [3]
Boyle, Alan P. [1, 2, 4]
Affiliations
[1] Univ Michigan, Grad Program Cellular & Mol Biol, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[3] Stanford Sch Med, Dept Genet, Stanford, CA 94305 USA
[4] Univ Michigan, Dept Human Genet, Ann Arbor, MI 48109 USA
Keywords
ChIP-seq
DOI
10.1038/s41598-019-45839-z
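Chinese Library Classification number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Subject classification code
07; 0710; 09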
Abstract
Functional genomics assays based on high-throughput sequencing greatly expand our ability to understand the genome. Here, we define the ENCODE blacklist: a comprehensive set of regions in the human, mouse, worm, and fly genomes that have anomalous, unstructured, or high signal in next-generation sequencing experiments, independent of cell line or experiment. Removing the ENCODE blacklist regions is an essential quality measure when analyzing functional genomics data.
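As an illustration of the removal step the abstract describes, the following is a minimal sketch, not the authors' implementation. It assumes that peaks and blacklist regions are given as BED-style `(chrom, start, end)` tuples with half-open coordinates. The function name `remove_blacklisted` and all coordinates in the usage example are hypothetical.

```python
from collections import defaultdict


def remove_blacklisted(peaks, blacklist):
    """Return the peaks that do not overlap any blacklist region.

    Both arguments are iterables of (chrom, start, end) tuples using
    half-open, BED-style coordinates (an assumption of this sketch).
    """
    # Group blacklist intervals by chromosome for quicker lookup.
    by_chrom = defaultdict(list)
    for chrom, start, end in blacklist:
        by_chrom[chrom].append((start, end))

    kept = []
    for chrom, start, end in peaks:
        # Two half-open intervals overlap when each starts before the other ends.
        overlaps = any(
            start < b_end and b_start < end
            for b_start, b_end in by_chrom.get(chrom, ())
        )
        if not overlaps:
            kept.append((chrom, start, end))
    return kept


# Hypothetical coordinates, for illustration only.
example_peaks = [("chr1", 100, 200), ("chr1", 300, 400), ("chr2", 100, 200)]
example_blacklist = [("chr1", 150, 300)]
filtered = remove_blacklisted(example_peaks, example_blacklist)
```

In this example the first peak is dropped because it overlaps the blacklisted interval, while the peak starting exactly at position 300 is kept because half-open intervals that only touch do not overlap. For genome-scale data, an interval tree or a dedicated interval tool would likely be preferable to this linear scan per peak.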
Pages: 5