Large-scale imputation of epigenomic datasets for systematic annotation of diverse human tissues

Times cited: 242
Authors
Ernst, Jason [1, 2, 3, 4, 5]
Kellis, Manolis [6, 7]
Affiliations
[1] Univ Calif Los Angeles, Dept Biol Chem, Los Angeles, CA 90024 USA
[2] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90024 USA
[3] Univ Calif Los Angeles, Eli & Edythe Broad Ctr Regenerat Med & Stem Cell, Los Angeles, CA USA
[4] Univ Calif Los Angeles, Jonsson Comprehens Canc Ctr, Los Angeles, CA 90024 USA
[5] Univ Calif Los Angeles, Inst Mol Biol, Los Angeles, CA 90024 USA
[6] MIT Comp Sci & Artificial Intelligence Lab, Cambridge, MA USA
[7] Broad Inst MIT & Harvard, Cambridge, MA USA
Funding
U.S. National Science Foundation; U.S. National Institutes of Health;
Keywords
HISTONE MODIFICATIONS; CHROMATIN-STRUCTURE; METHYLATION STATUS; DNA METHYLATION; DISCOVERY; GENE; RESOLUTION; BINDING; ENCODE; IDENTIFICATION;
DOI
10.1038/nbt.3157
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
With hundreds of epigenomic maps, the opportunity arises to exploit the correlated nature of epigenetic signals, across both marks and samples, for large-scale prediction of additional datasets. Here, we undertake epigenome imputation by leveraging such correlations through an ensemble of regression trees. We impute 4,315 high-resolution signal maps, of which 26% are also experimentally observed. Imputed signal tracks show overall similarity to observed signals and surpass experimental datasets in consistency, recovery of gene annotations and enrichment for disease-associated variants. We use the imputed data to detect low-quality experimental datasets, to find genomic sites with unexpected epigenomic signals, to define high-priority marks for new experiments and to delineate chromatin states in 127 reference epigenomes spanning diverse tissues and cell types. Our imputed datasets provide the most comprehensive human regulatory region annotation to date, and our approach and the ChromImpute software constitute a useful complement to large-scale experimental mapping of epigenomic information.
Pages: 364 / U74
Number of pages: 16