Estimating absolute methylation levels at single-CpG resolution from methylation enrichment and restriction enzyme sequencing methods

被引:111
作者
Stevens, Michael [1 ,2 ]
Cheng, Jeffrey B. [3 ]
Li, Daofeng [1 ]
Xie, Mingchao [1 ]
Hong, Chibo [4 ]
Maire, Cecile L. [5 ]
Ligon, Keith L. [5 ,6 ,7 ]
Hirst, Martin [8 ,9 ]
Marra, Marco A. [8 ]
Costello, Joseph F. [4 ]
Wang, Ting [1 ,2 ]
机构
[1] Washington Univ, Sch Med, Dept Genet, Ctr Genome Sci & Syst Biol, St Louis, MO 63108 USA
[2] Washington Univ, Dept Comp Sci & Engn, St Louis, MO 63130 USA
[3] Univ Calif San Francisco, Dept Dermatol, San Francisco, CA 94143 USA
[4] Univ Calif San Francisco, Brain Tumor Res Ctr, Dept Neurosurg, Helen Diller Family Comprehens Canc Ctr, San Francisco, CA 94143 USA
[5] Dana Farber Canc Inst, Ctr Mol Oncol Pathol, Dept Med Oncol, Boston, MA 02215 USA
[6] Boston Childrens Hosp, Dept Pathol, Brigham & Womens Hosp, Boston, MA 02115 USA
[7] Harvard Univ, Sch Med, Boston, MA 02115 USA
[8] BC Canc Agcy, Canadas Michael Smith Genome Sci Ctr Vancouver, Vancouver, BC V5Z 4S6, Canada
[9] Univ British Columbia, Dept Microbiol & Immunol, Vancouver, BC V6T 1Z3, Canada
关键词
WIDE DNA METHYLATION; METHYLOME; PREDICTION; SITES; MEDIP;
D O I
10.1101/gr.152231.112
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Recent advancements in sequencing-based DNA methylation profiling methods provide an unprecedented opportunity to map complete DNA methylomes. These include whole-genome bisulfite sequencing (WGBS, MethyiC-seq, or BS-seq), reduced-representation bisulfite sequencing (RRBS), and enrichment-based methods such as MeDIP-seq, MBD-seq, and MRE-seq. These methods yield largely comparable results but differ significantly in extent of genomic CpG coverage, resolution, quantitative accuracy, and cost, at least while using current algorithms to interrogate the data. None of these existing methods provides single-CpG resolution, comprehensive genome-wide coverage, and cost feasibility for a typical laboratory. We introduce methylCRF, a novel conditional random fields based algorithm that integrates methylated DNA immunoprecipitation (MeDIP-seq) and methylation-sensitive restriction enzyme (MRE-seq) sequencing data to predict DNA methylation levels at single-CpG resolution. Our method is a combined computational and experimental strategy to produce DNA methylomes of all 28 million CpGs in the human genome for a fraction (<10%) of the cost of whole-genome bisulfite sequencing methods. methylCRF was benchmarked for accuracy against Infinium arrays, RRBS, WGBS sequencing, and locus-specific bisulfite sequencing performed on the same human embryonic stem cell line. methylCRF transformation of MeDIP-seq/MRE-seq was equivalent to a biological replicate of WGBS in quantification, coverage, and resolution. We used conventional bisulfite conversion, PCR, cloning, and sequencing to validate loci where our predictions do not agree with whole-genome bisulfite data, and in 11 out of 12 cases, methylCRF predictions of methylation level agree better with validated results than does whole-genome bisulfite sequencing. Therefore, methylCRF transformation of MeDIP-seq/MRE-seq data provides an accurate, inexpensive, and widely accessible strategy to create full DNA methylomes.
引用
收藏
页码:1541 / 1553
页数:13
相关论文
共 43 条
[31]   Comparison of methyl-DNA immunoprecipitation (MeDIP) and methyl-CpG binding domain (MBD) protein capture for genome-wide DNA methylation analysis reveal CpG sequence coverage bias [J].
Nair, Shalima S. ;
Coolen, Marcel W. ;
Stirzaker, Clare ;
Song, Jenny Z. ;
Statham, Aaron L. ;
Strbenac, Dario ;
Robinson, Mark D. ;
Clark, Susan J. .
EPIGENETICS, 2011, 6 (01) :34-44
[32]   Fast and sensitive mapping of bisulfite-treated sequencing data [J].
Otto, Christian ;
Stadler, Peter F. ;
Hoffmann, Steve .
BIOINFORMATICS, 2012, 28 (13) :1698-1704
[33]   MEDME: An experimental and analytical methodology for the estimation of DNA methylation levels based on microarray derived MeDIP-enrichment [J].
Pelizzola, Mattia ;
Koga, Yasuo ;
Urban, Alexander Eckehart ;
Krauthammer, Michael ;
Weissman, Sherman ;
Halaban, Ruth ;
Molinaro, Annette M. .
GENOME RESEARCH, 2008, 18 (10) :1652-1659
[34]   DNA methylation and human disease [J].
Robertson, KD .
NATURE REVIEWS GENETICS, 2005, 6 (08) :597-610
[35]   BISMA - Fast and accurate bisulfite sequencing data analysis of individual clones from unique and repetitive sequences [J].
Rohde, Christian ;
Zhang, Yingying ;
Reinhardt, Richard ;
Jeltsch, Albert .
BMC BIOINFORMATICS, 2010, 11
[36]   MBD-isolated Genome Sequencing provides a high-throughput and comprehensive survey of DNA methylation in the human genome [J].
Serre, David ;
Lee, Byron H. ;
Ting, Angela H. .
NUCLEIC ACIDS RESEARCH, 2010, 38 (02) :391-399
[37]  
Sha F, 2003, HLT-NAACL 2003: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, P213
[38]   DNA methylation landscapes: provocative insights from epigenomics [J].
Suzuki, Miho M. ;
Bird, Adrian .
NATURE REVIEWS GENETICS, 2008, 9 (06) :465-476
[39]  
Wallach H, 2004, MSCIS0421 U PENNS DE
[40]   Chromosome-wide and promoter-specific analyses identify sites of differential DNA methylation in normal and transformed human cells [J].
Weber, M ;
Davies, JJ ;
Wittig, D ;
Oakeley, EJ ;
Haase, M ;
Lam, WL ;
Schübeler, D .
NATURE GENETICS, 2005, 37 (08) :853-862