An evaluation of analysis pipelines for DNA methylation profiling using the Illumina HumanMethylation450 BeadChip platform

被引:162
作者
Marabita, Francesco [1 ]
Almgren, Malin [2 ]
Lindholm, Malene E. [3 ]
Ruhrmann, Sabrina [2 ]
Fagerstrom-Billai, Fredrik [4 ]
Jagodic, Maja [2 ]
Sundberg, Carl J. [3 ]
Ekstrom, Tomas J. [2 ]
Teschendorff, Andrew E. [5 ]
Tegner, Jesper [1 ]
Gomez-Cabrero, David [1 ,6 ]
机构
[1] Karolinska Inst, Dept Med, Ctr Mol Med, Unit Computat Med, Stockholm, Sweden
[2] Karolinska Inst, Dept Clin Neurosci, Ctr Mol Med, Stockholm, Sweden
[3] Karolinska Inst, Dept Physiol & Pharmacol, Stockholm, Sweden
[4] Karolinska Inst, Dept Biosci & Nutr, Stockholm, Sweden
[5] UCL, UCL Canc Inst, Stat Genom Grp, London, England
[6] Bioinformat Infrastruct Life Sci, Stockholm, Sweden
基金
瑞典研究理事会;
关键词
technical variability; DNA methylation; microarray; Illumina; 450K; normalization; CPG ISLAND SHORES; QUANTILE NORMALIZATION; SUBSET-QUANTILE; HIGH-THROUGHPUT; HUMAN GENOME; CELLS; PLURIPOTENT; EPIGENETICS; MICROARRAY; CHALLENGES;
D O I
10.4161/epi.24008
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The proper identification of differentially methylated CpGs is central in most epigenetic studies. The Illumina HumanMethylation450 BeadChip is widely used to quantify DNA methylation; nevertheless, the design of an appropriate analysis pipeline faces severe challenges due to the convolution of biological and technical variability and the presence of a signal bias between Infinium I and II probe design types. Despite recent attempts to investigate how to analyze DNA methylation data with such an array design, it has not been possible to perform a comprehensive comparison between different bioinformatics pipelines due to the lack of appropriate data sets having both large sample size and sufficient number of technical replicates. Here we perform such a comparative analysis, targeting the problems of reducing the technical variability, eliminating the probe design bias and reducing the batch effect by exploiting two unpublished data sets, which included technical replicates and were profiled for DNA methylation either on peripheral blood, monocytes or muscle biopsies. We evaluated the performance of different analysis pipelines and demonstrated that: (1) it is critical to correct for the probe design type, since the amplitude of the measured methylation change depends on the underlying chemistry; (2) the effect of different normalization schemes is mixed, and the most effective method in our hands were quantile normalization and Beta Mixture Quantile dilation (BMIQ); (3) it is beneficial to correct for batch effects. In conclusion, our comparative analysis using a comprehensive data set suggests an efficient pipeline for proper identification of differentially methylated CpGs using the Illumina 450K arrays.
引用
收藏
页码:333 / 346
页数:14
相关论文
共 41 条
[11]   Epigenomics reveals a functional genome anatomy and a new approach to common disease [J].
Feinberg, Andrew P. .
NATURE BIOTECHNOLOGY, 2010, 28 (10) :1049-1052
[12]   Increased methylation variation in epigenetic domains across cancer types [J].
Hansen, Kasper Daniel ;
Timp, Winston ;
Bravo, Hector Corrada ;
Sabunciyan, Sarven ;
Langmead, Benjamin ;
McDonald, Oliver G. ;
Wen, Bo ;
Wu, Hao ;
Liu, Yun ;
Diep, Dinh ;
Briem, Eirikur ;
Zhang, Kun ;
Irizarry, Rafael A. ;
Feinberg, Andrew P. .
NATURE GENETICS, 2011, 43 (08) :768-U77
[13]   Comparison of sequencing-based methods to profile DNA methylation and identification of monoallelic epigenetic modifications [J].
Harris, R. Alan ;
Wang, Ting ;
Coarfa, Cristian ;
Nagarajan, Raman P. ;
Hong, Chibo ;
Downey, Sara L. ;
Johnson, Brett E. ;
Fouse, Shaun D. ;
Delaney, Allen ;
Zhao, Yongjun ;
Olshen, Adam ;
Ballinger, Tracy ;
Zhou, Xin ;
Forsberg, Kevin J. ;
Gu, Junchen ;
Echipare, Lorigail ;
O'Geen, Henriette ;
Lister, Ryan ;
Pelizzola, Mattia ;
Xi, Yuanxin ;
Epstein, Charles B. ;
Bernstein, Bradley E. ;
Hawkins, R. David ;
Ren, Bing ;
Chung, Wen-Yu ;
Gu, Hongcang ;
Bock, Christoph ;
Gnirke, Andreas ;
Zhang, Michael Q. ;
Haussler, David ;
Ecker, Joseph R. ;
Li, Wei ;
Farnham, Peggy J. ;
Waterland, Robert A. ;
Meissner, Alexander ;
Marra, Marco A. ;
Hirst, Martin ;
Milosavljevic, Aleksandar ;
Costello, Joseph F. .
NATURE BIOTECHNOLOGY, 2010, 28 (10) :1097-U194
[14]   DNA methylation biomarkers and their utility for solid cancer diagnostics [J].
Heichman, Karen A. ;
Warren, Jorja D. .
CLINICAL CHEMISTRY AND LABORATORY MEDICINE, 2012, 50 (10) :1707-1721
[15]   DNA methylation profiling in the clinic: applications and challenges [J].
Heyn, Holger ;
Esteller, Manel .
NATURE REVIEWS GENETICS, 2012, 13 (10) :679-692
[16]  
Hui S L, 1998, Stat Methods Med Res, V7, P354, DOI 10.1191/096228098671192352
[17]   CpG islands - 'A rough guide' [J].
Illingworth, Robert S. ;
Bird, Adrian P. .
FEBS LETTERS, 2009, 583 (11) :1713-1720
[18]   Comprehensive high-throughput arrays for relative methylation (CHARM) [J].
Irizarry, Rafael A. ;
Ladd-Acosta, Christine ;
Carvalho, Benilton ;
Wu, Hao ;
Brandenburg, Sheri A. ;
Jeddeloh, Jeffrey A. ;
Wen, Bo ;
Feinberg, Andrew P. .
GENOME RESEARCH, 2008, 18 (05) :780-790
[19]   The human colon cancer methylome shows similar hypo- and hypermethylation at conserved tissue-specific CpG island shores [J].
Irizarry, Rafael A. ;
Ladd-Acosta, Christine ;
Wen, Bo ;
Wu, Zhijin ;
Montano, Carolina ;
Onyango, Patrick ;
Cui, Hengmi ;
Gabo, Kevin ;
Rongione, Michael ;
Webster, Maree ;
Ji, Hong ;
Potash, James B. ;
Sabunciyan, Sarven ;
Feinberg, Andrew P. .
NATURE GENETICS, 2009, 41 (02) :178-186
[20]   Bump hunting to identify differentially methylated regions in epigenetic epidemiology studies [J].
Jaffe, Andrew E. ;
Murakami, Peter ;
Lee, Hwajin ;
Leek, Jeffrey T. ;
Fallin, M. Daniele ;
Feinberg, Andrew P. ;
Irizarry, Rafael A. .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2012, 41 (01) :200-209