Quantitative DNA methylation analysis based on four-dye trace data from direct sequencing of PCR amplificates

Cited by: 214
Authors
Lewin, J
Schmitt, AO
Adorján, P
Hildmann, T
Piepenbrock, C
Affiliations
[1] Epigenom AG, D-10178 Berlin, Germany
[2] Humboldt Univ, Inst Nutztierwissensch, D-10115 Berlin, Germany
DOI
10.1093/bioinformatics/bth346
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: Methylation of cytosines in DNA plays an important role in the regulation of gene expression, and the analysis of methylation patterns is fundamental to understanding cell differentiation, aging processes, diseases and cancer development. Such analysis has been limited because technologies for detailed and efficient high-throughput studies have not been available. We have developed a novel quantitative methylation analysis algorithm and workflow based on direct DNA sequencing of PCR products from bisulfite-treated DNA with high-throughput sequencing machines. This technology is a prerequisite for the success of the Human Epigenome Project, the first large genome-wide sequencing study of DNA methylation in many different tissues. Because tissue samples are mixtures of different cells, methylation in them is quantitative information, represented by cytosine/thymine proportions after bisulfite conversion of unmethylated cytosines to uracil and PCR. Calculating quantitative methylation information from base proportions, which are represented by different dye signals in four-dye sequencing trace files, requires a specific algorithm that handles imbalanced and overscaled signals, incomplete conversion, quality problems and basecaller artifacts.

Results: The algorithm we developed has several key properties: it analyzes trace files from PCR products of bisulfite-treated DNA sequenced directly on ABI machines; it yields quantitative methylation measurements for individual cytosine positions after alignment with genomic reference sequences, signal normalization and estimation of the effectiveness of bisulfite treatment; it works in a fully automated pipeline that includes data quality monitoring; and it is efficient, avoiding the usual cost of multiple sequencing runs on subclones to estimate DNA methylation. The power of the new algorithm is demonstrated with data from two test systems based on mixtures with known base compositions and defined methylation. In addition, its applicability is shown by identifying CpGs that are differentially methylated in real tissue samples.
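
The core idea in the abstract, reading methylation as a cytosine/thymine proportion and accounting for incomplete bisulfite conversion, can be illustrated with a minimal Python sketch. This is not the authors' algorithm: the function names, the use of non-CpG cytosines (assumed unmethylated) to estimate the conversion rate, and the simple linear correction are all assumptions made for illustration. The sketch also assumes the C and T peak signals have already been normalized, so it does not handle the dye imbalance, overscaling or basecaller artifacts that the abstract mentions.

```python
def methylation_fraction(c_signal, t_signal):
    """Raw methylation estimate at one CpG: C / (C + T) of normalized peak signals."""
    total = c_signal + t_signal
    if total <= 0:
        raise ValueError("C and T signals must sum to a positive value")
    return c_signal / total


def conversion_rate(non_cpg_sites):
    """Estimate bisulfite conversion efficiency from (C, T) signal pairs at
    non-CpG cytosines, which this sketch ASSUMES to be unmethylated, so any
    remaining C signal there is treated as unconverted cytosine."""
    c_sum = sum(c for c, _ in non_cpg_sites)
    t_sum = sum(t for _, t in non_cpg_sites)
    if c_sum + t_sum <= 0:
        raise ValueError("no usable signal at non-CpG sites")
    return t_sum / (c_sum + t_sum)


def corrected_methylation(c_signal, t_signal, conv):
    """Correct the raw estimate for incomplete conversion.

    Assumed model: raw = m + (1 - m) * (1 - conv), hence
    m = (raw - (1 - conv)) / conv, clipped to the interval [0, 1].
    """
    if not 0 < conv <= 1:
        raise ValueError("conversion rate must be in (0, 1]")
    raw = methylation_fraction(c_signal, t_signal)
    m = (raw - (1.0 - conv)) / conv
    return min(1.0, max(0.0, m))


# Hypothetical signal values, chosen only to exercise the functions.
conv = conversion_rate([(10, 90), (10, 90)])
print(conv, corrected_methylation(55, 45, conv))
```

With complete conversion (`conv` equal to 1) the corrected value reduces to the raw C / (C + T) proportion; as conversion drops, more of the observed C signal is attributed to unconverted, unmethylated cytosine.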
Pages: 3005-3012
Page count: 8