Computational prediction of methylation status in human genomic sequences

被引:111
作者
Das, Rajdeep
Dimitrova, Nevenka
Xuan, Zhenyu
Rollins, Robert A.
Haghighi, Fatemah
Edwards, John R.
Ju, Jingyue
Bestor, Timothy H.
Zhang, Michael Q. [1 ]
机构
[1] Columbia Univ, Columbia Genome Ctr, New York, NY 10032 USA
[2] Columbia Univ, Dept Chem Engn, New York, NY 10032 USA
[3] Columbia Univ Coll Phys & Surg, Dept Genet & Dev, New York, NY 10032 USA
[4] Philips Labs, Briarcliff Manor, NY 10510 USA
[5] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11724 USA
关键词
DNA methylation; epigenomics; methylation prediction; CpG islands;
D O I
10.1073/pnas.0602949103
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Epigenetic effects in mammals depend largely on heritable genomic methylation patterns. We describe a computational pattern recognition method that is used to predict the methylation landscape of human brain DNA. This method can be applied both to CpG islands and to non-CpG island regions. It computes the methylation propensity for an 800-bp region centered on a CpG dinucleotide based on specific sequence features within the region. We tested several classifiers for classification performance, including K means clustering, linear discriminant analysis, logistic regression, and support vector machine. The best performing classifier used the support vector machine approach. Our program (called HDFINDER) presently has a prediction accuracy of 86%, as validated with CpG regions for which methylation status has been experimentally determined. Using HDFINDER, we have depicted the entire genomic methylation patterns for all 22 human autosomes.
引用
收藏
页码:10713 / 10716
页数:4
相关论文
共 19 条
[1]   Selection bias in gene extraction on the basis of microarray gene-expression data [J].
Ambroise, C ;
McLachlan, GJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) :6562-6566
[2]   NUMBER OF CPG ISLANDS AND GENES IN HUMAN AND MOUSE [J].
ANTEQUERA, F ;
BIRD, A .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1993, 90 (24) :11995-11999
[3]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[4]   Epigenomics: Beyond CpG islands [J].
Fazzari, MJ ;
Greally, JM .
NATURE REVIEWS GENETICS, 2004, 5 (06) :446-455
[5]   Predicting aberrant CpG island methylation [J].
Feltus, FA ;
Lee, EK ;
Costello, JF ;
Plass, C ;
Vertino, PM .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (21) :12253-12258
[6]   CPG ISLANDS IN VERTEBRATE GENOMES [J].
GARDINERGARDEN, M ;
FROMMER, M .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 196 (02) :261-282
[7]   Large-scale methylation analysis of human genomic DNA reveals tissue-specific differences between the methylation profiles of genes and pseudogenes [J].
Grunau, C ;
Hindermann, W ;
Rosenthal, A .
HUMAN MOLECULAR GENETICS, 2000, 9 (18) :2651-2663
[8]   Gene selection for cancer classification using support vector machines [J].
Guyon, I ;
Weston, J ;
Barnhill, S ;
Vapnik, V .
MACHINE LEARNING, 2002, 46 (1-3) :389-422
[9]   EVIDENCE FOR EROSION OF MOUSE CPG ISLANDS DURING MAMMALIAN EVOLUTION [J].
MATSUO, K ;
CLAY, O ;
TAKAHASHI, T ;
SILKE, J ;
SCHAFFNER, W .
SOMATIC CELL AND MOLECULAR GENETICS, 1993, 19 (06) :543-555
[10]   Alu sequences [J].
Mighell, AJ ;
Markham, AF ;
Robinson, PA .
FEBS LETTERS, 1997, 417 (01) :1-5