Predicting methylation status of CpG islands in the human brain

被引:80
作者
Fang, Fang
Fan, Shicai
Zhang, Xuegong
Zhang, Michael Q. [1 ]
机构
[1] Tsinghua Univ, Dept Automat, Bioinformat Div, TNLIST, Beijing 100084, Peoples R China
[2] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11274 USA
关键词
D O I
10.1093/bioinformatics/btl377
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Over 50% of human genes contain CpG islands in their 5'-regions. Methylation patterns of CpG islands are involved in tissue-specific gene expression and regulation. Mis-epigenetic silencing associated with aberrant CpG island methylation is one mechanism leading to the loss of tumor suppressor functions in cancer cells. Large-scale experimental detection of DNA methylation is still both labor-intensive and time-consuming. Therefore, it is necessary to develop in silico approaches for predicting methylation status of CpG islands. Results: Based on a recent genome-scale dataset of DNA methylation in human brain tissues, we developed a classifier called MethCGI for predicting methylation status of CpG islands using a support vector machine (SVM). Nucleotide sequence contents as well as transcription factor binding sites (TFBSs) are used as features for the classification. The method achieves specificity of 84.65% and sensitivity of 84.32% on the brain data, and can also correctly predict about two-third of the data from other tissues reported in the MethDB database.
引用
收藏
页码:2204 / 2209
页数:6
相关论文
共 41 条
[1]   NUMBER OF CPG ISLANDS AND GENES IN HUMAN AND MOUSE [J].
ANTEQUERA, F ;
BIRD, A .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1993, 90 (24) :11995-11999
[2]   Assessing the accuracy of prediction algorithms for classification: an overview [J].
Baldi, P ;
Brunak, S ;
Chauvin, Y ;
Andersen, CAF ;
Nielsen, H .
BIOINFORMATICS, 2000, 16 (05) :412-424
[3]   Alu repeats and human genomic diversity [J].
Batzer, MA ;
Deininger, PL .
NATURE REVIEWS GENETICS, 2002, 3 (05) :370-379
[4]   Prediction of methylated CpGs in DNA sequences using a support vector machine [J].
Bhasin, M ;
Zhang, H ;
Reinherz, EL ;
Reche, PA .
FEBS LETTERS, 2005, 579 (20) :4302-4308
[5]   DNA methylation patterns and epigenetic memory [J].
Bird, A .
GENES & DEVELOPMENT, 2002, 16 (01) :6-21
[6]   Methylation-induced repression - Belts, braces, and chromatin [J].
Bird, AP ;
Wolffe, AP .
CELL, 1999, 99 (05) :451-454
[8]   CPG-RICH ISLANDS AND THE FUNCTION OF DNA METHYLATION [J].
BIRD, AP .
NATURE, 1986, 321 (6067) :209-213
[9]   CpG island methylation in human lymphocytes is highly correlated with DNA sequence, repeats, and predicted DNA structure [J].
Bock, Christoph ;
Paulsen, Martina ;
Tierling, Sascha ;
Mikeska, Thomas ;
Lengauer, Thomas ;
Walter, Joern .
PLOS GENETICS, 2006, 2 (03) :243-252
[10]   SVMTorch: Support vector machines for large-scale regression problems [J].
Collobert, R ;
Bengio, S .
JOURNAL OF MACHINE LEARNING RESEARCH, 2001, 1 (02) :143-160