Integration of genome and chromatin structure with gene expression profiles to predict c-MYC recognition site binding and function

被引:26
作者
Chen, Yili
Blackwell, Thomas W.
Chen, Ji
Gao, Jing
Lee, Angel W.
States, David J. [1 ]
机构
[1] Univ Michigan, Sch Med, Bioinformat Program, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Sch Med, Dept Human Genet, Ann Arbor, MI 48109 USA
[3] Univ Michigan, Sch Med, Dept Pharmacol, Ann Arbor, MI 48109 USA
关键词
D O I
10.1371/journal.pcbi.0030063
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The MYC genes encode nuclear sequence specific-binding DNA-binding proteins that are pleiotropic regulators of cellular function, and the c-MYC proto-oncogene is deregulated and/or mutated in most human cancers. Experimental studies of MYC binding to the genome are not fully consistent. While many c-MYC recognition sites can be identified in c-MYC responsive genes, other motif matches-even experimentally confirmed sites-are associated with genes showing no c-MYC response. We have developed a computational model that integrates multiple sources of evidence to predict which genes will bind and be regulated by MYC in vivo. First, a Bayesian network classifier is used to predict those c-MYC recognition sites that are most likely to exhibit high-occupancy binding in chromatin immunoprecipitation studies. This classifier incorporates genomic sequence, experimentally determined genomic chromatin acetylation islands, and predicted methylation status from a computational model estimating the likelihood of genomic DNA methylation. We find that the predictions from this classifier are also applicable to other transcription factors, such as cAMP-response element-binding protein, whose binding sites are sensitive to DNA methylation. Second, the MYC binding probability is combined with the gene expression profile data from nine independent microarray datasets in multiple tissues. Finally, we may consider gene function annotations in Gene Ontology to predict the c-MYC targets. We assess the performance of our prediction results by comparing them with the c-myc targets identified in the biomedical literature. In total, we predict 460 likely c-MYC target genes in the human genome, of which 67 have been reported to be both bound and regulated by MYC, 68 are bound by MYC, and another 80 are MYC-regulated. The approach thus successfully identifies many known c-MYC targets and suggests many novel sites. Our findings suggest that to identify c-MYC genomic targets, integration of different data sources helps to improve the accuracy.
引用
收藏
页码:602 / 615
页数:14
相关论文
共 59 条
  • [1] Amati B., 2001, BIOCHIM BIOPHYS ACTA, V1471, P135
  • [2] MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia
    Armstrong, SA
    Staunton, JE
    Silverman, LB
    Pieters, R
    de Boer, ML
    Minden, MD
    Sallan, SE
    Lander, ES
    Golub, TR
    Korsmeyer, SJ
    [J]. NATURE GENETICS, 2002, 30 (01) : 41 - 47
  • [3] Reverse engineering of regulatory networks in human B cells
    Basso, K
    Margolin, AA
    Stolovitzky, G
    Klein, U
    Dalla-Favera, R
    Califano, A
    [J]. NATURE GENETICS, 2005, 37 (04) : 382 - 390
  • [4] Involvement of Myc targets in c-myc and N-myc induced human tumors
    Ben-Yosef, T
    Yanuka, O
    Halle, D
    Benvenisty, N
    [J]. ONCOGENE, 1998, 17 (02) : 165 - 171
  • [5] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
    BENJAMINI, Y
    HOCHBERG, Y
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300
  • [6] Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses
    Bhattacharjee, A
    Richards, WG
    Staunton, J
    Li, C
    Monti, S
    Vasa, P
    Ladd, C
    Beheshti, J
    Bueno, R
    Gillette, M
    Loda, M
    Weber, G
    Mark, EJ
    Lander, ES
    Wong, W
    Johnson, BE
    Golub, TR
    Sugarbaker, DJ
    Meyerson, M
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (24) : 13790 - 13795
  • [7] BIRD A, 1985, CELL, V40, P91, DOI 10.1016/0092-8674(85)90312-5
  • [8] Unbiased mapping of transcription factor binding sites along human chromosomes 21 and 22 points to widespread regulation of noncoding RNAs
    Cawley, S
    Bekiranov, S
    Ng, HH
    Kapranov, P
    Sekinger, EA
    Kampa, D
    Piccolboni, A
    Sementchenko, V
    Cheng, J
    Williams, AJ
    Wheeler, R
    Wong, B
    Drenkow, J
    Yamanaka, M
    Patel, S
    Brubaker, S
    Tammana, H
    Helt, G
    Struhl, K
    Gingeras, TR
    [J]. CELL, 2004, 116 (04) : 499 - 509
  • [9] Treatment-specific changes in gene expression discriminate in vivo drug response in human leukemia cells
    Cheok, MH
    Yang, WL
    Pui, CH
    Downing, JR
    Cheng, C
    Naeve, CW
    Relling, MV
    Evans, WE
    [J]. NATURE GENETICS, 2003, 34 (01) : 85 - 90
  • [10] CPNPG METHYLATION IN MAMMALIAN-CELLS
    CLARK, SJ
    HARRISON, J
    FROMMER, M
    [J]. NATURE GENETICS, 1995, 10 (01) : 20 - 27