An Integrated Approach to Identifying Cis-Regulatory Modules in the Human Genome

被引:15
作者
Won, Kyoung-Jae
Agarwal, Saurabh
Shen, Li
Shoemaker, Robert
Ren, Bing
Wang, Wei
机构
[1] Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA
[2] Ludwig Institute for Cancer Research, Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA
来源
PLOS ONE | 2009年 / 4卷 / 05期
关键词
HIDDEN MARKOV-MODELS; CLUSTERS; PREDICTION; DISCOVERY; PROMOTERS; MOTIFS;
D O I
10.1371/journal.pone.0005501
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In eukaryotic genomes, it is challenging to accurately determine target sites of transcription factors (TFs) by only using sequence information. Previous efforts were made to tackle this task by considering the fact that TF binding sites tend to be more conserved than other functional sites and the binding sites of several TFs are often clustered. Recently, ChIP-chip and ChIP-sequencing experiments have been accumulated to identify TF binding sites as well as survey the chromatin modification patterns at the regulatory elements such as promoters and enhancers. We propose here a hidden Markov model (HMM) to incorporate sequence motif information, TF-DNA interaction data and chromatin modification patterns to precisely identify cis-regulatory modules (CRMs). We conducted ChIP-chip experiments on four TFs, CREB, E2F1, MAX, and YY1 in 1% of the human genome. We then trained a hidden Markov model (HMM) to identify the labels of the CRMs by incorporating the sequence motifs recognized by these TFs and the ChIP-chip ratio. Chromatin modification data was used to predict the functional sites and to further remove false positives. Cross-validation showed that our integrated HMM had a performance superior to other existing methods on predicting CRMs. Incorporating histone signature information successfully penalized false prediction and improved the whole performance. The dataset we used and the software are available at http://nash.ucsd.edu/CIS/.
引用
收藏
页数:8
相关论文
共 30 条
[1]   High-resolution profiling of histone methylations in the human genome [J].
Barski, Artern ;
Cuddapah, Suresh ;
Cui, Kairong ;
Roh, Tae-Young ;
Schones, Dustin E. ;
Wang, Zhibin ;
Wei, Gang ;
Chepelev, Iouri ;
Zhao, Keji .
CELL, 2007, 129 (04) :823-837
[2]   Genome-wide computational prediction of transcriptional regulatory modules reveals new insights into human gene expression [J].
Blanchette, M ;
Bataille, AR ;
Chen, XY ;
Poitras, C ;
Laganière, J ;
Lefèbvre, C ;
Deblois, G ;
Giguère, V ;
Ferretti, V ;
Bergeron, D ;
Coulombe, B ;
Robert, FO .
GENOME RESEARCH, 2006, 16 (05) :656-668
[3]  
Bulyk ML, 2004, GENOME BIOL, V5
[4]   WebLogo: A sequence logo generator [J].
Crooks, GE ;
Hon, G ;
Chandonia, JM ;
Brenner, SE .
GENOME RESEARCH, 2004, 14 (06) :1188-1190
[5]  
Durbin R., 1998, Analysis, V356, DOI [10.1017/CBO9780511790492, DOI 10.1017/CBO9780511790492]
[6]   Cluster-Buster: finding dense clusters of motifs in DNA sequences [J].
Frith, MC ;
Li, MC ;
Weng, ZP .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3666-3668
[7]   Statistical significance of clusters of motifs represented by position specific scoring matrices in nucleotide sequences [J].
Frith, MC ;
Spouge, JL ;
Hansen, U ;
Weng, ZP .
NUCLEIC ACIDS RESEARCH, 2002, 30 (14) :3214-3224
[8]   Detection of cis-element clusters in higher eukaryotic DNA [J].
Frith, MC ;
Hansen, U ;
Weng, ZP .
BIOINFORMATICS, 2001, 17 (10) :878-889
[9]   De novo cis-regulatory module elicitation for eukaryotic genomes [J].
Gupta, M ;
Liu, JS .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (20) :7079-7084
[10]   Transcriptional regulatory code of a eukaryotic genome [J].
Harbison, CT ;
Gordon, DB ;
Lee, TI ;
Rinaldi, NJ ;
Macisaac, KD ;
Danford, TW ;
Hannett, NM ;
Tagne, JB ;
Reynolds, DB ;
Yoo, J ;
Jennings, EG ;
Zeitlinger, J ;
Pokholok, DK ;
Kellis, M ;
Rolfe, PA ;
Takusagawa, KT ;
Lander, ES ;
Gifford, DK ;
Fraenkel, E ;
Young, RA .
NATURE, 2004, 431 (7004) :99-104