Identification of sparsely distributed clusters of cis-regulatory elements in sets of co-expressed genes

被引:36
作者
Kreiman, G [1 ]
机构
[1] MIT, Ctr Biol & Comp Learning, McGovern Inst Brain Res, Cambridge, MA 02142 USA
关键词
D O I
10.1093/nar/gkh614
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 [生物化学与分子生物学]; 081704 [应用化学];
摘要
Sequence information and high-throughput methods to measure gene expression levels open the door to explore transcriptional regulation using computational tools. Combinatorial regulation and sparseness of regulatory elements throughout the genome allow organisms to control the spatial and temporal patterns of gene expression. Here we study the organization of cis-regulatory elements in sets of co-regulated genes. We build an algorithm to search for combinations of transcription factor binding sites that are enriched in a set of potentially co-regulated genes with respect to the whole genome. No knowledge is assumed about involvement of specific sets of transcription factors. Instead, the search is exhaustively conducted over combinations of up to four binding sites obtained from databases or motif search algorithms. We evaluate the performance on random sets of genes as a negative control and on three biologically validated sets of co-regulated genes in yeasts, flies and humans. We show that we can detect DNA regions that play a role in the control of transcription. These results shed light on the structure of transcription regulatory regions in eukaryotes and can be directly applied to clusters of co-expressed genes obtained in gene expression studies. Supplementary information is available at http://www.mit.edu/similar tokreiman/resources/cisregul/.
引用
收藏
页码:2889 / 2900
页数:12
相关论文
共 54 条
[1]
ADHYA S, 1989, ANNU REV GENET, V23, P227, DOI 10.1146/annurev.genet.23.1.227
[2]
Computational detection of cis-regulatory modules [J].
Aerts, Stein ;
Van Loo, Peter ;
Thijs, Gert ;
Moreau, Yves ;
De Moor, Bart .
BIOINFORMATICS, 2003, 19 :II5-II14
[3]
Alberts B., 1994, MOL BIOL CELL
[4]
Arnone MI, 1997, DEVELOPMENT, V124, P1851
[5]
Searching for statistically significant regulatory modules [J].
Bailey, Timothy L. ;
Noble, William Stafford .
BIOINFORMATICS, 2003, 19 :II16-II25
[6]
BAILEY TL, 1995, MACH LEARN, V21, P51, DOI 10.1007/BF00993379
[7]
SELECTION OF DNA-BINDING SITES BY REGULATORY PROTEINS - STATISTICAL-MECHANICAL THEORY AND APPLICATION TO OPERATORS AND PROMOTERS [J].
BERG, OG ;
VONHIPPEL, PH .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 193 (04) :723-743
[8]
Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome [J].
Berman, BP ;
Nibu, Y ;
Pfeiffer, BD ;
Tomancak, P ;
Celniker, SE ;
Levine, M ;
Rubin, GM ;
Eisen, MB .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (02) :757-762
[9]
Going the distance: A current view of enhancer action [J].
Blackwood, EM ;
Kadonaga, JT .
SCIENCE, 1998, 281 (5373) :60-63
[10]
Phylogenetic shadowing of primate sequences to find functional regions of the human genome [J].
Boffelli, D ;
McAuliffe, J ;
Ovcharenko, D ;
Lewis, KD ;
Ovcharenko, I ;
Pachter, L ;
Rubin, EM .
SCIENCE, 2003, 299 (5611) :1391-1394