Computational detection of cis-regulatory modules

被引:87
作者
Aerts, Stein [1 ]
Van Loo, Peter [1 ]
Thijs, Gert [1 ]
Moreau, Yves [1 ]
De Moor, Bart [1 ]
机构
[1] Katholieke Univ Leuven, Dept Elect Engn, ESAT, SCD, B-3001 Louvain, Belgium
关键词
D O I
10.1093/bioinformatics/btg1052
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The transcriptional regulation of a metazoan gene depends on the cooperative action of multiple transcription factors that bind to cis-regulatory modules (CRMs) located in the neighborhood of the gene. By integrating multiple signals, CRMs confer an organism specific spatial and temporal rate of transcription. Results: Based on the hypothesis that genes that are needed in exactly the same conditions might share similar regulatory switches, we have developed a novel methodology to find CRMs in a set of coexpressed or coregulated genes. The ModuleSearcher algorithm finds for a given gene set the best scoring combination of transcription factor binding sites within a sequence window using an A* procedure for tree searching. To keep the level of noise low, we use DNA sequences that are most likely to contain functional cis-regulatory information, namely conserved regions between human and mouse orthologous genes. The ModuleScanner performs genomic searches with a predicted CRM or with a user-defined CRM known from the literature to find possible target genes. The validity of a set of putative targets is checked using Gene Ontology annotations. We demonstrate the use and effectiveness of the ModuleSearcher and ModuleScanner algorithms and test their specificity and sensitivity on semi- artificial data. Next, we search for a module in a cluster of gene expression profiles of human cell cycle genes.
引用
收藏
页码:II5 / II14
页数:10
相关论文
共 34 条
[1]   Toucan:: deciphering the cis-regulatory logic of coregulated genes [J].
Aerts, S ;
Thijs, G ;
Coessens, B ;
Staes, M ;
Moreau, Y ;
Moor, BD .
NUCLEIC ACIDS RESEARCH, 2003, 31 (06) :1753-1764
[2]   Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome [J].
Berman, BP ;
Nibu, Y ;
Pfeiffer, BD ;
Tomancak, P ;
Celniker, SE ;
Levine, M ;
Rubin, GM ;
Eisen, MB .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (02) :757-762
[3]   AVID: A global alignment program [J].
Bray, N ;
Dubchak, I ;
Pachter, L .
GENOME RESEARCH, 2003, 13 (01) :97-102
[4]  
Davidson E.H, 2001, Genomic regulatory systems
[5]   SOURCE: a unified genomic resource of functional annotations, ontologies, and gene expression data [J].
Diehn, M ;
Sherlock, G ;
Binkley, G ;
Jin, H ;
Matese, JC ;
Hernandez-Boussard, T ;
Rees, CA ;
Cherry, JM ;
Botstein, D ;
Brown, PO ;
Alizadeh, AA .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :219-223
[6]   Statistical significance of clusters of motifs represented by position specific scoring matrices in nucleotide sequences [J].
Frith, MC ;
Spouge, JL ;
Hansen, U ;
Weng, ZP .
NUCLEIC ACIDS RESEARCH, 2002, 30 (14) :3214-3224
[7]  
Halfon MS, 2002, GENOME RES, V12, P1019, DOI 10.1101/gr.228902
[8]   A FORMAL BASIS FOR HEURISTIC DETERMINATION OF MINIMUM COST PATHS [J].
HART, PE ;
NILSSON, NJ ;
RAPHAEL, B .
IEEE TRANSACTIONS ON SYSTEMS SCIENCE AND CYBERNETICS, 1968, SSC4 (02) :100-+
[9]   The Ensembl genome database project [J].
Hubbard, T ;
Barker, D ;
Birney, E ;
Cameron, G ;
Chen, Y ;
Clark, L ;
Cox, T ;
Cuff, J ;
Curwen, V ;
Down, T ;
Durbin, R ;
Eyras, E ;
Gilbert, J ;
Hammond, M ;
Huminiecki, L ;
Kasprzyk, A ;
Lehvaslaiho, H ;
Lijnzaad, P ;
Melsopp, C ;
Mongin, E ;
Pettett, R ;
Pocock, M ;
Potter, S ;
Rust, A ;
Schmidt, E ;
Searle, S ;
Slater, G ;
Smith, J ;
Spooner, W ;
Stabenau, A ;
Stalker, J ;
Stupka, E ;
Ureta-Vidal, A ;
Vastrik, I ;
Clamp, M .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :38-41
[10]   Detection and visualization of compositionally similar cis-regulatory element clusters in orthologous and coordinately controlled genes [J].
Jegga, AG ;
Sherwood, SP ;
Carman, JW ;
Pinski, AT ;
Phillips, JL ;
Pestian, JP ;
Aronow, BJ .
GENOME RESEARCH, 2002, 12 (09) :1408-1417