Discovering regulatory elements in non-coding sequences by analysis of spaced dyads

被引:200
作者
van Heiden, Jacques [1 ]
Rios, Alma. F. [2 ]
Collado-Vides, Julio [2 ]
机构
[1] Univ Libre Bruxelles, Unite Conformat Macromol Biol, B-1050 Brussels, Belgium
[2] Univ Nacl Autonoma Mexico, Ctr Invest Fijac Nitrogeno, Cuernavaca 62100, Morelos, Mexico
关键词
D O I
10.1093/nar/28.8.1808
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The application of microarray and related technologies is currently generating a systematic catalog of the transcriptional response of any single gene to a multiplicity of experimental conditions. Clustering genes according to the similarity of their transcriptional response provides a direct hint to the regulons of the different transcription factors, many of which have still not been characterized. We have developed a new method for deciphering the mechanism underlying the common transcriptional response of a set of genes, i.e. discovering cis-acting regulatory elements from a set of unaligned upstream sequences. This method, called dyad analysis, is based on the observation that many regulatory sites consist of a pair of highly conserved trinucleotides, spaced by a non-conserved region of fixed width. The approach is to count the number of occurrences of each possible spaced pair of trinucleotides, and to assess its statistical significance. The method is highly efficient in the detection of sites bound by C-6 Zn-2 binuclear cluster proteins, as well as other transcription factors. In addition, we show that the dyad and single-word analyses are efficient for the detection of regulatory patterns in gene clusters from DNA chip experiments. In combination, these programs should provide a fast and efficient way to discover new regulatory sites for as yet unknown transcription factors.
引用
收藏
页码:1808 / 1818
页数:11
相关论文
共 36 条
[1]   YEAST MULTIDRUG-RESISTANCE - THE PDR NETWORK [J].
BALZI, E ;
GOFFEAU, A .
JOURNAL OF BIOENERGETICS AND BIOMEMBRANES, 1995, 27 (01) :71-76
[2]   A nonameric core sequence is required upstream of the LYS genes of Saccharomyces cerevisiae for Lys14p-mediated activation and apparent repression by lysine [J].
Becker, B ;
Feller, A ;
El Alami, M ;
Dubois, E ;
Piérard, A .
MOLECULAR MICROBIOLOGY, 1998, 29 (01) :151-163
[3]   Exploring the new world of the genome with DNA microarrays [J].
Brown, PO ;
Botstein, D .
NATURE GENETICS, 1999, 21 (Suppl 1) :33-37
[4]   EXPECTATION MAXIMIZATION ALGORITHM FOR IDENTIFYING PROTEIN-BINDING SITES WITH VARIABLE LENGTHS FROM UNALIGNED DNA FRAGMENTS [J].
CARDON, LR ;
STORMO, GD .
JOURNAL OF MOLECULAR BIOLOGY, 1992, 223 (01) :159-170
[5]   The transcriptional program of sporulation in budding yeast [J].
Chu, S ;
DeRisi, J ;
Eisen, M ;
Mulholland, J ;
Botstein, D ;
Brown, PO ;
Herskowitz, I .
SCIENCE, 1998, 282 (5389) :699-705
[6]   Exploring the metabolic and genetic control of gene expression on a genomic scale [J].
DeRisi, JL ;
Iyer, VR ;
Brown, PO .
SCIENCE, 1997, 278 (5338) :680-686
[7]   Genomic-scale analysis goes upstream? [J].
Goffeau, A .
NATURE BIOTECHNOLOGY, 1998, 16 (10) :907-908
[8]  
Grundy WN, 1997, COMPUT APPL BIOSCI, V13, P397
[9]   Identifying DNA and protein patterns with statistically significant alignments of multiple sequences [J].
Hertz, GZ ;
Stormo, GD .
BIOINFORMATICS, 1999, 15 (7-8) :563-577
[10]  
HERTZ GZ, 1990, COMPUT APPL BIOSCI, V6, P81