A universal framework for regulatory element discovery across all Genomes and data types

被引:228
作者
Elemento, Olivier
Slonim, Noam
Tavazoie, Saeed [1 ]
机构
[1] Princeton Univ, Lewis Sigler Inst Integrat Genom, Princeton, NJ 08544 USA
[2] Princeton Univ, Dept Mol Biol, Princeton, NJ 08544 USA
关键词
D O I
10.1016/j.molcel.2007.09.027
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Deciphering the noncoding regulatory genome has proved a formidable challenge. Despite the wealth of available gene expression data, there currently exists no broadly applicable method for characterizing the regulatory elements that shape the rich underlying dynamics. We present a general framework for detecting such regulatory DNA and RNA motifs that relies on directly assessing the mutual information between sequence and gene expression measurements. Our approach makes minimal assumptions about the background sequence model and the mechanisms by which elements affect gene expression. This provides a versatile motif discovery framework, across all data types and genomes, with exceptional sensitivity and near-zero false-positive rates. Applications from yeast to human uncover putative and established transcription -factor binding and miRNA target sites, revealing rich diversity in their spatial configurations, pervasive cooccurrences of DNA and RNA motifs, context dependent selection for motif avoidance, and the strong impact of post transcriptional processes on eukaryotic transcriptomes.
引用
收藏
页码:337 / 350
页数:14
相关论文
共 33 条
[1]  
[Anonymous], 2006, Elements of information theory
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]   Predicting gene expression from sequence [J].
Beer, MA ;
Tavazoie, S .
CELL, 2004, 117 (02) :185-198
[4]   The cyclin B2 promoter depends on NF-Y, a trimer whose CCAAT-binding activity is cell-cycle regulated [J].
Bolognese, F ;
Wasner, M ;
Lange-zu Dohna, C ;
Gurtner, A ;
Ronchi, A ;
Muller, H ;
Manni, I ;
Mossner, J ;
Piaggio, G ;
Mantovani, R ;
Engeland, K .
ONCOGENE, 1999, 18 (10) :1845-1853
[5]   The transcriptome of the intraerythrocytic developmental cycle of Plasmodium falciparum [J].
Bozdech, Z ;
Llinás, M ;
Pulliam, BL ;
Wong, ED ;
Zhu, JC ;
DeRisi, JL .
PLOS BIOLOGY, 2003, 1 (01) :85-100
[6]   WEIGHT MATRIX DESCRIPTIONS OF 4 EUKARYOTIC RNA POLYMERASE-II PROMOTER ELEMENTS DERIVED FROM 502 UNRELATED PROMOTER SEQUENCES [J].
BUCHER, P .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 212 (04) :563-578
[7]   Regulatory element detection using correlation with expression [J].
Bussemaker, HJ ;
Li, H ;
Siggia, ED .
NATURE GENETICS, 2001, 27 (02) :167-171
[8]   Revealing posttranscriptional regulatory elements through network-level conservation [J].
Chan, g S. Chan ;
Elemento, Olivier ;
Tavazoie, Saeed .
PLOS COMPUTATIONAL BIOLOGY, 2005, 1 (07) :564-578
[9]   COREGULATION OF PURINE AND HISTIDINE BIOSYNTHESIS BY THE TRANSCRIPTIONAL ACTIVATORS BAS1 AND BAS2 [J].
DAIGNANFORNIER, B ;
FINK, GR .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (15) :6746-6750
[10]   Fast and systematic genome-wide discovery of conserved regulatory elements using a non-alignment based approach [J].
Elemento, O ;
Tavazoie, S .
GENOME BIOLOGY, 2005, 6 (02)