Computational identification of transcriptional regulatory elements in DNA sequence

被引:87
作者
GuhaThakurta, Debraj [1 ]
机构
[1] Rosetta Inpharmat LLC, Res Genet Div, Seattle, WA 98109 USA
关键词
D O I
10.1093/nar/gkl372
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Identification and annotation of all the functional elements in the genome, including genes and the regulatory sequences, is a fundamental challenge in genomics and computational biology. Since regulatory elements are frequently short and variable, their identification and discovery using computational algorithms is difficult. However, significant advances have been made in the computational methods for modeling and detection of DNA regulatory elements. The availability of complete genome sequence from multiple organisms, as well as mRNA profiling and high-throughput experimental methods for mapping protein-binding sites in DNA, have contributed to the development of methods that utilize these auxiliary data to inform the detection of transcriptional regulatory elements. Progress is also being made in the identification of cis-regulatory modules and higher order structures of the regulatory sequences, which is essential to the understanding of transcription regulation in the metazoan genomes. This article reviews the computational approaches for modeling and identification of genomic regulatory elements, with an emphasis on the recent developments, and current challenges.
引用
收藏
页码:3585 / 3598
页数:14
相关论文
共 188 条
[61]   Cross-species sequence comparisons: A review of methods and available resources [J].
Frazer, KA ;
Elnitski, L ;
Church, DM ;
Dubchak, I ;
Hardison, RC .
GENOME RESEARCH, 2003, 13 (01) :1-12
[62]   Cluster-Buster: finding dense clusters of motifs in DNA sequences [J].
Frith, MC ;
Li, MC ;
Weng, ZP .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3666-3668
[63]   Statistical significance of clusters of motifs represented by position specific scoring matrices in nucleotide sequences [J].
Frith, MC ;
Spouge, JL ;
Hansen, U ;
Weng, ZP .
NUCLEIC ACIDS RESEARCH, 2002, 30 (14) :3214-3224
[64]   Detection of cis-element clusters in higher eukaryotic DNA [J].
Frith, MC ;
Hansen, U ;
Weng, ZP .
BIOINFORMATICS, 2001, 17 (10) :878-889
[65]   RIGOROUS PATTERN-RECOGNITION METHODS FOR DNA-SEQUENCES - ANALYSIS OF PROMOTER SEQUENCES FROM ESCHERICHIA-COLI [J].
GALAS, DJ ;
EGGERT, M ;
WATERMAN, MS .
JOURNAL OF MOLECULAR BIOLOGY, 1985, 186 (01) :117-128
[66]  
Gelfand M S, 2000, Brief Bioinform, V1, P357, DOI 10.1093/bib/1.4.357
[67]   Prediction of transcription regulatory sites in Archaea by a comparative genomic approach [J].
Gelfand, MS ;
Koonin, EV ;
Mironov, AA .
NUCLEIC ACIDS RESEARCH, 2000, 28 (03) :695-705
[68]   Computational technique for improvement of the position-weight matrices for the DNA/protein binding sites [J].
Gershenzon, NI ;
Stormo, GD ;
Ioshikhes, IP .
NUCLEIC ACIDS RESEARCH, 2005, 33 (07) :2290-2301
[69]   From oligonucleotide shapes to genomic SELEX: Novel biological regulatory loops [J].
Gold, L ;
Brown, D ;
He, YY ;
Shtatland, T ;
Singer, BS ;
Wu, Y .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (01) :59-64
[70]   Novel transcription regulatory elements in Caenorhabditis elegans muscle genes [J].
GuhaThakurta, D ;
Schriefer, LA ;
Waterston, RH ;
Stormo, GD .
GENOME RESEARCH, 2004, 14 (12) :2457-2468