Using RSAT oligo-analysis and dyad-analysis tools to discover regulatory signals in nucleic sequences

被引:40
作者
Defrance, Matthieu [1 ]
Janky, Rekin's [1 ]
Sand, Olivier [1 ]
van Helden, Jacques [1 ]
机构
[1] Univ Libre Bruxelles, Lab Bioinformat Genomes & Reseaux BiGRe, B-1050 Brussels, Belgium
关键词
D O I
10.1038/nprot.2008.98
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
This protocol explains how to discover functional signals in genomic sequences by detecting over-or under-represented oligonucleotides (words) or spaced pairs thereof (dyads) with the Regulatory Sequence Analysis Tools (http://rsat.ulb.ac.be/rsat/). Two typical applications are presented: (i) predicting transcription factor-binding motifs in promoters of coregulated genes and (ii) discovering phylogenetic footprints in promoters of orthologous genes. The steps of this protocol include purging genomic sequences to discard redundant fragments, discovering over-represented patterns and assembling them to obtain degenerate motifs, scanning sequences and drawing feature maps. The main strength of the method is its statistical ground: the binomial significance provides an efficient control on the rate of false positives. In contrast with optimization-based pattern discovery algorithms, the method supports the detection of under-as well as over-represented motifs. Computation times vary from seconds (gene clusters) to minutes (whole genomes). The execution of the whole protocol should take similar to 1 h.
引用
收藏
页码:1589 / 1603
页数:15
相关论文
共 65 条
  • [1] Fine-Tuning Enhancer Models to Predict Transcriptional Targets across Multiple Genomes
    Aerts, Stein
    van Helden, Jacques
    Sand, Olivier
    Hassan, Bassem A.
    [J]. PLOS ONE, 2007, 2 (11):
  • [2] Bailey T L, 1995, Proc Int Conf Intell Syst Mol Biol, V3, P21
  • [3] Bailey TL., 1994, P 2 INT C INT SYST M, V2, P28
  • [4] PHOSPHORYLATION OF BACILLUS-SUBTILIS TRANSCRIPTION FACTOR SPOOA STIMULATES TRANSCRIPTION FROM THE SPOIIG PROMOTER BY ENHANCING BINDING TO WEAK OA BOXES
    BALDUS, JM
    GREEN, BD
    YOUNGMAN, P
    MORAN, CP
    [J]. JOURNAL OF BACTERIOLOGY, 1994, 176 (02) : 296 - 306
  • [5] Algorithms for phylogenetic footprinting
    Blanchette, M
    Schwikowski, B
    Tompa, M
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (02) : 211 - 223
  • [6] Blanchette M, 2000, Proc Int Conf Intell Syst Mol Biol, V8, P37
  • [7] Gene expression data analysis
    Brazma, A
    Vilo, J
    [J]. FEBS LETTERS, 2000, 480 (01) : 17 - 24
  • [8] Approaches to the automatic discovery of patterns in biosequences
    Brazma, A
    Jonassen, I
    Eidhammer, I
    Gilbert, D
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 1998, 5 (02) : 279 - 305
  • [9] Predicting gene regulatory elements in silico on a genomic scale
    Brazma, A
    Jonassen, I
    Vilo, J
    Ukkonen, E
    [J]. GENOME RESEARCH, 1998, 8 (11) : 1202 - 1215
  • [10] Brazma A, 1997, ISMB-97 - FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS FOR MOLECULAR BIOLOGY, PROCEEDINGS, P65