Functional bioinformatics of microarray data: From expression to regulation

被引:27
作者
Moreau, Y [1 ]
De Smet, F [1 ]
Thijs, G [1 ]
Marchal, K [1 ]
De Moor, B [1 ]
机构
[1] Katholieke Univ Leuven, Dept Elect Engn, Louvain, Belgium
关键词
adaptive quality-based clustering; clustering; Gibbs sampling; microarray; motif finding; regulation;
D O I
10.1109/JPROC.2002.804681
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Using microarrays is a powerful technique to monitor the expression of thousands of genes in a single experiment. From series of such experiments, it is possible to identify the mechanisms that govern the activation of genes in an organism. Short deoxyribonucleic acid patterns (called binding sites) near the genes serve as switches that control gene expression. As a result similar patterns of expression can correspond to similar binding site patterns. Here we integrate clustering of coexpressed genes with the discovery of binding motifs. We overview several important clustering techniques and present a clustering algorithm (called adaptive quality-based clustering), which we have developed to address several shortcomings of existing methods. We overview the different techniques for motif finding, in particular the technique of Gibbs sampling, and we present several extensions of this technique in our Motif Sampler Finally, we present an integrated web tool called INCLUSive (available online at http://www.esat.kuleuven.ac.belsimilar todna/BioI/Software.html) that allows the easy analysis of microarray data for motif finding.
引用
收藏
页码:1722 / 1743
页数:22
相关论文
共 61 条
[41]  
Rousseeuw P.J., 1990, Finding groups in data: An introduction to cluster analysis, V1
[42]   ESTIMATING DIMENSION OF A MODEL [J].
SCHWARZ, G .
ANNALS OF STATISTICS, 1978, 6 (02) :461-464
[43]   Analysis of large-scale gene expression data [J].
Sherlock, G .
CURRENT OPINION IN IMMUNOLOGY, 2000, 12 (02) :201-205
[44]   Interpreting patterns of gene expression with self-organizing maps: Methods and application to hematopoietic differentiation [J].
Tamayo, P ;
Slonim, D ;
Mesirov, J ;
Zhu, Q ;
Kitareewan, S ;
Dmitrovsky, E ;
Lander, ES ;
Golub, TR .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (06) :2907-2912
[45]   THE CALCULATION OF POSTERIOR DISTRIBUTIONS BY DATA AUGMENTATION [J].
TANNER, MA ;
WING, HW .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1987, 82 (398) :528-540
[46]   Systematic determination of genetic network architecture [J].
Tavazoie, S ;
Hughes, JD ;
Campbell, MJ ;
Cho, RJ ;
Church, GM .
NATURE GENETICS, 1999, 22 (03) :281-285
[47]   INCLUSive: INtegrated clustering, upstream of sequence retrieval and motif sampling [J].
Thijs, G ;
Moreau, Y ;
De Smet, F ;
Mathys, J ;
Lescot, M ;
Rombauts, S ;
Rouze, P ;
De Moor, B ;
Marchal, K .
BIOINFORMATICS, 2002, 18 (02) :331-332
[48]   A Gibbs sampling method to detect overrepresented motifs in the upstream regions of coexpressed genes [J].
Thijs, G ;
Marchal, K ;
Lescot, M ;
Rombauts, S ;
De Moor, B ;
Rouzé, P ;
Moreau, Y .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (02) :447-464
[49]   A higher-order background model improves the detection of promoter regulatory elements by Gibbs sampling [J].
Thijs, G ;
Lescot, M ;
Marchal, K ;
Rombauts, S ;
De Moor, B ;
Rouzé, P ;
Moreau, Y .
BIOINFORMATICS, 2001, 17 (12) :1113-1122
[50]  
Tompa M, 1999, Proc Int Conf Intell Syst Mol Biol, P262