Unsupervised pattern recognition: An introduction to the whys and wherefores of clustering microarray data

Cited by: 74
Authors
Boutros, PC
Okey, AB
Affiliations
[1] Univ Toronto, Dept Pharmacol, Toronto, ON M5S 1A8, Canada
[2] Univ Toronto, Dept Phys Med, Toronto, ON M5S 1A8, Canada
Keywords
pattern recognition; clustering; microarray; systems biology; data integration; co-regulation
DOI
10.1093/bib/6.4.331
Chinese Library Classification
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Clustering has become an integral part of microarray data analysis and interpretation. The algorithmic basis of clustering - the application of unsupervised machine-learning techniques to identify the patterns inherent in a data set - is well established. This review discusses the biological motivations for and applications of these techniques to integrating gene expression data with other biological information, such as functional annotation, promoter data and proteomic data.
Pages: 331-343
Page count: 13