Analysis and visualization of gene expression data using Self-Organizing Maps

被引:106
作者
Nikkilä, J
Törönen, P
Kaski, S
Venna, J
Castrén, E
Wong, G
机构
[1] Aalto Univ, Neural Networks Res Ctr, Espoo 02015, Finland
[2] Univ Kuopio, AI Virtanen Inst, FIN-70211 Kuopio, Finland
关键词
clustering; exploratory data analysis; gene expression; information visualization; Self-Organizing Map;
D O I
10.1016/S0893-6080(02)00070-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cluster structure of gene expression data obtained from DNA microarrays is analyzed and visualized with the Self-Organizing Map (SOM) algorithm. The SOM forms a non-linear mapping of the data to a two-dimensional map grid that can be used as an exploratory data analysis tool for generating hypotheses on the relationships, and ultimately of the function of the genes. Similarity relationships within the data and cluster structures can be visualized and interpreted. The methods are demonstrated by computing a SOM of yeast genes. The relationships of known functional classes of genes are investigated by analyzing their distribution on the SOM, the cluster structure is visualized by the U-matrix method, and the clusters are characterized in terms of the properties of the expression profiles of the genes. Finally, it is shown that the SOM visualizes the similarity of genes in a more trustworthy way than two alternative methods, multidimensional scaling and hierarchical clustering. (C) 2002 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:953 / 966
页数:14
相关论文
共 32 条
[1]  
[Anonymous], 1952, Psychometrika
[2]   MEAN SHIFT, MODE SEEKING, AND CLUSTERING [J].
CHENG, YZ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (08) :790-799
[3]   The transcriptional program of sporulation in budding yeast [J].
Chu, S ;
DeRisi, J ;
Eisen, M ;
Mulholland, J ;
Botstein, D ;
Brown, PO ;
Herskowitz, I .
SCIENCE, 1998, 282 (5389) :699-705
[4]   First InP/InGaAs PNPHBT grown by metal organic chemical vapor deposition [J].
Cui, DL ;
Hsu, S ;
Pavlidis, D .
2001 INTERNATIONAL CONFERENCE ON INDIUM PHOSPHIDE AND RELATED MATERIALS, CONFERENCE PROCEEDINGS, 2001, :224-227
[5]   Exploring the metabolic and genetic control of gene expression on a genomic scale [J].
DeRisi, JL ;
Iyer, VR ;
Brown, PO .
SCIENCE, 1997, 278 (5338) :680-686
[6]   Delineation of prognostic biomarkers in prostate cancer [J].
Dhanasekaran, SM ;
Barrette, TR ;
Ghosh, D ;
Shah, R ;
Varambally, S ;
Kurachi, K ;
Pienta, KJ ;
Rubin, MA ;
Chinnaiyan, AM .
NATURE, 2001, 412 (6849) :822-826
[7]   Cluster analysis and display of genome-wide expression patterns [J].
Eisen, MB ;
Spellman, PT ;
Brown, PO ;
Botstein, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868
[8]   Genomic expression programs in the response of yeast cells to environmental changes [J].
Gasch, AP ;
Spellman, PT ;
Kao, CM ;
Carmel-Harel, O ;
Eisen, MB ;
Storz, G ;
Botstein, D ;
Brown, PO .
MOLECULAR BIOLOGY OF THE CELL, 2000, 11 (12) :4241-4257
[9]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537
[10]  
Jain K, 1988, Algorithms for clustering data