ICASSO: Software for investigating the reliability of ICA estimates by clustering and visualization

Cited by: 207
Authors
Himberg, J [1]
Hyvärinen, A [1]
Affiliations
[1] Aalto Univ, Neural Networks Res Ctr, Helsinki 02015, Finland
Source
2003 IEEE XIII WORKSHOP ON NEURAL NETWORKS FOR SIGNAL PROCESSING - NNSP'03 | 2003
DOI
10.1109/NNSP.2003.1318025
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A major problem in the application of independent component analysis (ICA) is that the reliability of the estimated independent components is not known. First, the finite sample size induces statistical errors in the estimation. Second, because real data never exactly follow the ICA model, the contrast function used in the estimation may have many local minima that are all equally good, or the practical algorithm may not always perform properly, for example by getting stuck in local minima with strongly suboptimal values of the contrast function. We present an explorative visualization method for investigating the relations between estimates from FastICA. The algorithmic and statistical reliability are investigated by running the algorithm many times with different initial values or with differently bootstrapped data sets, respectively. The resulting estimates are compared by visualizing their clustering according to a suitable similarity measure. Reliable estimates correspond to tight clusters, and unreliable ones to points that do not belong to any such cluster. We have developed a software package called Icasso to implement these operations. We also present results of applying Icasso to biomedical data.
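The clustering idea in the abstract can be illustrated with a minimal Python sketch. It is not the Icasso package and does not run FastICA. The function `simulate_runs` is a hypothetical stand-in that fabricates "estimates" from repeated runs: two stable directions with random sign and small noise, plus one unstable random direction per run. The abstract only asks for "a suitable similarity measure"; absolute cosine similarity, average-linkage merging, and the quality score below (mean similarity inside a cluster minus mean similarity to everything else, with singletons scored as having no internal support) are assumptions chosen for this illustration.

```python
import math
import random


def abs_cosine(u, v):
    """Absolute cosine similarity; the absolute value ignores the sign ambiguity of ICA."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return abs(dot) / (norm_u * norm_v)


def similarity_matrix(estimates):
    """Pairwise similarities between all estimates pooled over all runs."""
    return [[abs_cosine(u, v) for v in estimates] for u in estimates]


def mean(values):
    values = list(values)
    return sum(values) / len(values)


def average_linkage(sim, n_clusters):
    """Agglomerative clustering: repeatedly merge the pair with the highest mean similarity."""
    clusters = [[i] for i in range(len(sim))]
    while len(clusters) > n_clusters:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                link = mean(sim[i][j] for i in clusters[a] for j in clusters[b])
                if best is None or link > best[0]:
                    best = (link, a, b)
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return clusters


def cluster_quality(sim, cluster):
    """Assumed score: mean similarity within the cluster minus mean similarity to the rest."""
    pairs = [sim[i][j] for i in cluster for j in cluster if i < j]
    inside = mean(pairs) if pairs else 0.0  # a singleton has no support from other runs
    outside = [j for j in range(len(sim)) if j not in cluster]
    if not outside:
        return inside
    return inside - mean(sim[i][j] for i in cluster for j in outside)


def simulate_runs(n_runs=10, noise=0.02, seed=0):
    """Hypothetical stand-in for repeated FastICA runs (synthetic data, not real ICA output)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_runs):
        for axis in (0, 1):  # two stable directions, recovered with random sign
            vec = [rng.gauss(0.0, noise) for _ in range(5)]
            vec[axis] += rng.choice((-1.0, 1.0))
            estimates.append(vec)
        # one unstable direction that changes from run to run
        estimates.append([0.0, 0.0] + [rng.gauss(0.0, 1.0) for _ in range(3)])
    return estimates


if __name__ == "__main__":
    sim = similarity_matrix(simulate_runs())
    for cluster in average_linkage(sim, n_clusters=3):
        print(len(cluster), round(cluster_quality(sim, cluster), 3))
```

In this sketch, a score near 1 marks a tight, well-separated cluster of the kind the abstract calls reliable, while scattered estimates score much lower. With real data, the rows passed to `similarity_matrix` would come from repeated runs of an actual ICA implementation with different initial values or bootstrap resamples.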
Pages: 259-268
Page count: 10