k-plane clustering

Cited by: 317
Authors
Bradley, PS [1]
Mangasarian, OL [2]
Affiliations
[1] Microsoft Corp, Res, Redmond, WA 98052 USA
[2] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
Funding
National Science Foundation (USA)
Keywords
clustering; k-mean; linear regression;
DOI
10.1023/A:1008324625522
Chinese Library Classification
C93 [Management Science]; O22 [Operations Research];
Discipline classification code
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
Abstract
A finite new algorithm is proposed for clustering m given points in n-dimensional real space into k clusters by generating k planes that constitute a local solution to the nonconvex problem of minimizing the sum of squares of the 2-norm distances between each point and a nearest plane. The key to the algorithm lies in a formulation that generates a plane in n-dimensional space that minimizes the sum of the squares of the 2-norm distances to each of m_1 given points in the space. The plane is generated by an eigenvector corresponding to a smallest eigenvalue of an n × n simple matrix derived from the m_1 points. The algorithm was tested on the publicly available Wisconsin Breast Prognosis Cancer database to generate well-separated patient survival curves. In contrast, the k-mean algorithm did not generate such well-separated survival curves.
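The two steps described in the abstract (fit a plane to each cluster, then reassign each point to its nearest plane) can be sketched as follows. This is a minimal illustration, not the authors' implementation. It assumes the "simple matrix" is the centered scatter matrix of the cluster's points, which is the standard total-least-squares construction. The function names fit_plane and k_plane_cluster, the random initialization, the iteration cap, and the handling of empty clusters are all assumptions made for this example.

```python
import numpy as np


def fit_plane(points):
    """Fit the plane {x : w.x = gamma}, ||w|| = 1, minimizing the sum of
    squared 2-norm distances to the rows of `points` (shape m_1 x n).

    Assumption: the n x n matrix is the centered scatter matrix; the normal
    w is an eigenvector for its smallest eigenvalue.
    """
    center = points.mean(axis=0)
    centered = points - center
    scatter = centered.T @ centered            # n x n, symmetric
    _, eigvecs = np.linalg.eigh(scatter)       # eigenvalues in ascending order
    w = eigvecs[:, 0]                          # eigenvector of smallest eigenvalue
    gamma = float(w @ center)                  # plane passes through the centroid
    return w, gamma


def k_plane_cluster(points, k, n_iter=100, seed=0):
    """Alternate plane fitting and nearest-plane assignment until the
    labels stop changing (a local solution) or n_iter is reached."""
    rng = np.random.default_rng(seed)
    m = points.shape[0]
    labels = rng.integers(0, k, size=m)        # assumed: random initial assignment
    planes = []
    for _ in range(n_iter):
        planes = []
        for j in range(k):
            members = points[labels == j]
            if len(members) == 0:
                # assumed fallback: reseed an empty cluster with one random point
                members = points[rng.integers(0, m, size=1)]
            planes.append(fit_plane(members))
        normals = np.array([w for w, _ in planes])       # k x n
        offsets = np.array([g for _, g in planes])       # k
        dist = np.abs(points @ normals.T - offsets)      # m x k point-to-plane distances
        new_labels = dist.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
    return labels, planes
```

Because the objective is nonconvex, the result depends on the initial assignment; in practice one would likely run the sketch from several seeds and keep the assignment with the smallest total squared distance.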
Pages: 23-32
Page count: 10