The maximal data piling direction for discrimination

被引:50
作者
Ahn, Jeongyoun [1 ]
Marron, J. S. [2 ]
机构
[1] Univ Georgia, Dept Stat, Athens, GA 30602 USA
[2] Univ N Carolina, Dept Stat & Operat Res, Chapel Hill, NC 27599 USA
基金
美国国家科学基金会;
关键词
Classification; Fisher's linear discrimination; High dimension; low sample size; Maximal data piling; Support vector machine; GEOMETRIC REPRESENTATION; HIGH-DIMENSION;
D O I
10.1093/biomet/asp084
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We study a discriminant direction vector that generally exists only in high-dimension, low sample size settings. Projections of data onto this direction vector take on only two distinct values, one for each class. There exist infinitely many such directions in the subspace generated by the data; but the maximal data piling vector has the longest distance between the projections. This paper investigates mathematical properties and classification performance of this discrimination method.
引用
收藏
页码:254 / 259
页数:6
相关论文
共 11 条
[11]   Hierarchical clustering algorithms for document datasets [J].
Zhao, Y ;
Karypis, G .
DATA MINING AND KNOWLEDGE DISCOVERY, 2005, 10 (02) :141-168