Verifying the proximity and size hypothesis for self-organizing maps

Cited by: 27
Authors
Lin, CT [1]
Chen, HC [1]
Nunamaker, JF [1]
Affiliations
[1] Univ Arizona, Artificial Intelligence Lab, Tucson, AZ 85721 USA
Keywords
document clustering techniques; experimental research; group support systems; self-organizing maps; unsupervised learning algorithms
DOI
10.1080/07421222.1999.11518256
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The Kohonen Self-Organizing Map (SOM) is an unsupervised learning technique for summarizing high-dimensional data so that similar inputs are, in general, mapped close to one another. When applied to textual data, SOM has been shown to be able to group together related concepts in a data collection and to present major topics within the collection with larger regions. This article presents research in which we sought to validate these properties of SOM, called the Proximity and Size Hypotheses, through a user evaluation study. Building upon our previous research in automatic concept generation and classification, we demonstrated that the Kohonen SOM was able to perform concept clustering effectively, based on its concept precision and recall scores as judged by human experts. We also demonstrated a positive relationship between the size of an SOM region and the number of documents contained in the region. We believe this research has established the Kohonen SOM algorithm as an intuitively appealing and promising neural-network-based textual classification technique for addressing part of the longstanding "information overload" problem.
Pages: 57-70
Page count: 14
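
The two properties named in the abstract (similar inputs land near one another on the map, and larger topics occupy more of it) can be illustrated with a minimal sketch of a Kohonen-style SOM. This is not the authors' implementation: the function names, the toy two-dimensional vectors standing in for documents, the Gaussian neighborhood, and every hyperparameter below are assumptions chosen only for illustration.

```python
import math
import random
from collections import Counter


def best_matching_unit(weights, x):
    """Return the grid coordinate whose weight vector is closest to x."""
    return min(weights, key=lambda node: math.dist(weights[node], x))


def train_som(data, rows, cols, epochs=50, lr0=0.5, sigma0=None, seed=0):
    """Train a small rectangular SOM; returns {(row, col): weight vector}.

    All defaults are illustrative assumptions, not values from the paper.
    """
    rng = random.Random(seed)
    dim = len(data[0])
    sigma0 = sigma0 if sigma0 is not None else max(rows, cols) / 2.0
    weights = {(r, c): [rng.random() for _ in range(dim)]
               for r in range(rows) for c in range(cols)}
    for epoch in range(epochs):
        # Learning rate and neighborhood radius both shrink over time.
        frac = 1.0 - epoch / epochs
        lr = lr0 * frac
        sigma = max(sigma0 * frac, 0.5)
        for x in rng.sample(data, len(data)):
            bmu = best_matching_unit(weights, x)
            for node, w in weights.items():
                # Gaussian neighborhood on the grid, centered on the BMU.
                d2 = (node[0] - bmu[0]) ** 2 + (node[1] - bmu[1]) ** 2
                h = math.exp(-d2 / (2.0 * sigma ** 2))
                for i in range(dim):
                    w[i] += lr * h * (x[i] - w[i])
    return weights


def documents_per_node(weights, data):
    """Count how many input vectors land on each map node."""
    return Counter(best_matching_unit(weights, x) for x in data)


if __name__ == "__main__":
    rng = random.Random(1)
    # Hypothetical "topics": a large cluster and a small one.
    topic_a = [[rng.gauss(0.2, 0.03), rng.gauss(0.2, 0.03)] for _ in range(30)]
    topic_b = [[rng.gauss(0.8, 0.03), rng.gauss(0.8, 0.03)] for _ in range(10)]
    som = train_som(topic_a + topic_b, rows=4, cols=4)
    print(documents_per_node(som, topic_a + topic_b))
```

Printing the per-node counts gives a rough way to inspect how many nodes each toy topic occupies relative to how many vectors it contains. The study itself evaluated these properties with human judges on textual data, which this sketch does not attempt to reproduce.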