A fuzzy approach to classification of text documents

被引:12
作者
Liu, WY [1 ]
Song, N
机构
[1] Yunnan Univ, Dept Comp Sci, Kunming 650091, Peoples R China
[2] Chinese Acad Sci, Comp Technol Inst, Key Lab Intelligent Informat Proc, Beijing 100080, Peoples R China
[3] Kunming Univ Sci & Technol, Dept Met, Kunming 650093, Peoples R China
基金
中国国家自然科学基金;
关键词
text document classification; fuzzy approach; semantic association;
D O I
10.1007/BF02947124
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper discusses the classification problems of text documents. Based on the concept of the proximity degree, the set of words is partitioned into some equivalence classes. Particularly, the concepts of the semantic field and association degree are given in this paper. Based on the above concepts, this paper presents a fuzzy classification approach for document categorization. Furthermore, applying the concept of the entropy of information, the approaches to select key words from the set of words covering the classification of documents and to construct the hierarchical structure of key words are obtained.
引用
收藏
页码:640 / 647
页数:8
相关论文
共 27 条
  • [1] [Anonymous], CSTR49595 PRINC U
  • [2] [Anonymous], 1995, ICML
  • [3] Cheng HF, 1997, J FOOD DRUG ANAL, V5, P41
  • [4] CHIARAMELLA Y, 8134 ESPRIT BRA U GL
  • [5] COHEN WW, 1995, P 5 INT WORKSH IND L, P3
  • [6] FALOUTSOS C, 1995, CSTR3541 U MAR
  • [7] FUHI N, 1991, ACM T INFORM SYST, V9, P223
  • [8] JARJAN RE, 1984, J ACM, V31, P245
  • [9] KLIR G, 2000, FUZZY SETS OVERVIEW
  • [10] KOLDA TG, 1991, ACM T INFORMATION SY, V9, P223