文本聚类研究综述

被引：15

作者：

曹晓

机构：

[1] 福州大学经济与管理学院

来源：

情报探索 | 2016年 / 01期

关键词：

文本聚类; 本体; 评价指标;

D O I：

暂无

中图分类号：

TP391.1 [文字信息处理];

学科分类号：

081203 ; 0835 ;

摘要：

[目的 /意义]文本聚类技术是提高搜索引擎性能的有效方法,是对文本信息进行组织的有效手段。[方法 /过程]介绍了文本聚类的研究背景和研究内容,总结了引入本体技术的文本聚类研究,分析了文本聚类结果评价的几种指标,并对文本聚类的方法和结果评价进行了综述。[结果 /结论]文本聚类的应用领域将不断扩大,文本聚类技术将成为人工智能的一个重要研究课题。

引用

页码：131 / 134

页数：4

共 29 条

[21] Exploiting noun phrases and semantic relationships for text document clustering [J].

Zheng, Hai-Tao ;

Kang, Bo-Yeong ;

Kim, Hong-Gee .

INFORMATION SCIENCES, 2009, 179 (13) :2249-2262

[22]

Mining fuzzy frequent itemsets for hierarchical document clustering[J] . Chun-Ling Chen,Frank S.C. Tseng,Tyne Liang.Information Processing and Management . 2009 (2)

[23] Text document clustering based on neighbors [J].

Luo, Congnan ;

Li, Yanjun ;

Chung, Soon M. .

DATA & KNOWLEDGE ENGINEERING, 2009, 68 (11) :1271-1288

[24]

Dynamic hierarchical algorithms for document clustering[J] . Reynaldo Gil-García,Aurora Pons-Porrata.Pattern Recognition Letters . 2009 (6)

[25]

A document clustering algorithm for discovering and describing topics[J] . Henry Anaya-Sánchez,Aurora Pons-Porrata,Rafael Berlanga-Llavori.Pattern Recognition Letters . 2009 (6)

[26]

Genetic algorithm for text clustering using ontology and evaluating the validity of various semantic similarity measures[J] . Wei Song,Cheng Hua Li,Soon Cheol Park.Expert Systems With Applications . 2008 (5)

[27]

Discovering significant OPSM subspace clusters in massive gene expression data .2 Gao BJ,Griffith OL,Ester M,Jones S J. Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining . 2006

[28]

基于领域本体的SOM文本逐层聚类方法[J]. 朱恒民,马静,黄卫东.情报学报. 2008 (06)

[29]

Text document clustering based on frequent word meaning sequences[J] . Yanjun Li,Soon M. Chung,John D. Holt.Data & Knowledge Engineering . 2007 (1)

← 1 2 3 →