A statistical information-based clustering approach in distance space

被引:10
作者
岳士弘
李平
郭继东
周水庚
机构
[1] Zhejiang University
[2] Hangzhou 310027
[3] Institute of Industrial Process Control
[4] Yining 835000
[5] Department of Mathematics
[6] Yili Teacher’s College
[7] China
关键词
DBSCAN; A statistical information-based clustering approach in distance space;
D O I
暂无
中图分类号
TP311.13 [];
学科分类号
1201 ;
摘要
Clustering, as a powerful data mining technique for discovering interesting data distributions and patterns in the nderlying database, is used in many fields, such as statistical data analysis, pattern recognition, image processing, and other usiness applications. Density-based Spatial Clustering of Applications with Noise (DBSCAN) (Ester et al., 1996) is a good erformance clustering method for dealing with spatial data although it leaves many problems to be solved. For example, BSCAN requires a necessary user-specified threshold while its computation is extremely time-consuming by current method uch as OPTICS, etc. (Ankerst et al., 1999), and the performance of DBSCAN under different norms has yet to be examined. In his paper, we first developed a method based on statistical information of distance space in database to determine the necessary hreshold. Then our examination of the DBSCAN performance under different norms showed that there was determinable relation etween them. Finally, we used two artificial databases to verify the effectiveness and efficiency of the proposed methods. ey words: DBSCAN algorithm, Statistical information, Threshold
引用
收藏
页码:72 / 79
页数:8
相关论文
共 1 条
  • [1] Using Greedy algorithm: DBSCAN revisited II[J] . Shi-hong Yue,Ping Li,Ji-dong Guo,Shui-geng Zhou.Journal of Zhejiang University Science . 2004 (11)