Data mining and machine oriented modeling: A granular computing approach

被引:120
作者
Lin, TY [1 ]
机构
[1] San Jose State Univ, Dept Math & Comp Sci, San Jose, CA 95192 USA
[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley Initiat Soft Comp, Berkeley, CA 94720 USA
关键词
binary relation; data mining; granulation; machine-oriented modeling; nighborhood system; partition;
D O I
10.1023/A:1008384328214
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
From the processing point of view, data mining is machine derivation of interesting properties (to human) from the stored data. Hence, the notion of machine oriented data modeling is explored: An attribute value, in a relational model, is a meaningful label (a property) of a set of entities (granule). A model using these granules themselves as attribute values (their bit patterns or lists of members) is called a machine oriented data model. The model provides a good database compaction and data mining environment. For moderate size databases, finding association rules, decision rules, and etc., can be reduced to easy computation of set theoretical operations of granules. In the second part, these notions are extended to real world objects, where the universe is granulated (clustered) into granules by binary relations. Data modeling and mining with such additional semantics are formulated and investigated. In such models, data mining is essentially a machine "calculus" of granules-granular computing.
引用
收藏
页码:113 / 124
页数:12
相关论文
共 24 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
AGRAWAL R, 1994, P 20 VLDB C SAN CHIL
[3]  
[Anonymous], P 4 INT S METH INT S
[4]  
BAIRAMIAN S, 1989, THESIS CALIFORNIA ST
[5]  
Cai Y., 1991, Knowledge discovery in databases, P213
[6]  
CHU W, 1992, J INTELL INF SYST, V1, P355
[7]  
DATE CJ, 1996, INTRO DATABASE SYSTE
[8]  
Lin T.Y., 1998, Rough Sets, V1, P121
[9]  
Lin T.Y., 1988, P 1988 ACM 16 ANN CO, P725, DOI DOI 10.1145/322609.323183
[10]  
Lin T.Y., 1997, ADV MACHINE INTELLIG, P132