A framework for evaluating knowledge-based interestingness of association rules

被引:4
作者
Shekar B. [1 ]
Natarajan R. [1 ]
机构
[1] Quantit. Methods/Info. Systems Area, Indian Inst. of Management Bangalore
关键词
Association rules; Fuzzy taxonomy; Interestingness; Item-relatedness;
D O I
10.1023/B:FODM.0000022043.43885.55
中图分类号
学科分类号
摘要
In Knowledge Discovery in Databases (KDD)/Data Mining literature, "interestingness" measures are used to rank rules according to the "interest" a particular rule is expected to evoke. In this paper, we introduce an aspect of subjective interestingness called "item- relatedness". Relatedness is a consequence of relationships that exist between items in a domain. Association rules containing unrelated or weakly related items are interesting since the co-occurrence of such items is unexpected. 'Item-Relatedness' helps in ranking association rules on the basis of one kind of subjective unexpectedness. We identify three types of item-relatedness - captured in the structure of a "fuzzy taxonomy" (an extension of the classical concept hierarchy tree). An "item- relatedness" measure for describing relatedness between two items is developed by combining these three types. Efficacy of this measure is illustrated with the help of a sample taxonomy. We discuss three mechanisms for extending this measure from a two-item set to an association rule consisting of a set of more than two items. These mechanisms utilize the relatedness of item-pairs and other aspects of an association rule, namely its structure, distribution of items and item-pairs. We compare our approach with another method from recent literature.
引用
收藏
页码:157 / 185
页数:28
相关论文
共 25 条
[1]  
Adomavicius G., Tuzhilin A., Discovery of actionable patterns in databases: The action hierarchy approach, Proceedings of KDD -1997, (1997)
[2]  
Agirre E., Rigau G., Word sense disambiguation using conceptual density, Proceedings of COLING 1996, (1996)
[3]  
Basu S., Mooney R.J., Pasupuleti K.V., Ghosh J., Evaluating the novelty of text-mined rules using lexical knowledge, Proceedings of the Seventh International Conference on Knowledge Discovery and Data Mining (KDD 2001), (2001)
[4]  
Bayardo Jr. R.J., Agrawal R., Gunopulos D., Constraint-based rule mining in large, dense databases, Data Mining and Knowledge Discovery, 4, 2-3, pp. 217-240, (2000)
[5]  
Chen G., Wets G., Vanhoof K., Representation and discovery of fuzzy association rules (FARs), Institute of Applied Economic Research (ITEO), (2000)
[6]  
De Graaf J.M., Kosters W.A., Witteman J.J.W., Interesting fuzzy association rules in quantitative databases, Proceedings of P KDD 2001 (The 5th European Conference on Principles of Data Mining and Knowledge Discovery), pp. 140-151, (2001)
[7]  
De Graaf J.M., Kosters W.A., Witteman J.J.W., Interesting association rules in multiple taxonomies, Proceedings of BNAIC'00, Kaatsheuvel, pp. 93-100, (2000)
[8]  
Deo N., Graph Theory with Applications to Engineering and Computer Science, (1989)
[9]  
Dong G., Li J., Interestingness of discovered association rules in terms of neighborhood-based unexpectedness, Proceedings of the Second Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 72-86, (1998)
[10]  
Freitas A.A., On rule interestingness measures, Knowledge-Based Systems, 12, pp. 309-315, (1999)