Mining of Frequent Itemsets from Streams of Uncertain Data

被引:47
作者
Leung, Carson Kai-Sang [1 ]
Hao, Boyu [1 ]
机构
[1] Univ Manitoba, Dept Comp Sci, Winnipeg, MB R3T 2N2, Canada
来源
ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3 | 2009年
关键词
D O I
10.1109/ICDE.2009.157
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Frequent itemset mining plays an essential role in the mining of various patterns and is in demand in many real-life applications. Hence, the mining of frequent itemsets has been the subject of numerous studies since its introduction. Generally, most of these studies find frequent itemsets from traditional transaction databases, in which the contents of each transaction-namely, items-are definitely known and precise. However, there are many real-life situations in which ones are uncertain about the contents of transactions. This calls for the mining of uncertain data. Moreover, due to advances in technology, a flood of precise or uncertain data can be produced in many situations. This calls for the mining of data streams. To deal with these situations, we propose two tree-based mining algorithms to efficiently find frequent itemsets from streams of uncertain data, where each item in the transactions in the streams is associated with an existential probability. Experimental results show the effectiveness of our algorithms in mining frequent itemsets from streams of uncertain data.
引用
收藏
页码:1663 / 1670
页数:8
相关论文
共 21 条
[1]  
Agrawal R., P ACM SIGMOD 1993, P207
[2]  
AGRAWAL R, P VLDB 1994, P487
[3]  
CHENG R, P IEEE ICDE 2008, P973
[4]  
Chui C.-K., P PAKDD 2007, P47
[5]  
CHUI CK, P PAKDD 2008, P64
[6]  
Cormode G., P VLDB 2008, P1530
[7]  
Dai X., P SSTD 2005, P400
[8]  
Gaber MM, 2005, SIGMOD REC, V34, P18, DOI 10.1145/1083784.1083789
[9]  
Giannella C, 2004, DATA MINING NEXT GEN
[10]  
Han J., P ACM SIGMOD 2000, P1