A Survey of Uncertain Data Algorithms and Applications

Cited by: 258
Authors
Aggarwal, Charu C. [1]
Yu, Philip S. [2]
Affiliations
[1] IBM Corp, TJ Watson Res Ctr, Hawthorne, NY 10532 USA
[2] Univ Illinois, Dept Comp Sci, Chicago, IL 60607 USA
Keywords
Mining methods and algorithms; database applications; database management; information technology and systems; IMPRECISE;
DOI
10.1109/TKDE.2008.190
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In recent years, a number of indirect data collection methodologies have led to the proliferation of uncertain data. Such databases are much more complex because of the additional challenges of representing the probabilistic information. In this paper, we provide a survey of uncertain data mining and management applications. We will explore the various models utilized for uncertain data representation. In the field of uncertain data management, we will examine traditional database management methods such as join processing, query processing, selectivity estimation, OLAP queries, and indexing. In the field of uncertain data mining, we will examine traditional mining problems such as frequent pattern mining, outlier detection, classification, and clustering. We discuss different methodologies to process and mine uncertain data in a variety of forms.
Pages: 609-623
Page count: 15