Data categorization using decision trellises

被引:7
作者
Frasconi, P
Gori, M
Soda, G
机构
[1] Univ Florence, Dipartimento Sistemi & Informat, I-50139 Florence, Italy
[2] Univ Siena, Dipartimento Ingn Informaz, I-53100 Siena, Italy
关键词
belief networks; classification; connectionist models; context specific independence; data mining; decision trees; machine learning;
D O I
10.1109/69.806931
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a probabilistic graphical model for supervised learning on databases with categorical attributes. The proposed belief network contains hidden variables that play a role similar to nodes in decision trees and each of their states either corresponds to a class label or to a single attribute test. As a major difference with respect to decision trees, the selection of the attribute to be tested is probabilistic. Thus, the model can be used to assess the probability that a tuple belongs to some class, given the predictive attributes. Unfolding the network along the hidden states dimension yields a trellis structure having a signal flow similar to second order connectionist networks. The network encodes context specific probabilistic independencies to reduce parametric complexity. We present a custom tailored inference algorithm and derive a teaming procedure based on the expectation-maximization algorithm. We propose decision trellises as an alternative to decision trees in the context of tuple categorization in databases, which is an important step for building data mining systems. Preliminary experiments on-standard machine learning databases are reported, comparing the classification accuracy of decision trellises and decision trees induced by C4.5. In particular, we show that the proposed model can offer significant advantages for sparse databases in which many predictive attributes are missing.
引用
收藏
页码:697 / 712
页数:16
相关论文
共 47 条
[1]  
[Anonymous], MACHINE LEARNING
[2]  
[Anonymous], 1996, 12 C UNCERTAINTY ART
[3]  
[Anonymous], 1993, P 13 INT JOINT C ART
[4]  
[Anonymous], DATA MINING SEARCH K
[5]  
BENGIO Y, 1996, IEEE T NEURAL NETWOR, V7, P231
[6]  
Bishop C. M., 1995, NEURAL NETWORKS PATT
[7]   A guide to the literature on learning probabilistic networks from data [J].
Buntine, W .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1996, 8 (02) :195-210
[8]   Operations for Learning with Graphical Models [J].
Buntine, Wray L. .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1994, 2 :159-225
[9]  
CHEESEMAN P, 1996, ADV KNOXLEDGE DISCOV, V1, P153
[10]   THE COMPUTATIONAL-COMPLEXITY OF PROBABILISTIC INFERENCE USING BAYESIAN BELIEF NETWORKS [J].
COOPER, GF .
ARTIFICIAL INTELLIGENCE, 1990, 42 (2-3) :393-405