Scaling up inductive logic programming by learning from interpretations

被引:59
作者
Blockeel, H [1 ]
de Raedt, L [1 ]
Jacobs, N [1 ]
Demoen, B [1 ]
机构
[1] Katholieke Univ Leuven, Dept Comp Sci, B-3001 Heverlee, Belgium
关键词
inductive logic programming; knowledge discovery in databases; first order decision trees; learning from interpretations;
D O I
10.1023/A:1009867806624
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When comparing inductive logic programming (ILP) and attribute-value learning techniques, there is a trade-off between expressive power and efficiency. Inductive logic programming techniques are typically more expressive but also less efficient. Therefore, the data sets handled by current inductive logic programming systems are small according to general standards within the data mining community. The main source of inefficiency lies in the assumption that several examples may be related to each other, so they cannot be handled independently. Within the learning from interpretations framework for inductive logic programming this assumption is unnecessary, which allows to scale up existing ILP algorithms. In this paper we explain this learning setting in the context of relational databases. We relate the setting to propositional data mining and to the classical ILP setting, and show that learning from interpretations corresponds to learning from multiple relations and thus extends the expressiveness of propositional learning, while maintaining its efficiency to a large extent (which is not the case in the classical ILP setting). As a case study, we present two alternative implementations of the ILP system TILDE (Top-down Induction of Logical DEcision trees): TILDEclassic, which loads all data in main memory, and TILDELDS, which loads the examples one by one. We experimentally compare the implementations, showing TILDELDS can handle large data sets (in the order of 100,000 examples or 100 MB) and indeed scales up linearly in the number of examples.
引用
收藏
页码:59 / 93
页数:35
相关论文
共 48 条
[1]  
Agrawal R., 1996, Advances in Knowledge Discovery and Data Mining, P307
[2]  
[Anonymous], P 15 INT C MACH LEAR
[3]  
[Anonymous], 1993, P 13 INT JOINT C ART
[4]   Top-down induction of first-order logical decision trees [J].
Blockeel, H ;
De Raedt, L .
ARTIFICIAL INTELLIGENCE, 1998, 101 (1-2) :285-297
[5]  
BLOCKEEL H, 1997, LECT NOTES ARTIF INT, V1297, P77
[6]  
Bongard M. M., 1970, PATTERN RECOGNITION
[7]   APPLICATIONS OF INDUCTIVE LOGIC PROGRAMMING [J].
BRATKO, I ;
MUGGLETON, S .
COMMUNICATIONS OF THE ACM, 1995, 38 (11) :65-70
[8]  
BRATKO I, 1990, PROLOG PROGRAMMING A
[9]  
Cohen W. W., 1995, Journal of Artificial Intelligence Research, V2, P541
[10]  
COHEN WW, NEW GEN COMPUTING, V13, P95