Discovering simple rules in complex data: A meta-learning algorithm and some surprising musical discoveries

被引:47
作者
Widmer, G [1 ]
机构
[1] Univ Vienna, Dept Med Cybernet & Artificial Intelligence, Vienna, Austria
[2] Austrian Res Inst Artificial Intelligence, Vienna, Austria
基金
奥地利科学基金会;
关键词
machine learning; data mining; rule discovery; ensemble methods; meta-learning; partial models; expressive music performance;
D O I
10.1016/S0004-3702(03)00016-X
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article presents a new rule discovery algorithm named PLCG that can find simple, robust partial rule models (sets of classification rules) in complex data where it is difficult or impossible to find models that completely account for all the phenomena of interest. Technically speaking, PLCG is an ensemble learning method that learns multiple models via some standard rule learning algorithm, and then combines these into one final rule set via clustering, generalization, and heuristic rule selection. The algorithm was developed in the context of an interdisciplinary research project that aims at discovering fundamental principles of expressive music performance from large amounts of complex real-world data (specifically, measurements of actual performances by concert pianists). It will be shown that PLCG succeeds in finding some surprisingly simple and robust performance principles, some of which represent truly novel and musically meaningful discoveries. A set of more systematic experiments shows that PLCG usually discovers significantly simpler theories than more direct approaches to rule learning (including the state-of-the-art learning algorithm RIPPER), While striking a compromise between coverage and precision. The experiments also show how easy it is to use PLCG as a meta-learning strategy to explore different parts of the space of rule models. (C) 2003 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:129 / 148
页数:20
相关论文
共 32 条
[1]  
[Anonymous], 1998, INTELL DATA ANAL, DOI DOI 10.1016/S1088-467X(98)00023-7
[2]  
CAMBOUROPOULOS E, 2000, P AAAI 2000 WORKSH A, P19
[3]  
CAMBOUROPOULOS E, 2001, P 8 BRAZ S COMP MUS
[4]  
Cohen W. W., 1995, P 12 INT C MACH LEAR, P115, DOI DOI 10.1016/B978-1-55860-377-6.50023-2
[5]  
COHEN WW, 1993, IJCAI-93, VOLS 1 AND 2, P988
[6]   Ensemble methods in machine learning [J].
Dietterich, TG .
MULTIPLE CLASSIFIER SYSTEMS, 2000, 1857 :1-15
[7]   Automatic extraction of tempo and beat from expressive performances [J].
Dixon, S .
JOURNAL OF NEW MUSIC RESEARCH, 2001, 30 (01) :39-58
[8]  
Dixon S, 2000, FR ART INT, V54, P626
[9]  
Domingos P, 1996, MACH LEARN, V24, P141
[10]   Separate-and-conquer rule learning [J].
Fürnkranz, J .
ARTIFICIAL INTELLIGENCE REVIEW, 1999, 13 (01) :3-54