The role of occam's razor in knowledge discovery

被引:236
作者
Domingos, P [1 ]
机构
[1] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98195 USA
关键词
model selection; overfitting; multiple comparisons; comprehensible models; domain knowledge;
D O I
10.1023/A:1009868929893
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many KDD systems incorporate an implicit or explicit preference for simpler models, but this use of "Occam's razor" has been strongly criticized by several authors (e.g., Schaffer, 1993; Webb, 1996). This controversy arises partly because Occam's razor has been interpreted in two quite different ways. The first interpretation (simplicity is a goal in itself) is essentially correct, but is at heart a preference for more comprehensible models. The second interpretation (simplicity leads to greater accuracy) is much more problematic. A critical review of the theoretical arguments for and against it shows that it is unfounded as a universal principle, and demonstrably false. A review of empirical evidence shows that it also fails as a practical heuristic. This article argues that its continued use in KDD risks causing significant opportunities to be missed, and should therefore be restricted to the comparatively few applications where it is appropriate. The article proposes and reviews the use of domain constraints as an alternative for avoiding overfitting, and examines possible methods for handling the accuracy-comprehensibility trade-off.
引用
收藏
页码:409 / 425
页数:17
相关论文
共 102 条
  • [61] Knowledge-based learning in exploratory science: Learning rules to predict rodent carcinogenicity
    Lee, Y
    Buchanan, BG
    Aronis, JM
    [J]. MACHINE LEARNING, 1998, 30 (2-3) : 217 - 240
  • [62] A PRACTICAL BAYESIAN FRAMEWORK FOR BACKPROPAGATION NETWORKS
    MACKAY, DJC
    [J]. NEURAL COMPUTATION, 1992, 4 (03) : 448 - 472
  • [63] Maclin R, 1996, MACH LEARN, V22, P251, DOI 10.1007/BF00114730
  • [64] MACLIN R, 1997, P 14 NAT C ART INT P
  • [65] Meo R, 1996, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, P122
  • [66] Impacts on agriculture following the 1991 eruption of Vulcan Hudson, Patagonia: lessons for recovery
    Wilson, Thomas
    Cole, Jim
    Cronin, Shane
    Stewart, Carol
    Johnston, David
    [J]. NATURAL HAZARDS, 2011, 57 (02) : 185 - 212
  • [67] Mingers J., 1989, Machine Learning, V4, P227, DOI 10.1023/A:1022604100933
  • [68] Mitchell, 1980, CBMTR117 RUTG U
  • [69] Murphy M A, 1994, J Clin Neurosci, V1, P33, DOI 10.1016/0967-5868(94)90066-3
  • [70] MURTHY S, 1995, P 14 INT JOINT C ART, P1025