Modeling wine preferences by data mining from physicochemical properties

被引:787
作者
Cortez, Paulo [1 ]
Cerdeira, Antonio [2 ]
Almeida, Fernando [2 ]
Matos, Telmo [2 ]
Reis, Jose [1 ,2 ]
机构
[1] Univ Minho, Dept Informat Syst, R&D Ctr Algoritmi, P-4800058 Guimaraes, Portugal
[2] CVRVV, P-4050501 Oporto, Portugal
关键词
Sensory preferences; Regression; Variable selection; Model selection; Support vector machines; Neural networks; SUPPORT VECTOR MACHINES; NEURAL-NETWORKS; CLASSIFICATION; DISCRIMINATION; PARAMETERS;
D O I
10.1016/j.dss.2009.05.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a data mining approach to predict human wine taste preferences that is based on easily available analytical tests at the certification step. A large dataset (when compared to other studies in this domain) is considered, with white and red vinho verde samples (from Portugal). Three regression techniques were applied, under a computationally efficient procedure that performs simultaneous variable and model selection. The support vector machine achieved promising results, Outperforming the multiple regression and neural network methods. Such model is useful to support the oenologist wine tasting evaluations and improve wine production. Furthermore, similar techniques can help in target marketing by modeling consumer tastes from niche markets. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:547 / 553
页数:7
相关论文
共 37 条
  • [11] Ebeler SE, 1999, FLAVOR CHEMISTRY, P409
  • [12] *FAO FAOSTAT, 2008, FOOD AGR ORG AGR TRA
  • [13] An optimization approach for scheduling wine grape harvest operations
    Ferrer, Juan-Carlos
    Mac Cawley, Alejandro
    Maturana, Sergio
    Toloza, Sergio
    Vera, Jorge
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2008, 112 (02) : 985 - 999
  • [14] Flexer A., 1996, P 13 EUROPEAN M CYBE, V2, P1005
  • [15] Classifier technology and the illusion of progress
    Hand, David J.
    [J]. STATISTICAL SCIENCE, 2006, 21 (01) : 1 - 14
  • [16] Hastie T., 2009, The elements of statistical learning: data mining, inference, and prediction, P9
  • [17] Modeling consumer situational choice of long distance communication with neural networks
    Hu, Michael Y.
    Shanker, Murali
    Zhang, G. Peter
    Hung, Ming S.
    [J]. DECISION SUPPORT SYSTEMS, 2008, 44 (04) : 899 - 908
  • [18] Credit rating analysis with support vector machines and neural networks: a market comparative study
    Huang, Z
    Chen, HC
    Hsu, CJ
    Chen, WH
    Wu, SS
    [J]. DECISION SUPPORT SYSTEMS, 2004, 37 (04) : 543 - 558
  • [19] Data strip mining for the virtual design of pharmaceuticals with neural networks
    Kewley, RH
    Embrechts, MJ
    Breneman, C
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (03): : 668 - 679
  • [20] A comparative assessment of classification methods
    Kiang, MY
    [J]. DECISION SUPPORT SYSTEMS, 2003, 35 (04) : 441 - 454