Modeling wine preferences by data mining from physicochemical properties

被引:787
作者
Cortez, Paulo [1 ]
Cerdeira, Antonio [2 ]
Almeida, Fernando [2 ]
Matos, Telmo [2 ]
Reis, Jose [1 ,2 ]
机构
[1] Univ Minho, Dept Informat Syst, R&D Ctr Algoritmi, P-4800058 Guimaraes, Portugal
[2] CVRVV, P-4050501 Oporto, Portugal
关键词
Sensory preferences; Regression; Variable selection; Model selection; Support vector machines; Neural networks; SUPPORT VECTOR MACHINES; NEURAL-NETWORKS; CLASSIFICATION; DISCRIMINATION; PARAMETERS;
D O I
10.1016/j.dss.2009.05.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a data mining approach to predict human wine taste preferences that is based on easily available analytical tests at the certification step. A large dataset (when compared to other studies in this domain) is considered, with white and red vinho verde samples (from Portugal). Three regression techniques were applied, under a computationally efficient procedure that performs simultaneous variable and model selection. The support vector machine achieved promising results, Outperforming the multiple regression and neural network methods. Such model is useful to support the oenologist wine tasting evaluations and improve wine production. Furthermore, similar techniques can help in target marketing by modeling consumer tastes from niche markets. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:547 / 553
页数:7
相关论文
共 37 条
  • [1] [Anonymous], Journal of machine learning research
  • [2] Asuncion Arthur, 2007, Uci machine learning repository
  • [3] Bi J., 2003, P 20 INT C MACH LEAR, P43
  • [4] Bishop Christopher M, 1995, Neural networks for pattern recognition
  • [5] Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
  • [6] Practical selection of SVM parameters and noise estimation for SVM regression
    Cherkassky, V
    Ma, YQ
    [J]. NEURAL NETWORKS, 2004, 17 (01) : 113 - 126
  • [7] CORTEZ P, INTRO ADV S IN PRESS
  • [8] Lamb meat quality assessment by support vector machines
    Cortez, Paulo
    Portelinha, Manuel
    Rodrigues, Sandra
    Cadavez, Vasco
    Teixeira, Alfredo
    [J]. NEURAL PROCESSING LETTERS, 2006, 24 (01) : 41 - 51
  • [9] *CVRVV, 2008, PORT WIN VINH VERD
  • [10] Approximate statistical tests for comparing supervised classification learning algorithms
    Dietterich, TG
    [J]. NEURAL COMPUTATION, 1998, 10 (07) : 1895 - 1923