Classification of geographic origin of rice by data mining and inductively coupled plasma mass spectrometry

被引:88
作者
Maione, Camila [1 ]
Batista, Bruno Lemos [2 ]
Campiglia, Andres Dobal [3 ]
Barbosa, Fernando, Jr. [4 ]
Barbosa, Rommel Melgaco [1 ]
机构
[1] Univ Fed Goias, Inst Informat, Goiania, Go, Brazil
[2] Fed Univ ABC, Ctr Nat Sci & Humanities, Sao Paulo, Brazil
[3] Univ Cent Florida, Dept Chem, Orlando, FL 32816 USA
[4] Univ Sao Paulo, Dept Clin Anal Toxicol & Food Sci, Sch Pharmaceut Sci Ribeirao Preto, Sao Paulo, Brazil
关键词
Data mining; ICP-MS; Rice; Classification; Support vector machines; F-score; ARTIFICIAL NEURAL-NETWORKS; TRACE-ELEMENTS; DISCRIMINATION; AUTHENTICITY; ALGORITHMS; SAMPLES;
D O I
10.1016/j.compag.2015.11.009
中图分类号
S [农业科学];
学科分类号
082806 [农业信息与电气工程];
摘要
Rice is one of the most consumed cereals in the world and the main food product in the diet of the Brazilian population. Brazil itself is among the ten largest producers of rice, and most of the harvest comes from the South and Midwest regions. This paper presents a data mining study of samples of rice obtained from producers in Goias (Midwest region) and Rio Grande do Sul (South region), and builds classification models capable of predicting the geographical origin of a rice sample based on its chemical components. We use three popular classification techniques, support vector machines, random forests and neural networks, along with the F-score formula which measures the relative importance of the input variables. We achieved very good performances for the SVM, RF and MLP models with 93.66%, 93.83% and 90% prediction accuracy, respectively, on the 10-fold cross validation. The F-score shows that Cd(cadmium), Rb(rubidium), Mg(magnesium) and K(potassium) are the four most relevant components for prediction. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:101 / 107
页数:7
相关论文
共 34 条
[1]
AL-Bakary A.M., 2010, IJACT INT J ADV COMP
[2]
A simple and practical control of the authenticity of organic sugarcane samples based on the use of machine-learning algorithms and trace elements determination by inductively coupled plasma mass spectrometry [J].
Barbosa, Rommel M. ;
Batista, Bruno L. ;
Bariao, Camila V. ;
Varrique, Renan M. ;
Coelho, Vinicius A. ;
Campiglia, Andres D. ;
Barbosa, Fernando, Jr. .
FOOD CHEMISTRY, 2015, 184 :154-159
[3]
The Use of Decision Trees and Naive Bayes Algorithms and Trace Element Patterns for Controlling the Authenticity of Free-Range-Pastured Hens' Eggs [J].
Barbosa, Rommel Melgaco ;
Nacano, Leticia Ramos ;
Freitas, Rodolfo ;
Batista, Bruno Lemos ;
Barbosa, Fernando, Jr. .
JOURNAL OF FOOD SCIENCE, 2014, 79 (09) :C1672-C1677
[4]
Survey of 13 trace elements of toxic and nutritional significance in rice from Brazil and exposure assessment [J].
Batista, B. L. ;
De Oliveira Souza, V. C. ;
Da Silva, F. G. ;
Barbosa, F., Jr. .
FOOD ADDITIVES & CONTAMINANTS PART B-SURVEILLANCE, 2010, 3 (04) :253-262
[5]
CADMIUM AVAILABILITY TO RICE IN SLUDGE-AMENDED SOIL UNDER FLOOD AND NON-FLOOD CULTURE [J].
BINGHAM, FT ;
PAGE, AL ;
MAHLER, RJ ;
GANJE, TJ .
SOIL SCIENCE SOCIETY OF AMERICA JOURNAL, 1976, 40 (05) :715-719
[6]
Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]
A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[8]
Discrimination of geographical origin of rice based on multi-element fingerprinting by high resolution inductively coupled plasma mass spectrometry [J].
Cheajesadagul, Pracha ;
Arnaudguilhem, Carine ;
Shiowatana, Juwadee ;
Siripinyanond, Atitaya ;
Szpunar, Joanna .
FOOD CHEMISTRY, 2013, 141 (04) :3504-3509
[9]
Chen YW, 2006, STUD FUZZ SOFT COMP, V207, P315
[10]
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411