Comparison of two exploratory data analysis methods for classification of Phyllanthus chemical fingerprint: unsupervised vs. supervised pattern recognition technologies

Cited by: 24
Authors
Guo, Jianru [1]
Chen, QianQian [1]
Wang, Caiyun [1]
Qiu, Hongcong [2]
Liu, Buming [2]
Jiang, Zhi-Hong [1]
Zhang, Wei [1]
Affiliations
[1] Macau Univ Sci & Technol, Macau Inst Appl Res Med & Hlth, State Key Lab Qual Res Chinese Med, Taipa, Macau, Peoples R China
[2] Guangxi Inst Tradit Med & Pharmaceut Sci, Guangxi Key Lab Tradit Chinese Med Qual Stand, Nanning 530022, Peoples R China
Keywords
Phyllanthus; Unsupervised; Supervised; Pattern recognition; High-performance liquid chromatography time-of-flight mass spectrometry; PRINCIPAL COMPONENT ANALYSIS; PERFORMANCE LIQUID-CHROMATOGRAPHY; ARTIFICIAL NEURAL-NETWORKS; DIODE-ARRAY DETECTOR; HEPATITIS-B VIRUS; MASS-SPECTROMETRY; CLUSTER-ANALYSIS; QUALITY-CONTROL; NIRURI; AMARUS
DOI
10.1007/s00216-014-8371-x
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification code
070307 [Chemical Biology]
Abstract
In this study, unsupervised and supervised classification methods were compared for comprehensive analysis of the fingerprints of 26 Phyllanthus samples from different geographical regions and species. A total of 63 compounds were identified and tentatively assigned structures for the establishment of fingerprints using high-performance liquid chromatography time-of-flight mass spectrometry (HPLC/TOFMS). Unsupervised and supervised pattern recognition technologies, including principal component analysis (PCA), the nearest neighbors algorithm (NN), partial least squares discriminant analysis (PLS-DA), and artificial neural network (ANN), were employed. Results showed that Phyllanthus samples could be correctly classified according to their geographical locations and species through ANN and PLS-DA. Important variables for cluster discrimination were also identified by PCA. Although unsupervised and supervised pattern recognition each have their own disadvantages and scope of application, they are effective and reliable for studying fingerprints of traditional Chinese medicines (TCM). These two technologies are complementary and can be superimposed. Our study is the first holistic comparison of supervised and unsupervised pattern recognition technologies in TCM chemical fingerprinting. The two approaches showed advantages in sample classification and data mining, respectively.
Pages: 1389-1401
Page count: 13