Gaussian mixture discriminant analysis for the single-cell differentiation of bacteria using micro-Raman spectroscopy

被引:83
作者
Schmid, Ulrike [1 ]
Roesch, Petra [2 ]
Krause, Mario [2 ]
Harz, Michaela [2 ]
Popp, Juergen [2 ]
Baumann, Knut [1 ]
机构
[1] Univ Technol, Inst Pharmaceut Chem, D-38106 Braunschweig, Germany
[2] Univ Jena, Inst Phys Chem, D-00743 Jena, Germany
关键词
Micro-Raman spectroscopy; Bacterial classification; Gaussian mixture discriminant analysis; (MDA); Pairwise classification; LEAST-SQUARES; IDENTIFICATION; SPECTRA;
D O I
10.1016/j.chemolab.2009.01.008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The differentiation of single bacterial cells using micro-Raman spectroscopy can be hampered by large intrastrain variability of the measured microorganisms due to fluctuating culture ages, nutrition conditions, and cultivation temperatures. Gaussian mixture discriminant analysis (MDA) is an effective classification approach for this task, as it is able to model inhomogeneous and scattering class structures. On the basis of a highly diverse dataset comprising 3642 spectra of 29 different strains of bacteria, the utility of MDA for the differentiation of microorganisms by micro-Raman spectroscopy was demonstrated in comparison to various linear and nonlinear classification algorithms. The employed algorithms include partial least squares discriminant analysis (PLS-DA), linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), k-nearest neighbor classifier (kNN) and support vector machines (SVMs). In a first attempt the best prediction performance was achieved by a SVM model yielding 87.3% of correctly classified spectra outperforming MDA (80.9%) and the other classification methods. The prediction accuracy of MDA can be improved markedly by establishing multiple one-class-versus-one-class models and making predictions by a major vote decision overall pairwise classifications. Using this pairwise approach the performance of MDA increased up to 86.6%, which is statistically equivalent to the performance of a support vector machine. In the case of MDA, the assessment of a posteriori probabilities allows a straightforward novelty detection procedure. Moreover, due to its cluster property, MDA can be employed to visualize the effect of varying cultivation parameters on the group-structure of the investigated dataset. The analysis demonstrates that MDA exhibits useful features for the differentiation of single bacteria by micro-Raman spectroscopy in terms of prediction accuracy, novelty detection, and interpretation of the model. (c) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:159 / 171
页数:13
相关论文
共 52 条
[1]  
Alhoniemi E., 2005, SOM TOOLBOX
[2]   Partial least squares for discrimination [J].
Barker, M ;
Rayens, W .
JOURNAL OF CHEMOMETRICS, 2003, 17 (03) :166-173
[3]   Properties of sufficiency and statistical tests [J].
Bartlett, MS .
PROCEEDINGS OF THE ROYAL SOCIETY OF LONDON SERIES A-MATHEMATICAL AND PHYSICAL SCIENCES, 1937, 160 (A901) :0268-0282
[4]  
BISHOP CM, 1994, IEE C VIS IM SIGN PR
[5]   THE EFFECTS OF VIOLATIONS OF ASSUMPTIONS UNDERLYING THE T-TEST [J].
BONEAU, CA .
PSYCHOLOGICAL BULLETIN, 1960, 57 (01) :49-64
[6]   Consequences of sample size, variable selection, and model validation and optimisation, for predicting classification ability from analytical data [J].
Brereton, Richard G. .
TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2006, 25 (11) :1103-1111
[7]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8]  
CHANG WC, 1983, J ROY STAT SOC C, V32, P267
[9]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[10]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411