Bacterial species identification from MALDI-TOF mass spectra through data analysis and machine learning

被引:144
作者
De Bruyne, Katrien [1 ,2 ]
Slabbinck, Bram [1 ,3 ]
Waegeman, Willem [3 ]
Vauterin, Paul [2 ]
De Baets, Bernard [3 ]
Vandamme, Peter [1 ]
机构
[1] Univ Ghent, Microbiol Lab, B-9000 Ghent, Belgium
[2] Appl Maths NV, B-9830 Sint Martens Latem, Belgium
[3] Univ Ghent, Dept Appl Math Biometr & Proc Control, KERMIT, B-9000 Ghent, Belgium
基金
比利时弗兰德研究基金会;
关键词
MALDI-TOF MS; Bacteria; Species; Identification; Data processing; Machine learning techniques; Leuconostoc; Fructobacillus; Lactococcus; ASSISTED LASER-DESORPTION; TRANSFORM INFRARED-SPECTROSCOPY; NETWORK-BASED IDENTIFICATION; RAPID IDENTIFICATION; IONIZATION-TIME; INTACT MICROORGANISMS; PATTERN-RECOGNITION; SAMPLE PREPARATION; STRAIN LEVEL; SPECTROMETRY;
D O I
10.1016/j.syapm.2010.11.003
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
At present, there is much variability between MALDI-TOF MS methodology for the characterization of bacteria through differences in e.g., sample preparation methods, matrix solutions, organic solvents, acquisition methods and data analysis methods. After evaluation of the existing methods, a standard protocol was developed to generate MALDI-TOF mass spectra obtained from a collection of reference strains belonging to the genera Leuconostoc. Fructobacillus and Lactococcus. Bacterial cells were harvested after 24 h of growth at 28 degrees C on the media MRS or TSA. Mass spectra were generated, using the CHCA matrix combined with a 50:48:2 acetonitrile:water:trifluoroacetic acid matrix solution, and analyzed by the cell smear method and the cell extract method. After a data preprocessing step, the resulting high quality data set was used for PCA, distance calculation and multi-dimensional scaling. Using these analyses, species-specific information in the MALDI-TOF mass spectra could be demonstrated. As a next step, the spectra, as well as the binary character set derived from these spectra, were successfully used for species identification within the genera Leuconostoc, Fructobacillus, and Lactococcus. Using MALDI-TOF MS identification libraries for Leuconostoc and Fructobacillus strains, 84% of the MALDI-TOF mass spectra were correctly identified at the species level. Similarly, the same analysis strategy within the genus Lactococcus resulted in 94% correct identifications, taking species and subspecies levels into consideration. Finally, two machine learning techniques were evaluated as alternative species identification tools. The two techniques, support vector machines and random forests, resulted in accuracies between 94% and 98% for the identification of Leuconostoc and Fructobacillus species, respectively. (C) 2011 Elsevier GmbH. All rights reserved.
引用
收藏
页码:20 / 29
页数:10
相关论文
共 47 条
[1]  
*ANAGNOSTEC GMBH, 2010 SARAMIS PROD VE
[2]  
*APPL MATHS NV, 2009 BIONUMERICS MAN
[3]   Rapid identification and typing of Listeria species by matrix-assisted laser desorption ionization-time of flight mass spectrometry [J].
Barbuddhe, Sukhadeo B. ;
Maier, Thomas ;
Schwarz, Gerold ;
Kostrzewa, Markus ;
Hof, Herbert ;
Domann, Eugen ;
Chakraborty, Trinad ;
Hain, Torsten .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2008, 74 (17) :5402-5407
[4]   Fourier transform infrared spectroscopy for rapid identification of nonfermenting gram-negative bacteria isolated from sputum samples from cystic fibrosis patients [J].
Bosch, Alejandra ;
Minan, Alejandro ;
Vescina, Cecilia ;
Degrossi, Jose ;
Gatti, Blanca ;
Montanaro, Patricia ;
Messina, Matias ;
Franco, Mirta ;
Vay, Carlos ;
Schmitt, Juergen ;
Naumann, Dieter ;
Yantorno, Osvaldo .
JOURNAL OF CLINICAL MICROBIOLOGY, 2008, 46 (08) :2535-2546
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]  
BREIMAN L, 2004 RANDOM FORESTS
[7]   Rapid typing of bacteria using matrix-assisted laser desorption ionisation time-of-flight mass spectrometry and pattern recognition software [J].
Bright, JJ ;
Claydon, MA ;
Soufian, M ;
Gordon, DB .
JOURNAL OF MICROBIOLOGICAL METHODS, 2002, 48 (2-3) :127-138
[8]  
*BRUK DALT GMBH, 2010, MT80 BRUK DALT GMBH
[9]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[10]   The rapid identification of intact microorganisms using mass spectrometry [J].
Claydon, MA ;
Davey, SN ;
EdwardsJones, V ;
Gordon, DB .
NATURE BIOTECHNOLOGY, 1996, 14 (11) :1584-1586