Discrimination of intact mycobacteria at the strain level: A combined MALDI-TOF MS and biostatistical analysis

被引:85
作者
Hettick, Justin M.
Kashon, Michael L.
Slaven, James E.
Ma, Yan
Simpson, Janet P.
Siegel, Paul D.
Mazurek, Gerald N.
Weissman, David N.
机构
[1] Ctr Dis Control & Prevent, HELD, ACIB, NIOSH, Morgantown, WV 26505 USA
[2] Ctr Dis Control & Prevent, Natl Ctr HIV STD & TB Prevent, Div TB Eliminat, Atlanta, GA USA
[3] Ctr Dis Control & Prevent, NIOSH, Div Resp Dis Studies, Morgantown, WV 26505 USA
[4] W Virginia Univ, Dept Stat, Morgantown, WV 26506 USA
关键词
linear discriminant analysis; MALDI; MS; mycobacteria; random forests;
D O I
10.1002/pmic.200600335
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
New methodologies for surveillance and identification of Mycobacterium tuberculosis are required to stem the spread of disease worldwide. In addition, the ability to discriminate mycobacteria at the strain level may be important to contact or source case investigations. To this end, we are developing MALDI-TOF MS methods for the identification of M. tuberculosis in culture. In this report, we describe the application of MALDI-TOF MS, as well as statistical analysis including linear discriminant and random forest analysis, to 16 medically relevant strains from four species of mycobacteria, M. tuberculosis, M. avium, M. intracellulare, and M. kansasii. Although species discrimination can be accomplished on the basis of unique m/z values observed in the MS fingerprint spectrum, discrimination at the strain level is predicted on the relative abundance of shared m/z values among strains within a species. For the 16 mycobacterial strains investigated in the present study, it is possible to unambiguously identify strains within a species on the basis of MALDI-TOF MS data. The error rate for classification of individual strains using linear discriminant analysis was 0.053 using 37 m/z variables, whereas the error rate for classification of individual strains using random forest analysis was 0.023 using only 18 m/z variables. In addition, using random forest analysis of MALDI-TOF MS data, it was possible to correctly classify bacterial strains as either M. tuberculosis or non-tuberculous with 100% accuracy.
引用
收藏
页码:6416 / 6425
页数:10
相关论文
共 39 条
  • [1] Tablan Ofelia C, 2004, MMWR Recomm Rep, V53, P1
  • [2] Mycolic acids: Structure, biosynthesis and physiological functions
    Barry, CE
    Lee, RE
    Mdluli, K
    Sampson, AE
    Schroeder, BG
    Slayden, RA
    Yuan, Y
    [J]. PROGRESS IN LIPID RESEARCH, 1998, 37 (2-3) : 143 - 179
  • [3] SAMPLE MATRIX EFFECTS IN INFRARED-LASER NEUTRAL DESORPTION, MULTIPHOTON-IONIZATION MASS-SPECTROMETRY .7.
    BEAVIS, RC
    LINDNER, J
    GROTEMEYER, J
    SCHLAG, EW
    [J]. CHEMICAL PHYSICS LETTERS, 1988, 146 (3-4) : 310 - 314
  • [4] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [5] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [6] IDENTIFICATION OF MYCOBACTERIA BY HIGH-PERFORMANCE LIQUID-CHROMATOGRAPHY
    BUTLER, WR
    JOST, KC
    KILBURN, JO
    [J]. JOURNAL OF CLINICAL MICROBIOLOGY, 1991, 29 (11) : 2468 - 2472
  • [7] ULTRAHIGH-RESOLUTION MATRIX-ASSISTED LASER-DESORPTION IONIZATION OF SMALL PROTEINS BY FOURIER-TRANSFORM MASS-SPECTROMETRY
    CASTORO, JA
    WILKINS, CL
    [J]. ANALYTICAL CHEMISTRY, 1993, 65 (19) : 2621 - 2627
  • [8] The new time-of-flight mass spectrometry
    Cotter, RJ
    [J]. ANALYTICAL CHEMISTRY, 1999, 71 (13) : 445A - 451A
  • [9] Gene selection and classification of microarray data using random forest -: art. no. 3
    Díaz-Uriarte, R
    de Andrés, SA
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [10] Domin MA, 1999, RAPID COMMUN MASS SP, V13, P222, DOI 10.1002/(SICI)1097-0231(19990228)13:4<222::AID-RCM440>3.0.CO