Comparative evaluation of preprocessing freeware on chromatography/mass spectrometry data for signature discovery

被引:73
作者
Coble, Jamie B. [1 ]
Fraga, Carlos G. [2 ]
机构
[1] Univ Tennessee, Dept Nucl Engn, Knoxville, TN 37996 USA
[2] Pacific NW Natl Lab, Richland, WA 99352 USA
关键词
Chemical forensics; Chemometrics; Metabolomics; Biomarkers; Impurity profiling; Metabolite profiling; GC/MS; LC/MS; Met Align; MZmine; SpectConnect; XCMS; LC-MS; METABOLOMICS; ALIGNMENT; ALGORITHMS; SOFTWARE; TOOLS;
D O I
10.1016/j.chroma.2014.06.100
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Preprocessing software, which converts large instrumental data sets into a manageable format for data analysis, is crucial for the discovery of chemical signatures in metabolomics, chemical forensics, and other signature-focused disciplines. Here, four freely available and published preprocessing tools known as MetAlign, MZmine, SpectConnect, and XCMS were evaluated for impurity profiling using nominal mass GC/MS data and accurate mass LC/MS data. Both data sets were previously collected from the analysis of replicate samples from multiple stocks of a nerve-agent precursor and method blanks. Parameters were optimized for each of the four tools for the untargeted detection, matching, and cataloging of chromatographic peaks from impurities present in the stock samples. The peak table generated by each preprocessing tool was analyzed to determine the number of impurity components detected in all replicate samples per stock and absent in the method blanks. A cumulative set of impurity components was then generated using all available peak tables and used as a reference to calculate the percent of component detections for each tool, in which 100% indicated the detection of every known component present in a stock. For the nominal mass GC/MS data, MetAlign had the most component detections followed by MZmine, SpectConnect, and XCMS with detection percentages of 83, 60, 47, and 41%, respectively. For the accurate mass LC/MS data, the order was MetAlign, XCMS, and MZmine with detection percentages of 80, 45, and 35%, respectively. SpectConnect did not function for the accurate mass LC/MS data. Larger detection percentages were obtained by combining the top performer with at least one of the other tools such as 96% by combining MetAlign with MZmine for the GC/MS data and 93% by combining MetAlign with XCMS for the LC/MS data. In terms of quantitative performance, the reported peak intensities from each tool had averaged absolute biases (relative to peak intensities obtained using instrument software) of 41, 4.4, 1.3 and 1.3% for SpectConnect, MetAlign, XCMS, and MZmine, respectively, for the GC/MS data. For the LC/MS data, the averaged absolute biases were 22, 4.5, and 3.1% for MetAlign, MZmine, and XCMS, respectively. In summary, MetAlign performed the best in terms of the number of component detections; however, more than one preprocessing tool should be considered to avoid missing impurities or other trace components as potential chemical signatures. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:155 / 164
页数:10
相关论文
共 17 条
[1]
Comparative LC-MS: A landscape of peaks and valleys [J].
America, Antoine H. P. ;
Cordewener, Jan H. G. .
PROTEOMICS, 2008, 8 (04) :731-749
[2]
Algorithms and tools for the preprocessing of LC-MS metabolomics data [J].
Castillo, Sandra ;
Gopalacharyulu, Peddinti ;
Yetukuri, Laxman ;
Oresic, Matej .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2011, 108 (01) :23-32
[3]
A cross-platform toolkit for mass spectrometry and proteomics [J].
Chambers, Matthew C. ;
Maclean, Brendan ;
Burke, Robert ;
Amodei, Dario ;
Ruderman, Daniel L. ;
Neumann, Steffen ;
Gatto, Laurent ;
Fischer, Bernd ;
Pratt, Brian ;
Egertson, Jarrett ;
Hoff, Katherine ;
Kessner, Darren ;
Tasman, Natalie ;
Shulman, Nicholas ;
Frewen, Barbara ;
Baker, Tahmina A. ;
Brusniak, Mi-Youn ;
Paulse, Christopher ;
Creasy, David ;
Flashner, Lisa ;
Kani, Kian ;
Moulding, Chris ;
Seymour, Sean L. ;
Nuwaysir, Lydia M. ;
Lefebvre, Brent ;
Kuhlmann, Frank ;
Roark, Joe ;
Rainer, Paape ;
Detlev, Suckau ;
Hemenway, Tina ;
Huhmer, Andreas ;
Langridge, James ;
Connolly, Brian ;
Chadick, Trey ;
Holly, Krisztina ;
Eckels, Josh ;
Deutsch, Eric W. ;
Moritz, Robert L. ;
Katz, Jonathan E. ;
Agus, David B. ;
MacCoss, Michael ;
Tabb, David L. ;
Mallick, Parag .
NATURE BIOTECHNOLOGY, 2012, 30 (10) :918-920
[4]
Tools for computational processing of LC-MS datasets:: A user's perspective [J].
Codrea, Marius C. ;
Jimenez, Connie R. ;
Heringa, Jaap ;
Marchiori, Elena .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2007, 86 (03) :281-290
[5]
Impurity Profiling to Match a Nerve Agent to Its Precursor Source for Chemical Forensics Applications [J].
Fraga, Carlos G. ;
Acosta, Gabriel A. Perez ;
Crenshaw, Michael D. ;
Wallace, Krys ;
Mong, Gary M. ;
Colburn, Heather A. .
ANALYTICAL CHEMISTRY, 2011, 83 (24) :9564-9572
[6]
Signature-Discovery Approach for Sample Matching of a Nerve-Agent Precursor Using Liquid Chromatography-Mass Spectrometry, XCMS, and Chemometrics [J].
Fraga, Carlos G. ;
Clowers, Brian H. ;
Moore, Ronald J. ;
Zink, Erika M. .
ANALYTICAL CHEMISTRY, 2010, 82 (10) :4165-4173
[7]
Processing methods for differential analysis of LC/MS profile data [J].
Katajamaa, M ;
Oresic, M .
BMC BIOINFORMATICS, 2005, 6 (1)
[8]
Data processing for mass spectrometry-based metabolomics [J].
Katajamaa, Mikko ;
Oresic, Matej .
JOURNAL OF CHROMATOGRAPHY A, 2007, 1158 (1-2) :318-328
[9]
Comparative evaluation of software for retention time alignment of gas chromatography/time-of-flight mass spectrometry-based metabonomic data [J].
Koh, Yueting ;
Pasikanti, Kishore Kumar ;
Yap, Chun Wei ;
Chan, Eric Chun Yong .
JOURNAL OF CHROMATOGRAPHY A, 2010, 1217 (52) :8308-8316
[10]
Critical assessment of alignment procedures for LC- MS proteomics and metabolomics measurements [J].
Lange, Eva ;
Tautenhahn, Ralf ;
Neumann, Steffen ;
Groepl, Clemens .
BMC BIOINFORMATICS, 2008, 9 (1)