Comprehensive evaluation of untargeted metabolomics data processing software in feature detection, quantification and discriminating marker selection

被引：103

作者：

Li, Zhucui ^{[1
,2
,3
]}

Lu, Yan ^{[1
,2
,4
]}

Guo, Yufeng ^{[3
]}

Cao, Haijie ^{[5
]}

Wang, Qinhong ^{[3
]}

Shui, Wenqing ^{[2
,4
]}

机构：

[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

[2] ShanghaiTech Univ, iHuman Inst, Shanghai 201210, Peoples R China

[3] Chinese Acad Sci, Tianjin Inst Ind Biotechnol, Tianjin 300308, Peoples R China

[4] ShanghaiTech Univ, Sch Life Sci & Technol, Shanghai 201210, Peoples R China

[5] Nankai Univ, Coll Pharm, Tianjin 300071, Peoples R China

来源：

ANALYTICA CHIMICA ACTA | 2018年 / 1029卷

基金：

中国国家自然科学基金;

关键词：

Untargeted metabolomics; Data processing software; Feature detection; Feature quantification; Discriminating marker selection; SPECTROMETRY-BASED METABOLOMICS; MASS-SPECTROMETRY; MISSING VALUES; DATA SET; DISCOVERY; PERFORMANCE; METABOLISM; WORKFLOW; PLATFORM; URINE;

D O I：

10.1016/j.aca.2018.05.001

中图分类号：

O65 [分析化学];

学科分类号：

070302 [分析化学];

摘要：

Data analysis represents a key challenge for untargeted metabolomics studies and it commonly requires extensive processing of more than thousands of metabolite peaks included in raw high-resolution MS data. Although a number of software packages have been developed to facilitate untargeted data processing, they have not been comprehensively scrutinized in the capability of feature detection, quantification and marker selection using a well-defined benchmark sample set. In this study, we acquired a benchmark dataset from standard mixtures consisting of 1100 compounds with specified concentration ratios including 130 compounds with significant variation of concentrations. Five software evaluated here (MS-Dial, MZmine 2, XCMS, MarkerView, and Compound Discoverer) showed similar performance in detection of true features derived from compounds in the mixtures. However, significant differences between untargeted metabolomics software were observed in relative quantification of true features in the benchmark dataset. MZmine 2 outperformed the other software in terms of quantification accuracy and it reported the most true discriminating markers together with the fewest false markers. Furthermore, we assessed selection of discriminating markers by different software using both the benchmark dataset and a real-case metabolomics dataset to propose combined usage of two software for increasing confidence of biomarker identification. Our findings from comprehensive evaluation of untargeted metabolomics software would help guide future improvements of these widely used bioinformatics tools and enable users to properly interpret their metabolomics results. (C) 2018 Elsevier B.V. All rights reserved.

引用

页码：50 / 57

页数：8

共 34 条

[1]

Toward Merging Untargeted and Targeted Methods in Mass Spectrometry-Based Metabolomics and Lipidomics [J].