MetFusion: integration of compound identification strategies

Cited by: 141
Authors
Gerlich, Michael [1]
Neumann, Steffen [1]
Affiliations
[1] Leibniz Inst Plant Biochem, Dept Stress & Dev Biol, Halle (Saale), Germany
Source
JOURNAL OF MASS SPECTROMETRY | 2013 / Vol. 48 / Issue 03
Keywords
metabolomics; integrated identification; MassBank; MetFrag; in silico fragmentation; MASS; FRAGMENTATION; SIMILARITY; LIBRARY; MS
DOI
10.1002/jms.3123
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
070307 [Chemical Biology];
Abstract
Mass spectrometry (MS) is an important analytical technique for the detection and identification of small compounds. The main bottleneck in the interpretation of metabolite profiling or screening experiments is the identification of unknown compounds from tandem mass spectra. Spectral libraries for tandem MS, such as MassBank or NIST, contain reference spectra for many compounds, but their limited chemical coverage reduces the chance for a correct and reliable identification of unknown spectra outside the database domain. On the other hand, compound databases like PubChem or ChemSpider have a much larger coverage of the chemical space, but they cannot be queried with spectral information directly. Recently, computational mass spectrometry methods and in silico fragmentation prediction allow users to search such databases of chemical structures. We present a new strategy called MetFusion to combine identification results from several resources, in particular, from the in silico fragmenter MetFrag with the spectral library MassBank to improve compound identification. We evaluate the performance on a set of 1062 spectra and achieve an improved ranking of the correct compound from rank 28 using MetFrag alone, to rank 7 with MetFusion, even if the correct compound and similar compounds are absent from the spectral library. On the basis of the evaluation, we extrapolate the performance of MetFusion to the KEGG compound database. Copyright (c) 2013 John Wiley & Sons, Ltd.
Pages: 291-298
Number of pages: 8