Fast alignment of fragmentation trees

被引:13
作者
Hufsky, Franziska [1 ,2 ]
Duehrkop, Kai [1 ]
Rasche, Florian [1 ]
Chimani, Markus
Boecker, Sebastian [1 ]
机构
[1] Univ Jena, Chair Bioinformat, Jena, Germany
[2] Max Planck Inst Chem Ecol, Jena, Germany
关键词
UNORDERED LABELED TREES; MASS-SPECTROMETRY; IDENTIFICATION; METABOLOMICS; DATABASE; SPECTRA; EDIT;
D O I
10.1093/bioinformatics/bts207
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Motivation: Mass spectrometry allows sensitive, automated and high-throughput analysis of small molecules such as metabolites. One major bottleneck in metabolomics is the identification of 'unknown' small molecules not in any database. Recently, fragmentation tree alignments have been introduced for the automated comparison of the fragmentation patterns of small molecules. Fragmentation pattern similarities are strongly correlated with the chemical similarity of the molecules, and allow us to cluster compounds based solely on their fragmentation patterns. Results: Aligning fragmentation trees is computationally hard. Nevertheless, we present three exact algorithms for the problem: a dynamic programming (DP) algorithm, a sparse variant of the DP, and an Integer Linear Program (ILP). Evaluation of our methods on three different datasets showed that thousands of alignments can be computed in a matter of minutes using DP, even for 'challenging' instances. Running times of the sparse DP were an order of magnitude better than for the classical DP. The ILP was clearly outperformed by both DP approaches. We also found that for both DP algorithms, computing the 1% slowest alignments required as much time as computing the 99% fastest.
引用
收藏
页码:I265 / I273
页数:9
相关论文
共 30 条
[1]
Proof verification and the hardness of approximation problems [J].
Arora, S ;
Lund, C ;
Motwani, R ;
Sudan, M ;
Szegedy, M .
JOURNAL OF THE ACM, 1998, 45 (03) :501-555
[2]
Sparse RNA folding: Time and space efficient algorithms [J].
Backofen, Rolf ;
Tsur, Dekel ;
Zakov, Shay ;
Ziv-Ukelson, Michal .
JOURNAL OF DISCRETE ALGORITHMS, 2011, 9 (01) :12-31
[3]
Fourier Meets Mobius: Fast Subset Convolution [J].
Bjorklund, Andreas ;
Husfeldt, Thore ;
Kaski, Petteri ;
Koivisto, Mikko .
STOC 07: PROCEEDINGS OF THE 39TH ANNUAL ACM SYMPOSIUM ON THEORY OF COMPUTING, 2007, :67-74
[4]
Towards de novo identification of metabolites by analyzing tandem mass spectra [J].
Boecker, Sebastian ;
Rasche, Florian .
BIOINFORMATICS, 2008, 24 (16) :I49-I55
[5]
Canzar S., 2011, P INT C AUT LANG PRO, V6755, P98
[6]
Metabolite identification via the Madison Metabolomics Consortium Database [J].
Cui, Qiu ;
Lewis, Ian A. ;
Hegeman, Adrian D. ;
Anderson, Mark E. ;
Li, Jing ;
Schulte, Christopher F. ;
Westler, William M. ;
Eghbalnia, Hamid R. ;
Sussman, Michael R. ;
Markley, John L. .
NATURE BIOTECHNOLOGY, 2008, 26 (02) :162-164
[7]
Innovation - Metabolite profiling: from diagnostics to systems biology [J].
Fernie, AR ;
Trethewey, RN ;
Krotzky, AJ ;
Willmitzer, L .
NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2004, 5 (09) :763-769
[8]
Extending the breadth of metabolite profiling by gas chromatography coupled to mass spectrometry [J].
Fiehn, Oliver .
TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2008, 27 (03) :261-269
[9]
Halket John M., 2005, Journal of Experimental Botany, V56, P219
[10]
Herlihy M, 2008, LECT NOTES COMPUT SC, V5218, P350, DOI 10.1007/978-3-540-87779-0_24