Determination of Glycan Structure from Tandem Mass Spectra

被引:15
作者
Boecker, Sebastian [1 ]
Kehr, Birte [2 ]
Rasche, Florian [1 ]
机构
[1] Univ Jena, Fac Math & Comp Sci, Chair Bioinformat, D-07743 Jena, Germany
[2] Inst Comp Sci, Algorithm Bioinformat Grp, D-14195 Berlin, Germany
关键词
Computational mass spectrometry; glycans; parameterized algorithms; exact algorithms; counting trees; SPECTROMETRY; IDENTIFICATION; ALGORITHM; TOOL; OLIGOSACCHARIDES; GLYCOSYLATION; COMBINATION; ANNOTATION; TOPOLOGY; DATABASE;
D O I
10.1109/TCBB.2010.129
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Glycans are molecules made from simple sugars that form complex tree structures. Glycans constitute one of the most important protein modifications and identification of glycans remains a pressing problem in biology. Unfortunately, the structure of glycans is hard to predict from the genome sequence of an organism. In this paper, we consider the problem of deriving the topology of a glycan solely from tandem mass spectrometry (MS) data. We study, how to generate glycan tree candidates that sufficiently match the sample mass spectrum, avoiding the combinatorial explosion of glycan structures. Unfortunately, the resulting problem is known to be computationally hard. We present an efficient exact algorithm for this problem based on fixed-parameter algorithmics that can process a spectrum in a matter of seconds. We also report some preliminary results of our method on experimental data, combining it with a preliminary candidate evaluation scheme. We show that our approach is fast in applications, and that we can reach very well de novo identification results. Finally, we show how to count the number of glycan topologies for a fixed size or a fixed mass. We generalize this result to count the number of (labeled) trees with bounded out degree, improving on results obtained using Polya's enumeration theorem.
引用
收藏
页码:976 / 986
页数:11
相关论文
共 29 条
[1]   On the frequency of protein glycosylation, as deduced from analysis of the SWISS-PROT database [J].
Apweiler, R ;
Hermjakob, H ;
Sharon, N .
BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS, 1999, 1473 (01) :4-8
[2]   Fourier Meets Mobius: Fast Subset Convolution [J].
Bjorklund, Andreas ;
Husfeldt, Thore ;
Kaski, Petteri ;
Koivisto, Mikko .
STOC 07: PROCEEDINGS OF THE 39TH ANNUAL ACM SYMPOSIUM ON THEORY OF COMPUTING, 2007, :67-74
[3]   Towards de novo identification of metabolites by analyzing tandem mass spectra [J].
Boecker, Sebastian ;
Rasche, Florian .
BIOINFORMATICS, 2008, 24 (16) :I49-I55
[4]   A fast and simple algorithm for the money changing problem [J].
Boecker, Sebastian ;
Liptak, Zsuzsanna .
ALGORITHMICA, 2007, 48 (04) :413-432
[5]   SIRIUS: decomposing isotope patterns for metabolite identification [J].
Boecker, Sebastian ;
Letzel, Matthias C. ;
Liptak, Zsuzsanna ;
Pervukhin, Anton .
BIOINFORMATICS, 2009, 25 (02) :218-224
[6]   GlycoWorkbench: A tool for the computer-assisted annotation of mass spectra of Glycans [J].
Ceroni, Alessio ;
Maass, Kai ;
Geyer, Hildegard ;
Geyer, Rudolf ;
Dell, Anne ;
Haslam, Stuart M. .
JOURNAL OF PROTEOME RESEARCH, 2008, 7 (04) :1650-1659
[7]   A dynamic programming approach to de novo peptide sequencing via tandem mass spectrometry [J].
Chen, T ;
Kao, MY ;
Tepel, M ;
Rush, J ;
Church, GM .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (03) :325-337
[8]  
Cooper CA, 2001, PROTEOMICS, V1, P340, DOI 10.1002/1615-9861(200102)1:2<340::AID-PROT340>3.3.CO
[9]  
2-2
[10]   Glycoprotein structure determination mass spectrometry [J].
Dell, A ;
Morris, HR .
SCIENCE, 2001, 291 (5512) :2351-2356