A graph-theoretic approach for the separation of b and y ions in tandem mass spectra

被引:30
作者
Yan, B
Pan, C
Olman, VN
Hettich, RL
Xu, Y [1 ]
机构
[1] Univ Georgia, Dept Biochem & Mol Biol, Computat Syst Biol Lab, Athens, GA 30602 USA
[2] Univ Tennessee, Genome Sci & Technol Grad Sch, Knoxville, TN 37996 USA
[3] Univ Tennessee, Computat Biol Inst, Knoxville, TN 37996 USA
[4] Oak Ridge Natl Lab, Div Chem Sci, Organ & Biol Mass Spectrometry Grp, Oak Ridge, TN 37831 USA
基金
美国国家科学基金会;
关键词
D O I
10.1093/bioinformatics/bti044
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Ion-type identification is a fundamental problem in computational proteomics. Methods for accurate identification of ion types provide the basis for many mass spectrometry data interpretation problems, including (a) de novo sequencing, (b) identification of post-translational modifications and mutations and (c) validation of database search results. Results: Here, we present a novel graph-theoretic approach for solving the problem of separating b ions from y ions in a set of tandem mass spectra. We represent each spectral peak as a node and consider two types of edges: type-1 edge connecting two peaks probably of the same ion types and type-2 edge connecting two peaks probably of different ion types. The problem of ion-separation is formulated and solved as a graph partition problem, which is to partition the graph into three subgraphs, representing b, y and others ions, respectively, through maximizing the total weight of type-1 edges while minimizing the total weight of type-2 edges within each partitioned subgraph. We have developed a dynamic programming algorithm for rigorously solving this graph partition problem and implemented it as a computer program PRIME (PaRtition of Ion types in tandem Mass spEctra). The tests on a large amount of simulated mass spectra and 19 sets of high-quality experimental Fourier transform ion cyclotron resonance tandem mass spectra indicate that an accuracy level of similar to90% for the separation of b and y ions was achieved.
引用
收藏
页码:563 / 574
页数:12
相关论文
共 31 条
[1]   Mass spectrometry-based proteomics [J].
Aebersold, R ;
Mann, M .
NATURE, 2003, 422 (6928) :198-207
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   Proteomic characterization of the human centrosome by protein correlation profiling [J].
Andersen, JS ;
Wilkinson, CJ ;
Mayor, T ;
Mortensen, P ;
Nigg, EA ;
Mann, M .
NATURE, 2003, 426 (6966) :570-574
[4]  
[Anonymous], PROTEOME RES NEW FRO
[5]   FAST ALGORITHM FOR PEPTIDE SEQUENCING BY MASS-SPECTROSCOPY [J].
BARTELS, C .
BIOMEDICAL AND ENVIRONMENTAL MASS SPECTROMETRY, 1990, 19 (06) :363-368
[6]   The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 [J].
Boeckmann, B ;
Bairoch, A ;
Apweiler, R ;
Blatter, MC ;
Estreicher, A ;
Gasteiger, E ;
Martin, MJ ;
Michoud, K ;
O'Donovan, C ;
Phan, I ;
Pilbout, S ;
Schneider, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :365-370
[7]   A dynamic programming approach to de novo peptide sequencing via tandem mass spectrometry [J].
Chen, T ;
Kao, MY ;
Tepel, M ;
Rush, J ;
Church, GM .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (03) :325-337
[8]   Role of accurate mass measurement (±10 ppm) in protein identification strategies employing MS or MS MS and database searching [J].
Clauser, KR ;
Baker, P ;
Burlingame, AL .
ANALYTICAL CHEMISTRY, 1999, 71 (14) :2871-2882
[9]  
Cormen T. H., 2001, Introduction to Algorithms, V2nd
[10]   De novo peptide sequencing via tandem mass spectrometry [J].
Dancík, V ;
Addona, TA ;
Clauser, KR ;
Vath, JE ;
Pevzner, PA .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (3-4) :327-342