Antilope-A Lagrangian Relaxation Approach to the de novo Peptide Sequencing Problem

被引:13
作者
Andreotti, Sandro [1 ,2 ]
Klau, Gunnar W. [3 ,4 ]
Reinert, Knut [1 ,2 ]
机构
[1] Free Univ Berlin, Inst Informat, D-14195 Berlin, Germany
[2] Int Max Planck Res Sch Computat Biol & Sci Comp I, Berlin, Germany
[3] CWI, Life Sci Grp, NL-1090 GB Amsterdam, Netherlands
[4] NISB, Amsterdam, Netherlands
关键词
Computational proteomics; de novo peptide sequencing; Lagrangian relaxation; discrete optimization; MASS-SPECTROMETRY; IDENTIFICATION; ALGORITHM; ALIGNMENT; SOFTWARE; SPECTRA;
D O I
10.1109/TCBB.2011.59
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Peptide sequencing from mass spectrometry data is a key step in proteome research. Especially de novo sequencing, the identification of a peptide from its spectrum alone, is still a challenge even for state-of-the-art algorithmic approaches. In this paper, we present ANTILOPE, a new fast and flexible approach based on mathematical programming. It builds on the spectrum graph model and works with a variety of scoring schemes. ANTILOPE combines Lagrangian relaxation for solving an integer linear programming formulation with an adaptation of Yen's k shortest paths algorithm. It shows a significant improvement in running time compared to mixed integer optimization and performs at the same speed like other state-of-the-art tools. We also implemented a generic probabilistic scoring scheme that can be trained automatically for a data set of annotated spectra and is independent of the mass spectrometer type. Evaluations on benchmark data show that ANTILOPE is competitive to the popular state- of-the-art programs PepNovo and NovoHMM both in terms of runtime and accuracy. Furthermore, it offers increased flexibility in the number of considered ion types. ANTILOPE will be freely available as part of the open source proteomics library OpenMS.
引用
收藏
页码:385 / 394
页数:10
相关论文
共 35 条
[1]
A Lagrangian relaxation approach for the multiple sequence alignment problem [J].
Althaus, Ernst ;
Canzar, Stefan .
JOURNAL OF COMBINATORIAL OPTIMIZATION, 2008, 16 (02) :127-154
[2]
Andreotti S., 2008, THESIS FREIE U BERLI
[3]
Bafna V., 2003, P 7 ANN INT C COMPUT, P9
[4]
Balev S, 2004, LECT NOTES COMPUT SC, V3240, P182
[5]
FAST ALGORITHM FOR PEPTIDE SEQUENCING BY MASS-SPECTROSCOPY [J].
BARTELS, C .
BIOMEDICAL AND ENVIRONMENTAL MASS SPECTROMETRY, 1990, 19 (06) :363-368
[6]
Accurate multiple sequence-structure alignment of RNA sequences using combinatorial optimization [J].
Bauer, Markus ;
Klau, Gunnar W. ;
Reinert, Knut .
BMC BIOINFORMATICS, 2007, 8 (1)
[7]
De novo analysis of peptide tandem mass spectra by spectral graph partitioning [J].
Bern, M ;
Goldberg, D .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2006, 13 (02) :364-378
[8]
De novo peptide sequencing by tandem MS using complementary CID and electron transfer dissociation [J].
Bertsch, Andreas ;
Leinenbach, Andreas ;
Pervukhin, Anton ;
Lubeck, Markus ;
Hartmer, Ralf ;
Baessmann, Carsten ;
Elnakady, Yasser Abbas ;
Mueller, Rolf ;
Boecker, Sebastian ;
Huber, Christian G. ;
Kohlbacher, Oliver .
ELECTROPHORESIS, 2009, 30 (21) :3736-3747
[9]
Bertsimas Dimitris, 1997, Introduction to linear optimization, V6
[10]
Bouckaert RemcoR., 2004, BAYESIAN NETWORK CLA