Sequence optimization as an alternative to de novo analysis of tandem mass spectrometry data

被引:22
作者
Heredia-Langner, A [1 ]
Cannon, WR [1 ]
Jarman, KD [1 ]
Jarman, KH [1 ]
机构
[1] Pacific NW Natl Lab, Richland, WA 99352 USA
关键词
D O I
10.1093/bioinformatics/bth242
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Peptide identification following tandem mass spectrometry (MS/MS) is usually achieved by searching for the best match between the mass spectrum of an unidentified peptide and model spectra generated from peptides in a sequence database. This methodology will be successful only if the peptide under investigation belongs to an available database. Our objective is to develop and test the performance of a heuristic optimization algorithm capable of dealing with some features commonly found in actual MS/MS spectra that tend to stop simpler deterministic solution approaches. Results: We present the implementation of a Genetic Algorithm (GA) in the reconstruction of amino acid sequences using only spectral features, discuss some of the problems associated with this approach and compare its performance to a de novo sequencing method. The GA can potentially overcome some of the most problematic aspects associated with de novo analysis of real MS/MS data such as missing or unclearly defined peaks and may prove to be a valuable tool in the proteomics field. We assess the performance of our algorithm under conditions of perfect spectral information, in situations where key spectral features are missing, and using real MS/MS spectral data.
引用
收藏
页码:2296 / 2304
页数:9
相关论文
共 26 条
[1]   FAST ALGORITHM FOR PEPTIDE SEQUENCING BY MASS-SPECTROSCOPY [J].
BARTELS, C .
BIOMEDICAL AND ENVIRONMENTAL MASS SPECTROMETRY, 1990, 19 (06) :363-368
[2]   Improved peptide sequencing using isotope information inherent in tandem mass spectra [J].
Cannon, WR ;
Jarman, KD .
RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 2003, 17 (15) :1793-1801
[3]   A dynamic programming approach to de novo peptide sequencing via tandem mass spectrometry [J].
Chen, T ;
Kao, MY ;
Tepel, M ;
Rush, J ;
Church, GM .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (03) :325-337
[4]   Penalty guided genetic search for reliability design optimization [J].
Coit, DW ;
Smith, AE .
COMPUTERS & INDUSTRIAL ENGINEERING, 1996, 30 (04) :895-904
[5]   De novo peptide sequencing via tandem mass spectrometry [J].
Dancík, V ;
Addona, TA ;
Clauser, KR ;
Vath, JE ;
Pevzner, PA .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (3-4) :327-342
[6]   AN APPROACH TO CORRELATE TANDEM MASS-SPECTRAL DATA OF PEPTIDES WITH AMINO-ACID-SEQUENCES IN A PROTEIN DATABASE [J].
ENG, JK ;
MCCORMACK, AL ;
YATES, JR .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1994, 5 (11) :976-989
[7]   A statistical basis for testing the significance of mass spectrometric protein identification results [J].
Eriksson, J ;
Chait, BT ;
Fenyö, D .
ANALYTICAL CHEMISTRY, 2000, 72 (05) :999-1005
[8]   Functional organization of the yeast proteome by systematic analysis of protein complexes [J].
Gavin, AC ;
Bösche, M ;
Krause, R ;
Grandi, P ;
Marzioch, M ;
Bauer, A ;
Schultz, J ;
Rick, JM ;
Michon, AM ;
Cruciat, CM ;
Remor, M ;
Höfert, C ;
Schelder, M ;
Brajenovic, M ;
Ruffner, H ;
Merino, A ;
Klein, K ;
Hudak, M ;
Dickson, D ;
Rudi, T ;
Gnau, V ;
Bauch, A ;
Bastuck, S ;
Huhse, B ;
Leutwein, C ;
Heurtier, MA ;
Copley, RR ;
Edelmann, A ;
Querfurth, E ;
Rybin, V ;
Drewes, G ;
Raida, M ;
Bouwmeester, T ;
Bork, P ;
Seraphin, B ;
Kuster, B ;
Neubauer, G ;
Superti-Furga, G .
NATURE, 2002, 415 (6868) :141-147
[9]   Genetic algorithms for the construction of D-optimal designs [J].
Heredia-Langner, A ;
Carlyle, WM ;
Montgomery, DC ;
Borror, CM ;
Runger, GC .
JOURNAL OF QUALITY TECHNOLOGY, 2003, 35 (01) :28-46
[10]   PATTERN-BASED ALGORITHM FOR PEPTIDE SEQUENCING FROM TANDEM HIGH-ENERGY COLLISION-INDUCED DISSOCIATION MASS-SPECTRA [J].
HINES, WM ;
FALICK, AM ;
BURLINGAME, AL ;
GIBSON, BW .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1992, 3 (04) :326-336