A hybrid method for peptide identification using integer linear optimization, local database search, and quadrupole time-of-flight or OrbiTrap tandem mass spectrometry

被引:15
作者
DiMaggio, Peter A., Jr. [1 ]
Floudas, Christodoulos A. [1 ]
Lu, Bingwen [2 ]
Yates, John R., III [2 ]
机构
[1] Princeton Univ, Dept Chem Engn, Princeton, NJ 08544 USA
[2] Scripps Res Inst, Dept Cell Biol, La Jolla, CA 92037 USA
关键词
hybrid peptide identification; high-precision tandem mass spectrometry; integer linear optimization (ILP);
D O I
10.1021/pr700577z
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A novel hybrid methodology for the automated identification of peptides via de novo integer linear optimization, local database search, and tandem mass spectrometry is presented in this article. A modified version of the de novo identification algorithm PILOT1,2 is utilized to construct accurate de novo peptide sequences. A modified version of the local database search tool FASTA(3) is used to query these de novo predictions against the nonredundant protein database to resolve any low-confidence amino acids in the candidate sequences. The computational burden associated with performing several alignments is alleviated with the use of distributive computing. Extensive computational studies are presented for this new hybrid methodology, as well as comparisons with MASCOT(4) for a set of 38 quadrupole time-of-flight (QTOF) and 380 OrbiTrap tandem mass spectra. The results for our proposed hybrid method for the OrbiTrap spectra are also compared with a modified version of PepNovo,(5) which was trained for use on high-precision tandem mass spectra, and the tag-based method InsPecT.(6) The de novo sequences of PILOT and PepNovo are also searched against the nonredundant protein database using CIDentify(7) to compare with the alignments achieved by our modifications of FASTA. The comparative studies demonstrate the excellent peptide identification accuracy gained from combining the strengths of our de novo method, which is based on integer linear optimization, and database driven search methods.
引用
收藏
页码:1584 / 1593
页数:10
相关论文
共 37 条
[1]   SYNTHESIS OF GENERAL DISTILLATION SEQUENCES - NONSHARP SEPARATIONS [J].
AGGARWAL, A ;
FLOUDAS, CA .
COMPUTERS & CHEMICAL ENGINEERING, 1990, 14 (06) :631-653
[2]   A RETROFIT APPROACH FOR HEAT-EXCHANGER NETWORKS [J].
CIRIC, AR ;
FLOUDAS, CA .
COMPUTERS & CHEMICAL ENGINEERING, 1989, 13 (06) :703-715
[3]  
*CPLEX, 2005, ILOG CPLEX 9 0 US MA
[4]  
Crainic T. G., 2006, PARALLEL COMBINATORI
[5]   De novo peptide sequencing via tandem mass spectrometry [J].
Dancík, V ;
Addona, TA ;
Clauser, KR ;
Vath, JE ;
Pevzner, PA .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (3-4) :327-342
[6]   De novo peptide identification via tandem mass spectrometry and integer linear optimization [J].
DiMaggio, Peter A., Jr. ;
Floudas, Christodoulos A. .
ANALYTICAL CHEMISTRY, 2007, 79 (04) :1433-1446
[7]   A mixed-integer optimization framework for de novo peptide identification [J].
DiMaggio, Peter A., Jr. ;
Floudas, Christodoulos A. .
AICHE JOURNAL, 2007, 53 (01) :160-173
[8]  
Floudas C.A., 1995, NONLINEAR MIXED INTE
[9]   SYNTHESIS OF DISTILLATION SEQUENCES WITH SEVERAL MULTICOMPONENT FEED AND PRODUCT STREAMS [J].
FLOUDAS, CA ;
ANASTASIADIS, SH .
CHEMICAL ENGINEERING SCIENCE, 1988, 43 (09) :2407-2419
[10]   SYNTHESIS OF FLEXIBLE HEAT-EXCHANGER NETWORKS WITH UNCERTAIN FLOWRATES AND TEMPERATURES [J].
FLOUDAS, CA ;
GROSSMANN, IE .
COMPUTERS & CHEMICAL ENGINEERING, 1987, 11 (04) :319-336