A mixed-integer optimization framework for de novo peptide identification

被引:8
作者
DiMaggio, Peter A., Jr. [1 ]
Floudas, Christodoulos A. [1 ]
机构
[1] Princeton Univ, Dept Chem Engn, Princeton, NJ 08544 USA
关键词
mixed-integer linear optimization (MILP); de novo peptide identification; tandem mass spectrometry (MS/MS);
D O I
10.1002/aic.11061
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
A novel methodology for the de novo identification of peptides by mixed-integer optimization and tandem mass spectrometry is presented in this article. The various features of the mathematical model are presented and examples are used to illustrate the key concepts of the proposed approach. Several problems are examined to illustrate the proposed method's ability to address (1) residue-dependent fragmentation properties and (2) the variability of resolution in different mass analyzers. A preprocessing algorithm is used to identify important m/z values in the tandem mass spectrum. Missing peaks, resulting from residue-dependent fragmentation characteristics, are dealt with using a two-stage algorithmic framework. A cross-correlation approach is used to resolve missing amino acid assignments and to identify the most probable peptide by comparing the theoretical spectra of the candidate sequences that were generated from the MILP sequencing stages with the experimental tandem mass spectrum. (c) 2006 American Institute of Chemical Engineers AIChEJ, 53: 160-173, 2007.
引用
收藏
页码:160 / 173
页数:14
相关论文
共 63 条
[21]   A MIXED-INTEGER NONLINEAR-PROGRAMMING FORMULATION FOR THE SYNTHESIS OF HEAT-INTEGRATED DISTILLATION SEQUENCES [J].
FLOUDAS, CA ;
PAULES, GE .
COMPUTERS & CHEMICAL ENGINEERING, 1988, 12 (06) :531-546
[22]   SYNTHESIS OF DISTILLATION SEQUENCES WITH SEVERAL MULTICOMPONENT FEED AND PRODUCT STREAMS [J].
FLOUDAS, CA ;
ANASTASIADIS, SH .
CHEMICAL ENGINEERING SCIENCE, 1988, 43 (09) :2407-2419
[23]   STRATEGIES FOR OVERCOMING UNCERTAINTIES IN HEAT-EXCHANGER NETWORK SYNTHESIS [J].
FLOUDAS, CA ;
CIRIC, AR .
COMPUTERS & CHEMICAL ENGINEERING, 1989, 13 (10) :1133-1152
[24]   SYNTHESIS OF FLEXIBLE HEAT-EXCHANGER NETWORKS WITH UNCERTAIN FLOWRATES AND TEMPERATURES [J].
FLOUDAS, CA ;
GROSSMANN, IE .
COMPUTERS & CHEMICAL ENGINEERING, 1987, 11 (04) :319-336
[25]   PepNovo: De novo peptide sequencing via probabilistic network modeling [J].
Frank, A ;
Pevzner, P .
ANALYTICAL CHEMISTRY, 2005, 77 (04) :964-973
[26]   Intensity-based statistical scorer for tandem mass spectrometry [J].
Havilio, M ;
Haddad, Y ;
Smilansky, Z .
ANALYTICAL CHEMISTRY, 2003, 75 (03) :435-444
[27]   Sequence optimization as an alternative to de novo analysis of tandem mass spectrometry data [J].
Heredia-Langner, A ;
Cannon, WR ;
Jarman, KD ;
Jarman, KH .
BIOINFORMATICS, 2004, 20 (14) :2296-2304
[28]   Popitam: Towards new heuristic strategies to improve protein identification from tandem mass spectrometry data [J].
Hernandez, P ;
Gras, R ;
Frey, J ;
Appel, RD .
PROTEOMICS, 2003, 3 (06) :870-878
[29]   PROTEIN SEQUENCING BY TANDEM MASS-SPECTROMETRY [J].
HUNT, DF ;
YATES, JR ;
SHABANOWITZ, J ;
WINSTON, S ;
HAUER, CR .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1986, 83 (17) :6233-6237
[30]   A model of random sequences for de novo peptide sequencing [J].
Jarman, KD ;
Cannon, WR ;
Jarman, KH ;
Heredia-Langner, A .
THIRD IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING - BIBE 2003, PROCEEDINGS, 2003, :206-213