P-mod: An algorithm and software to map modifications to peptide sequences using tandem MS data

被引:74
作者
Hansen, BT
Davey, SW
Ham, AJL
Liebler, DC
机构
[1] Vanderbilt Univ, Sch Med, Dept Biochem, Nashville, TN 37232 USA
[2] Vanderbilt Univ, Sch Med, Dept Pharmacol, Nashville, TN 37232 USA
[3] Vanderbilt Univ, Sch Med, Mass Spectrometry Res Ctr, Nashville, TN 37232 USA
[4] Univ Arizona, Coll Pharm, Dept Pharmacol, Tucson, AZ 85721 USA
关键词
protein modifications; tandem mass spectrometry; algorithm; extreme value statistics;
D O I
10.1021/pr0498234
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The discovery of unanticipated protein modifications is one of the most challenging problems in proteomics. Whereas widely used algorithms such as Sequest and Mascot enable mapping of modifications when the mass and amino acid specificity are known, unexpected modifications cannot be identified with these tools. We have developed an algorithm and software called P-Mod, which enables discovery and sequence mapping of modifications to target proteins known to be represented in the analysis or identified by Sequest. P-Mod matches MS/MS spectra to peptide sequences in a search list. For spectra of modified peptides, P-Mod calculates mass differences between search peptide sequences and MS/MS precursors and localizes the mass shift to a sequence position in the peptide. Because modifications are detected as mass shifts, P-Mod does not require the user to guess at masses or sequence locations of modifications. P-Mod uses extreme value statistics to assign p value estimates to sequence-to-spectrum matches. The reported p values are scaled to account for the number of comparisons, so that error rates do not increase with the expanded search lists that result from incorporating potential peptide modifications. Combination of P-Mod searches from multiple LC-MS/MS analyses and multiple samples revealed previously unreported BSA modifications, including a novel decarboxymethylation or D -> G substitution at position 579 of the protein. P-Mod can serve a unique role in the identification of protein modifications both from exogenous and endogenous sources and may be useful for identifying modified protein forms as biomarkers for toxicity and disease processes.
引用
收藏
页码:358 / 368
页数:11
相关论文
共 24 条
[1]   Mass spectrometry-based proteomics [J].
Aebersold, R ;
Mann, M .
NATURE, 2003, 422 (6928) :198-207
[2]   ISSUES IN SEARCHING MOLECULAR SEQUENCE DATABASES [J].
ALTSCHUL, SF ;
BOGUSKI, MS ;
GISH, W ;
WOOTTON, JC .
NATURE GENETICS, 1994, 6 (02) :119-129
[3]   Role of oxidative stress in diabetic complications - A new perspective on an old paradigm [J].
Baynes, JW ;
Thorpe, SR .
DIABETES, 1999, 48 (01) :1-9
[4]   Chemical modification of proteins by lipids in diabetes [J].
Baynes, JW .
CLINICAL CHEMISTRY AND LABORATORY MEDICINE, 2003, 41 (09) :1159-1165
[5]  
Creasy DM, 2002, PROTEOMICS, V2, P1426, DOI 10.1002/1615-9861(200210)2:10<1426::AID-PROT1426>3.0.CO
[6]  
2-5
[7]   Covalent modification of amino acid nucleophiles by the lipid peroxidation products 4-hydroxy-2-nonenal and 4-oxo-2-nonenal [J].
Doorn, JA ;
Petersen, DR .
CHEMICAL RESEARCH IN TOXICOLOGY, 2002, 15 (11) :1445-1450
[8]   AN APPROACH TO CORRELATE TANDEM MASS-SPECTRAL DATA OF PEPTIDES WITH AMINO-ACID-SEQUENCES IN A PROTEIN DATABASE [J].
ENG, JK ;
MCCORMACK, AL ;
YATES, JR .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1994, 5 (11) :976-989
[9]   Automated identification of amino acid sequence variations in proteins by HPLC/microspray tandem mass spectrometry [J].
Gatlin, CL ;
Eng, JK ;
Cross, ST ;
Detter, JC ;
Yates, JR .
ANALYTICAL CHEMISTRY, 2000, 72 (04) :757-763
[10]   Open mass spectrometry search algorithm [J].
Geer, LY ;
Markey, SP ;
Kowalak, JA ;
Wagner, L ;
Xu, M ;
Maynard, DM ;
Yang, XY ;
Shi, WY ;
Bryant, SH .
JOURNAL OF PROTEOME RESEARCH, 2004, 3 (05) :958-964