Improving reproducibility and sensitivity in identifying human proteins by shotgun proteomics

被引:184
作者
Resing, KA [1 ]
Meyer-Arendt, K
Mendoza, AM
Aveline-Wolf, LD
Jonscher, KR
Pierce, KG
Old, WM
Cheung, HT
Russell, S
Wattawa, JL
Goehle, GR
Knight, RD
Ahn, NG
机构
[1] Univ Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
[2] Univ Colorado, Howard Hughes Med Inst, Boulder, CO 80309 USA
关键词
D O I
10.1021/ac035229m
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Identifying proteins in cell extracts by shotgun proteomics involves digesting the proteins, sequencing the resulting peptides by data-dependent mass spectrometry (MS/MS), and searching protein databases to identify the proteins from which the peptides are derived. Manual analysis and direct spectral comparison reveal that scores from two commonly used search programs (Sequest and Mascot) validate less than half of potentially identifiable MS/MS spectra (class positive) from shotgun analyses of the human erythroleukemia K562 cell line. Here we demonstrate increased sensitivity and accuracy using a focused search strategy along with a peptide sequence validation script that does not rely exclusively on XCorr or Mowse scores generated by Sequest or Mascot, but uses consensus between the search programs, along with chemical properties and scores describing the nature of the fragmentation spectrum (ion score and RSP). The approach yielded 4.2% false positive and 8% false negative frequencies in peptide assignments. The protein profile is then assembled from peptide assignments using a novel peptide-centric protein nomenclature that more accurately reports protein variants that contain identical peptide sequences. An Isoform Resolver algorithm ensures that the protein count is not inflated by variants in the protein database, eliminating similar to25% of redundant proteins. Analysis of soluble proteins from a human K562 cells identified 5130 unique proteins, with similar to100 false positive protein assignments.
引用
收藏
页码:3556 / 3568
页数:13
相关论文
共 23 条
[1]   A new algorithm for the evaluation of shotgun peptide sequencing in proteomics: Support vector machine classification of peptide MS/MS spectra and SEQUEST scores [J].
Anderson, DC ;
Li, WQ ;
Payan, DG ;
Noble, WS .
JOURNAL OF PROTEOME RESEARCH, 2003, 2 (02) :137-146
[2]   MASS SHIFTS DUE TO ION-ION INTERACTIONS IN A QUADRUPOLE ION-TRAP MASS-SPECTROMETER [J].
CLEVEN, CD ;
COX, KA ;
COOKS, RG ;
BIER, ME .
RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 1994, 8 (06) :451-454
[3]   AN APPROACH TO CORRELATE TANDEM MASS-SPECTRAL DATA OF PEPTIDES WITH AMINO-ACID-SEQUENCES IN A PROTEIN DATABASE [J].
ENG, JK ;
MCCORMACK, AL ;
YATES, JR .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1994, 5 (11) :976-989
[4]  
HILL ROBERT L., 1965, ADVANCE PROTEIN CHEM, V20, P37, DOI 10.1016/S0065-3233(08)60388-5
[5]   Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search [J].
Keller, A ;
Nesvizhskii, AI ;
Kolker, E ;
Aebersold, R .
ANALYTICAL CHEMISTRY, 2002, 74 (20) :5383-5392
[6]  
Keller Andrew, 2002, OMICS A Journal of Integrative Biology, V6, P207, DOI 10.1089/153623102760092805
[7]   Direct analysis of protein complexes using mass spectrometry [J].
Link, AJ ;
Eng, J ;
Schieltz, DM ;
Carmack, E ;
Mize, GJ ;
Morris, DR ;
Garvik, BM ;
Yates, JR .
NATURE BIOTECHNOLOGY, 1999, 17 (07) :676-682
[8]   Probability-based validation of protein identifications using a modified SEQUEST algorithm [J].
MacCoss, MJ ;
Wu, CC ;
Yates, JR .
ANALYTICAL CHEMISTRY, 2002, 74 (21) :5593-5599
[9]   Direct analysis and identification of proteins in mixtures by LC/MS/MS and database searching at the low-femtomole level [J].
McCormack, AL ;
Schieltz, DM ;
Goode, B ;
Yang, S ;
Barnes, G ;
Drubin, D ;
Yates, JR .
ANALYTICAL CHEMISTRY, 1997, 69 (04) :767-776
[10]  
MEYERARENDT KJ, UNPUB