The APEX Quantitative Proteomics Tool: Generating protein quantitation estimates from LC-MS/MS proteomics results

被引:111
作者
Braisted, John C. [1 ]
Kuntumalla, Srilatha [1 ]
Vogel, Christine [2 ]
Marcotte, Edward M. [2 ]
Rodrigues, Alan R. [1 ]
Wang, Rong [1 ]
Huang, Shih-Ting [1 ]
Ferlanti, Erik S. [1 ]
Saeed, Alexander I. [1 ]
Fleischmann, Robert D. [1 ]
Peterson, Scott N. [1 ]
Pieper, Rembert [1 ]
机构
[1] J Craig Venter Inst, Pathogen Funct Genom Resource Ctr, Rockville, MD 20850 USA
[2] Univ Texas Austin, Inst Cellular & Mol Biol, Dept Chem & Biochem, Ctr Syst & Synthet Biol, Austin, TX 78712 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
D O I
10.1186/1471-2105-9-529
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Mass spectrometry (MS) based label-free protein quantitation has mainly focused on analysis of ion peak heights and peptide spectral counts. Most analyses of tandem mass spectrometry (MS/MS) data begin with an enzymatic digestion of a complex protein mixture to generate smaller peptides that can be separated and identified by an MS/MS instrument. Peptide spectral counting techniques attempt to quantify protein abundance by counting the number of detected tryptic peptides and their corresponding MS spectra. However, spectral counting is confounded by the fact that peptide physicochemical properties severely affect MS detection resulting in each peptide having a different detection probability. Lu et al. (2007) described a modified spectral counting technique, Absolute Protein Expression (APEX), which improves on basic spectral counting methods by including a correction factor for each protein (called O(i) value) that accounts for variable peptide detection by MS techniques. The technique uses machine learning classification to derive peptide detection probabilities that are used to predict the number of tryptic peptides expected to be detected for one molecule of a particular protein (O(i)). This predicted spectral count is compared to the protein's observed MS total spectral count during APEX computation of protein abundances. Results: The APEX Quantitative Proteomics Tool, introduced here, is a free open source Java application that supports the APEX protein quantitation technique. The APEX tool uses data from standard tandem mass spectrometry proteomics experiments and provides computational support for APEX protein abundance quantitation through a set of graphical user interfaces that partition thparameter controls for the various processing tasks. The tool also provides a Z-score analysis for identification of significant differential protein expression, a utility to assess APEX classifier performance via cross validation, and a utility to merge multiple APEX results into a standardized format in preparation for further statistical analysis. Conclusion: The APEX Quantitative Proteomics Tool provides a simple means to quickly derive hundreds to thousands of protein abundance values from standard liquid chromatography-tandem mass spectrometry proteomics datasets. The APEX tool provides a straightforward intuitive interface design overlaying a highly customizable computational workflow to produce protein abundance values from LC-MS/MS datasets.
引用
收藏
页数:11
相关论文
共 25 条
[1]   Mass spectrometry-based proteomics [J].
Aebersold, R ;
Mann, M .
NATURE, 2003, 422 (6928) :198-207
[2]  
[Anonymous], 2005, Data Mining Pratical Machine Learning Tools and Techniques
[3]  
[Anonymous], Weka Machine Learning Project
[4]   New tools for quantitative phosphoproteome analysis [J].
Conrads, TP ;
Issaq, HJ ;
Veenstra, TD .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2002, 290 (03) :885-890
[5]   Changes in the protein expression of yeast as a function of carbon source [J].
Gao, J ;
Opiteck, GJ ;
Friedrichs, MS ;
Dongre, AR ;
Hefta, SA .
JOURNAL OF PROTEOME RESEARCH, 2003, 2 (06) :643-649
[6]   Quantitative analysis of complex protein mixtures using isotope-coded affinity tags [J].
Gygi, SP ;
Rist, B ;
Gerber, SA ;
Turecek, F ;
Gelb, MH ;
Aebersold, R .
NATURE BIOTECHNOLOGY, 1999, 17 (10) :994-999
[7]   Exponentially modified protein abundance index (emPAI) for estimation of absolute protein amount in proteomics by the number of sequenced peptides per protein [J].
Ishihama, Y ;
Oda, Y ;
Tabata, T ;
Sato, T ;
Nagasu, T ;
Rappsilber, J ;
Mann, M .
MOLECULAR & CELLULAR PROTEOMICS, 2005, 4 (09) :1265-1272
[8]   Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search [J].
Keller, A ;
Nesvizhskii, AI ;
Kolker, E ;
Aebersold, R .
ANALYTICAL CHEMISTRY, 2002, 74 (20) :5383-5392
[9]   A uniform proteomics MS/MS analysis platform utilizing open XML file formats [J].
Keller, Andrew ;
Eng, Jimmy ;
Zhang, Ning ;
Li, Xiao-jun ;
Aebersold, Ruedi .
MOLECULAR SYSTEMS BIOLOGY, 2005, 1 (1) :2005.0017
[10]   A model for random sampling and estimation of relative protein abundance in shotgun proteomics [J].
Liu, HB ;
Sadygov, RG ;
Yates, JR .
ANALYTICAL CHEMISTRY, 2004, 76 (14) :4193-4201