PEPPeR, a platform for experimental proteomic pattern recognition

被引:111
作者
Jaffe, Jacob D.
Mani, D. R.
Leptos, Kyriacos C.
Church, George M.
Gillette, Michael A.
Carr, Steven A. [1 ]
机构
[1] Broad Inst Harvard, Cambridge, MA 02142 USA
[2] MIT, Cambridge, MA 02142 USA
[3] Harvard Univ, Sch Med, Dept Genet, Boston, MA 02115 USA
关键词
D O I
10.1074/mcp.M600222-MCP200
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Quantitative proteomics holds considerable promise for elucidation of basic biology and for clinical biomarker discovery. However, it has been difficult to fulfill this promise due to over-reliance on identification-based quantitative methods and problems associated with chromatographic separation reproducibility. Here we describe new algorithms termed "Landmark Matching" and "Peak Matching" that greatly reduce these problems. Landmark Matching performs time base-independent propagation of peptide identities onto accurate mass LC-MS features in a way that leverages historical data derived from disparate data acquisition strategies. Peak Matching builds upon Landmark Matching by recognizing identical molecular species across multiple LC-MS experiments in an identity-independent fashion by clustering. We have bundled these algorithms together with other algorithms, data acquisition strategies, and experimental designs to create a Platform for Experimental Proteomic Pattern Recognition (PEPPeR). These developments enable use of established statistical tools previously limited to microarray analysis for treatment of proteomics data. We demonstrate that the proposed platform can be calibrated across 2.5 orders of magnitude and can perform robust quantification of ratios in both simple and complex mixtures with good precision and error characteristics across multiple sample preparations. We also demonstrate de novo marker discovery based on statistical significance of unidentified accurate mass components that changed between two mixtures. These markers were subsequently identified by accurate mass-driven MS/MS acquisition and demonstrated to be contaminant proteins associated with known proteins whose concentrations were designed to change between the two mixtures. These results have provided a real world validation of the platform for marker discovery.
引用
收藏
页码:1927 / 1941
页数:15
相关论文
共 44 条
[1]   Aligning gene expression time series with time warping algorithms [J].
Aach, J ;
Church, GM .
BIOINFORMATICS, 2001, 17 (06) :495-508
[2]   Toward a human blood serum proteome - Analysis by multidimensional separation coupled with mass spectrometry [J].
Adkins, JN ;
Varnum, SM ;
Auberry, KJ ;
Moore, RJ ;
Angell, NH ;
Smith, RD ;
Springer, DL ;
Pounds, JG .
MOLECULAR & CELLULAR PROTEOMICS, 2002, 1 (12) :947-955
[3]   The human plasma proteome - History, character, and diagnostic prospects [J].
Anderson, NL ;
Anderson, NG .
MOLECULAR & CELLULAR PROTEOMICS, 2002, 1 (11) :845-867
[4]   MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[5]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[6]   Systematic identification of human mitochondrial disease genes through integrative genomics [J].
Calvo, S ;
Jain, M ;
Xie, XH ;
Sheth, SA ;
Chang, B ;
Goldberger, OA ;
Spinazzola, A ;
Zeviani, M ;
Carr, SA ;
Mootha, VK .
NATURE GENETICS, 2006, 38 (05) :576-582
[7]   AN APPROACH TO CORRELATE TANDEM MASS-SPECTRAL DATA OF PEPTIDES WITH AMINO-ACID-SEQUENCES IN A PROTEIN DATABASE [J].
ENG, JK ;
MCCORMACK, AL ;
YATES, JR .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1994, 5 (11) :976-989
[8]  
FRALEY C, 1998, 342 U WASH
[9]  
Gelman A, 2003, BAYESIAN DATA ANAL
[10]   Place of pattern in proteomic biomarker discovery [J].
Gillette, MA ;
Mani, DR ;
Carr, SA .
JOURNAL OF PROTEOME RESEARCH, 2005, 4 (04) :1143-1154