Corra: Computational framework and tools for LC-MS discovery and targeted mass spectrometry-based proteomics

被引:51
作者
Brusniak, Mi-Youn [1 ]
Bodenmiller, Bernd [2 ,3 ]
Campbell, David [1 ]
Cooke, Kelly [1 ]
Eddes, James [1 ]
Garbutt, Andrew [1 ]
Lau, Hollis [1 ]
Letarte, Simon [1 ]
Mueller, Lukas N. [2 ,3 ]
Sharma, Vagisha [1 ]
Vitek, Olga [4 ,5 ]
Zhang, Ning [1 ]
Aebersold, Ruedi [1 ,2 ,3 ,6 ]
Watts, Julian D. [1 ]
机构
[1] Inst Syst Biol, Seattle, WA 98103 USA
[2] ETH, Inst Mol Syst Biol, Zurich, Switzerland
[3] ETH, Competence Ctr Syst Physiol & Metab Dis, Zurich, Switzerland
[4] Purdue Univ, Dept Stat, W Lafayette, IN 47907 USA
[5] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
[6] Univ Zurich, Fac Sci, Zurich, Switzerland
基金
瑞士国家科学基金会;
关键词
D O I
10.1186/1471-2105-9-542
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Quantitative proteomics holds great promise for identifying proteins that are differentially abundant between populations representing different physiological or disease states. A range of computational tools is now available for both isotopically labeled and label-free liquid chromatography mass spectrometry (LC-MS) based quantitative proteomics. However, they are generally not comparable to each other in terms of functionality, user interfaces, information input/output, and do not readily facilitate appropriate statistical data analysis. These limitations, along with the array of choices, present a daunting prospect for biologists, and other researchers not trained in bioinformatics, who wish to use LC-MS-based quantitative proteomics. Results: We have developed Corra, a computational framework and tools for discovery-based LC-MS proteomics. Corra extends and adapts existing algorithms used for LC-MS-based proteomics, and statistical algorithms, originally developed for microarray data analyses, appropriate for LC-MS data analysis. Corra also adapts software engineering technologies (e. g. Google Web Toolkit, distributed processing) so that computationally intense data processing and statistical analyses can run on a remote server, while the user controls and manages the process from their own computer via a simple web interface. Corra also allows the user to output significantly differentially abundant LC-MS-detected peptide features in a form compatible with subsequent sequence identification via tandem mass spectrometry (MS/MS). We present two case studies to illustrate the application of Corra to commonly performed LC-MS-based biological workflows: a pilot biomarker discovery study of glycoproteins isolated from human plasma samples relevant to type 2 diabetes, and a study in yeast to identify in vivo targets of the protein kinase Ark1 via phosphopeptide profiling. Conclusion: The Corra computational framework leverages computational innovation to enable biologists or other researchers to process, analyze and visualize LC-MS data with what would otherwise be a complex and not user-friendly suite of tools. Corra enables appropriate statistical analyses, with controlled false-discovery rates, ultimately to inform subsequent targeted identification of differentially abundant peptides by MS/MS. For the user not trained in bioinformatics, Corra represents a complete, customizable, free and open source computational platform enabling LC-MS-based proteomic workflows, and as such, addresses an unmet need in the LC-MS proteomics field.
引用
收藏
页数:22
相关论文
共 32 条
[11]   Place of pattern in proteomic biomarker discovery [J].
Gillette, MA ;
Mani, DR ;
Carr, SA .
JOURNAL OF PROTEOME RESEARCH, 2005, 4 (04) :1143-1154
[12]   Quantitative analysis of complex protein mixtures using isotope-coded affinity tags [J].
Gygi, SP ;
Rist, B ;
Gerber, SA ;
Turecek, F ;
Gelb, MH ;
Aebersold, R .
NATURE BIOTECHNOLOGY, 1999, 17 (10) :994-999
[13]   PEPPeR, a platform for experimental proteomic pattern recognition [J].
Jaffe, Jacob D. ;
Mani, D. R. ;
Leptos, Kyriacos C. ;
Church, George M. ;
Gillette, Michael A. ;
Carr, Steven A. .
MOLECULAR & CELLULAR PROTEOMICS, 2006, 5 (10) :1927-1941
[14]   MZmine: toolbox for processing and visualization of mass spectrometry based molecular profile data [J].
Katajamaa, M ;
Miettinen, J ;
Oresic, M .
BIOINFORMATICS, 2006, 22 (05) :634-636
[15]   Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search [J].
Keller, A ;
Nesvizhskii, AI ;
Kolker, E ;
Aebersold, R .
ANALYTICAL CHEMISTRY, 2002, 74 (20) :5383-5392
[16]   A uniform proteomics MS/MS analysis platform utilizing open XML file formats [J].
Keller, Andrew ;
Eng, Jimmy ;
Zhang, Ning ;
Li, Xiao-jun ;
Aebersold, Ruedi .
MOLECULAR SYSTEMS BIOLOGY, 2005, 1 (1) :2005.0017
[17]   The reproducibility and usefulness of the oral glucose tolerance test in screening for diabetes and other cardiovascular risk factors [J].
Ko, GTC ;
Chan, JCN ;
Woo, J ;
Lau, E ;
Yeung, VTF ;
Chow, CC ;
Cockram, CS .
ANNALS OF CLINICAL BIOCHEMISTRY, 1998, 35 :62-67
[18]   A software suite for the generation and comparison of peptide arrays from sets of data collected by liquid chromatography-mass spectrometry [J].
Li, XJ ;
Yi, EC ;
Kemp, CJ ;
Zhang, H ;
Aebersold, R .
MOLECULAR & CELLULAR PROTEOMICS, 2005, 4 (09) :1328-1340
[19]  
MacCoss MJ, 2005, ANAL CHEM, V77, p295A
[20]   A platform for accurate mass and time analyses of mass spectrometry data [J].
May, Damon ;
Fitzgibbon, Matt ;
Liu, Yan ;
Holzman, Ted ;
Eng, Jimmy ;
Kemp, C. J. ;
Whiteaker, Jeff ;
Paulovich, Amanda ;
McIntosh, Martin .
JOURNAL OF PROTEOME RESEARCH, 2007, 6 (07) :2685-2694