SVM Model for Quality Assessment of Medium Resolution Mass Spectra from 18O-Water Labeling Experiments

被引:6
作者
Nefedov, Alexey V. [1 ]
Gilski, Miroslaw J. [1 ,3 ]
Sadygov, Rovshan G. [1 ,2 ]
机构
[1] Univ Texas Med Branch, Dept Biochem & Mol Biol, Galveston, TX 77555 USA
[2] Univ Texas Med Branch, Sealy Ctr Mol Med, Galveston, TX 77555 USA
[3] Univ Texas Med Branch, UTMB Bioinformat Program, Galveston, TX 77555 USA
关键词
support vector machines; stable-isotope labeling; signal-to-noise ratio; isotope distribution; mass accuracy; PROTEIN IDENTIFICATION; COMPREHENSIVE ANALYSIS; SEQUENCE DATABASES; SPECTROMETRY; PROTEOMICS; PEPTIDES; TANDEM; O-18; QUANTIFICATION; ELECTROSPRAY;
D O I
10.1021/pr1012174
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We describe a method for assessing the quality of mass spectra and improving reliability of relative ratio estimations from O-18-water labeling experiments acquired from low resolution mass spectrometers. The mass profiles of heavy and light peptide pairs are often affected by artifacts, including coeluting contaminant species, noise signal, instrumental fluctuations in measuring ion position and abundance levels. Such artifacts distort the profiles, leading to erroneous ratio estimations, thus reducing the reliability of ratio estimations in high throughput quantification experiments. We used support vector machines (SVMs) to filter out mass spectra that deviated significantly from expected theoretical isotope distributions. We built an SVM classifier with a decision function that assigns a score to every mass profile based on such spectral features as mass accuracy, signal-to-noise ratio, and differences between experimental and theoretical isotopic distributions. The classifier was trained using a data set obtained from samples of mouse renal cortex. We then tested it on protein samples (bovine serum albumin) mixed in five different ratios of labeled and unlabeled species. We demonstrated that filtering the data using our SVM classifier results in as much as a 9-fold reduction in the coefficient of variance of peptide ratios, thus significantly improving the reliability of ratio estimations.
引用
收藏
页码:2095 / 2103
页数:9
相关论文
共 51 条
[1]   A new algorithm for the evaluation of shotgun peptide sequencing in proteomics: Support vector machine classification of peptide MS/MS spectra and SEQUEST scores [J].
Anderson, DC ;
Li, WQ ;
Payan, DG ;
Noble, WS .
JOURNAL OF PROTEOME RESEARCH, 2003, 2 (02) :137-146
[2]   The Impact of Peptide Abundance and Dynamic Range on Stable-Isotope-Based Quantitative Proteomic Analyses [J].
Bakalarski, Corey E. ;
Elias, Joshua E. ;
Villen, Judit ;
Haas, Wilhelm ;
Gerber, Scott A. ;
Everley, Patrick A. ;
Gygi, Steven P. .
JOURNAL OF PROTEOME RESEARCH, 2008, 7 (11) :4756-4765
[3]   Quantitative mass spectrometry in proteomics: a critical review [J].
Bantscheff, Marcus ;
Schirle, Markus ;
Sweetman, Gavain ;
Rick, Jens ;
Kuster, Bernhard .
ANALYTICAL AND BIOANALYTICAL CHEMISTRY, 2007, 389 (04) :1017-1031
[4]   A suite of algorithms for the comprehensive analysis of complex protein mixtures using high-resolution LC-MS [J].
Bellew, Matthew ;
Coram, Marc ;
Fitzgibbon, Matthew ;
Igra, Mark ;
Randolph, Tim ;
Wang, Pei ;
May, Damon ;
Eng, Jimmy ;
Fang, Ruihua ;
Lin, ChenWei ;
Chen, Jinzhi ;
Goodlett, David ;
Whiteaker, Jeffrey ;
Paulovich, Amanda ;
McIntosh, Martin .
BIOINFORMATICS, 2006, 22 (15) :1902-1909
[5]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8]   TANDEM: matching proteins with tandem mass spectra [J].
Craig, R ;
Beavis, RC .
BIOINFORMATICS, 2004, 20 (09) :1466-1467
[9]   Quantification of Isotopically Overlapping Deamidated and 18O-Labeled Peptides Using Isotopic Envelope Mixture Modeling [J].
Dasari, Surendra ;
Wilmarth, Phillip A. ;
Reddy, Ashok P. ;
Robertson, Lucinda J. G. ;
Nagalla, Srinivasa R. ;
David, Larry L. .
JOURNAL OF PROTEOME RESEARCH, 2009, 8 (03) :1263-1270
[10]   Automatic deconvolution of isotope-resolved mass spectra using variable selection and quantized peptide mass distribution [J].
Du, Peicheng ;
Angeletti, Ruth Hogue .
ANALYTICAL CHEMISTRY, 2006, 78 (10) :3385-3392