Automatic deconvolution of isotope-resolved mass spectra using variable selection and quantized peptide mass distribution

被引:42
作者
Du, Peicheng [1 ]
Angeletti, Ruth Hogue [1 ]
机构
[1] Albert Einstein Coll Med, Dept Dev & Mol Biol, Bronx, NY 10461 USA
关键词
D O I
10.1021/ac052212q
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
We present an algorithm for the deconvolution of isotope-resolved mass spectra of complex peptide mixtures where peaks and isotope series often overlap. The algorithm formulates the problem of mass spectrum deconvolution as a classical statistical problem of variable selection, which aims to interpret the spectrum with the least number of peptides. The LASSO method is used to perform automatic variable selection. The algorithm also makes use of the quantized distribution of peptide masses in the NCBInr database after in silico trypsin digestion as filters to aid the deconvolution process. Errors in the expected isotope pattern are accounted for to avoid spurious isotope series. The effectiveness of the algorithm is demonstrated with annotated ESI spectrum of known peptides for which the peaks and isotope series are highly overlapping. The algorithm successfully finds all correct masses in the experimental spectrum, except for one spectrum where an additional refinement procedure is required to obtain the correct results. Our results compare favorably to those from a widely used commercial program.
引用
收藏
页码:3385 / 3392
页数:8
相关论文
共 19 条
[1]   Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499
[2]   Automated interpretation of mass spectra of complex mixtures by matching of isotope peak distributions [J].
Fernández-De-Cossio, J ;
Gonzalez, LJ ;
Satomi, Y ;
Betancourt, L ;
Ramos, Y ;
Huerta, V ;
Besada, V ;
Padron, G ;
Minamino, N ;
Takao, T .
RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 2004, 18 (20) :2465-2472
[3]  
Gay S, 1999, ELECTROPHORESIS, V20, P3527, DOI 10.1002/(SICI)1522-2683(19991201)20:18<3527::AID-ELPS3527>3.0.CO
[4]  
2-9
[5]   The variable selection problem [J].
George, EI .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2000, 95 (452) :1304-1308
[6]   Automated reduction and interpretation of high resolution electrospray mass spectra of large molecules [J].
Horn, DM ;
Zubarev, RA ;
McLafferty, FW .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 2000, 11 (04) :320-332
[7]   A software suite for the generation and comparison of peptide arrays from sets of data collected by liquid chromatography-mass spectrometry [J].
Li, XJ ;
Yi, EC ;
Kemp, CJ ;
Zhang, H ;
Aebersold, R .
MOLECULAR & CELLULAR PROTEOMICS, 2005, 4 (09) :1328-1340
[8]  
MANN M, 1995, 43 ASMS C MASS SPECT, P639
[9]   RAPID IDENTIFICATION OF PROTEINS BY PEPTIDE-MASS FINGERPRINTING [J].
PAPPIN, DJC ;
HOJRUP, P ;
BLEASBY, AJ .
CURRENT BIOLOGY, 1993, 3 (06) :327-332
[10]  
Perkins DN, 1999, ELECTROPHORESIS, V20, P3551, DOI 10.1002/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO