Feature extraction and quantification for mass spectrometry in biomedical applications using the mean spectrum

被引:218
作者
Morris, JS
Coombes, KR
Koomen, J
Baggerly, KA
Kobayashi, R
机构
[1] Univ Texas, MD Anderson Canc Ctr, Dept Biostat & Appl Math, Houston, TX 77030 USA
[2] Univ Texas, MD Anderson Canc Ctr, Dept Mol Pathol, Houston, TX 77030 USA
关键词
D O I
10.1093/bioinformatics/bti254
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Mass spectrometry yields complex functional data for which the features of scientific interest are peaks. A common two-step approach to analyzing these data involves first extracting and quantifying the peaks, then analyzing the resulting matrix of peak quantifications. Feature extraction and quantification involves a number of interrelated steps. It is important to perform these steps well, since subsequent analyses condition on these determinations. Also, it is difficult to compare the performance of competing methods for analyzing mass spectrometry data since the true expression levels of the proteins in the population are generally not known. Results: In this paper, we introduce a new method for feature extraction in mass spectrometry data that uses translation-invariant wavelet transforms and performs peak detection using the mean spectrum. We examine the method's performance through examples and simulation, and demonstrate the advantages of using the mean spectrum to detect peaks. We also describe a new physics-based computer model of mass spectrometry and demonstrate how one may design simulation studies based on this tool to systematically compare competing methods.
引用
收藏
页码:1764 / 1775
页数:12
相关论文
共 29 条
[1]  
Adam BL, 2002, CANCER RES, V62, P3609
[2]   Comprehensive proteomic analysis of breast cancer cell membranes reveals unique proteins with potential roles in clinical cancer [J].
Adam, PJ ;
Boyd, R ;
Tyson, KL ;
Fletcher, GC ;
Stamps, A ;
Hudson, L ;
Poyser, HR ;
Redpath, N ;
Griffiths, M ;
Steers, G ;
Harris, AL ;
Patel, S ;
Berry, J ;
Loader, JA ;
Townsend, RR ;
Daviet, L ;
Legrain, P ;
Parekh, R ;
Terrett, JA .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2003, 278 (08) :6482-6489
[3]   A comprehensive approach to the analysis of matrix-assisted laser desorption/ionization-time of flight proteomics spectra from serum samples [J].
Baggerly, KA ;
Morris, JS ;
Wang, J ;
Gold, D ;
Xiao, LC ;
Coombes, KR .
PROTEOMICS, 2003, 3 (09) :1667-1672
[4]   Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments [J].
Baggerly, KA ;
Morris, JS ;
Coombes, KR .
BIOINFORMATICS, 2004, 20 (05) :777-U710
[5]   VELOCITY DISTRIBUTIONS OF INTACT HIGH MASS POLYPEPTIDE MOLECULE IONS PRODUCED BY MATRIX ASSISTED LASER DESORPTION [J].
BEAVIS, RC ;
CHAIT, BT .
CHEMICAL PHYSICS LETTERS, 1991, 181 (05) :479-484
[6]  
CARPENTER M, 2003, P PHARM SAS US GROUP, P545
[7]   High-resolution serum proteomic features for ovarian cancer detection [J].
Conrads, TP ;
Fusaro, VA ;
Ross, S ;
Johann, D ;
Rajapakse, V ;
Hitt, BA ;
Steinberg, SM ;
Kohn, EC ;
Fishman, DA ;
Whiteley, G ;
Barrett, JC ;
Liotta, LA ;
Petricoin, EF ;
Veenstra, TD .
ENDOCRINE-RELATED CANCER, 2004, 11 (02) :163-178
[8]   Quality control and peak finding for proteomics data collected from nipple aspirate fluid by surface-enhanced laser desorption and ionization [J].
Coombes, KR ;
Fritsche, HA ;
Clarke, C ;
Chen, JN ;
Baggerly, KA ;
Morris, JS ;
Xiao, LC ;
Hung, MC ;
Kuerer, HM .
CLINICAL CHEMISTRY, 2003, 49 (10) :1615-1623
[9]  
COOMBES KR, 2005, IN PRESS CANC INFORM
[10]  
COOMBES KR, 2004, UTMDABTR00704 MD AND