Improving feature detection and analysis of surface-enhanced laser desorption/ionization-time of flight mass spectra

被引:28
作者
Carlson, SM [1 ]
Najmi, A [1 ]
Whitin, JC [1 ]
Cohen, HJ [1 ]
机构
[1] Stanford Univ, Sch Med, Dept Pediat, Stanford, CA 94305 USA
关键词
biomarkers; glutathione peroxidase; spectrum analysis; surface-enhanced laser desorption/ionization;
D O I
10.1002/pmic.200401184
中图分类号
Q5 [生物化学];
学科分类号
071010 [生物化学与分子生物学]; 081704 [应用化学];
摘要
Discovering valid biological information from surface-enhanced laser desorption/ionization-time of flight mass spectrometry (SELDI-TOF MS) depends on clear experimental design, meticulous sample handling, and sophisticated data processing. Most published literature deals with the biological aspects of these experiments, or with computer-learning algorithms to locate sets of classifying biomarkers. The process of locating and measuring proteins across spectra has received less attention. This process should be tunable between sensitivity and false-discovery, and should guarantee that features are biologically meaningful in that they represent chemical species that can be identified and investigated. Existing feature detection in SELDI-TOF MS is not optimal for acquiring biologically relevant data. Most methods have so many user-defined settings that reproducibility and comparability among studies suffer considerably. To address these issues, we have developed an approach, called simultaneous spectrum analysis (SSA), which (i) locates proteins across spectra, (ii) measures their abundance, (iii) subtracts baseline, (iv) excludes irreproducible measurements, and (v) computes normalization factors for comparing spectra. SSA uses only two key parameters for feature detection and one parameter each for quality thresholds on spectra and peaks. The effectiveness of SSA is demonstrated by identifying proteins differentially expressed in SELDI-TOF spectra from plasma of wild-type and knockout mice for plasma glutathione peroxidase. Comparing analyses by SSA and Ciphergen Express Data Manager 2.1 finds similar results for large signal peaks, but SSA improves the number and quality of differences betweens groups among lower signal peaks. SSA is also less likely to introduce systematic bias when normalizing spectra.
引用
收藏
页码:2778 / 2788
页数:11
相关论文
共 18 条
[1]
Adam BL, 2002, CANCER RES, V62, P3609
[2]
A comprehensive approach to the analysis of matrix-assisted laser desorption/ionization-time of flight proteomics spectra from serum samples [J].
Baggerly, KA ;
Morris, JS ;
Wang, J ;
Gold, D ;
Xiao, LC ;
Coombes, KR .
PROTEOMICS, 2003, 3 (09) :1667-1672
[3]
Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments [J].
Baggerly, KA ;
Morris, JS ;
Coombes, KR .
BIOINFORMATICS, 2004, 20 (05) :777-U710
[4]
An integrated approach utilizing artificial neural networks and SELDI mass spectrometry for the classification of human tumours and rapid identification of potential biomarkers [J].
Ball, G ;
Mian, S ;
Holding, F ;
Allibone, RO ;
Lowe, J ;
Ali, S ;
Li, G ;
McCardle, S ;
Ellis, IO ;
Creaser, C ;
Rees, RC .
BIOINFORMATICS, 2002, 18 (03) :395-404
[5]
Quality control and peak finding for proteomics data collected from nipple aspirate fluid by surface-enhanced laser desorption and ionization [J].
Coombes, KR ;
Fritsche, HA ;
Clarke, C ;
Chen, JN ;
Baggerly, KA ;
Morris, JS ;
Xiao, LC ;
Hung, MC ;
Kuerer, HM .
CLINICAL CHEMISTRY, 2003, 49 (10) :1615-1623
[6]
MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[7]
Protein biochips for differential profiling [J].
Fung, ET ;
Thulasiraman, V ;
Weinberger, SR ;
Dalmasso, EA .
CURRENT OPINION IN BIOTECHNOLOGY, 2001, 12 (01) :65-69
[8]
Megavariate data analysis of mass spectrometric proteomics data using latent variable projection method [J].
Lee, KR ;
Lin, XW ;
Park, DC ;
Eslava, S .
PROTEOMICS, 2003, 3 (09) :1680-1686
[9]
Probabilistic disease classification of expression-dependent proteomic data from mass spectrometry of human serum [J].
Lilien, RH ;
Farid, H ;
Donald, BR .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2003, 10 (06) :925-946
[10]
Decision tree classification of proteins identified by mass spectrometry of blood serum samples from people with and without lung cancer [J].
Markey, MK ;
Tourassi, GD ;
Floyd, CE .
PROTEOMICS, 2003, 3 (09) :1678-1679