Processing and classification of protein mass spectra

被引:123
作者
Hilario, M [1 ]
Kalousis, A
Pellegrini, C
Müller, M
机构
[1] Univ Geneva, Dept Comp Sci, Artificial Intelligence Lab, CH-1211 Geneva 4, Switzerland
[2] ETH Honggerberg, Inst Mol Syst Biol, CH-8093 Zurich, Switzerland
关键词
MS preprocessing; classification; biomarker discovery; data mining; proteomics; machine learning; dimensionality reduction;
D O I
10.1002/mas.20072
中图分类号
O433 [光谱学];
学科分类号
0703 ; 070302 ;
摘要
Among the many applications of mass spectrometry, biomarker pattern discovery from protein mass spectra has aroused considerable interest in the past few years. While research efforts have raised hopes of early and less invasive diagnosis, they have also brought to light the many issues to be tackled before mass-spectra-based proteomic patterns become routine clinical tools. Known issues cover the entire pipeline leading from sample collection through mass spectrometry analytics to biomorker pattern extraction, validation, and interpretation. This study focuses on the data-analytical phase, which takes as input mass spectra of biological specimens and discovers patterns of peak masses and intensities that discriminate between different pathological states. We survey current work and investigate computational issues concerning the different stages of the knowledge disco very process: exploratory analysis, quality control, and diverse transforms of mass spectra, followed by further dimensionality reduction, classification, and model evaluation. We conclude after a brief discussion of the critical biomedical task of analyzing discovered discriminatory patterns to identify their component proteins as well as interpret and valid ate their biological implications. (c) 2006 Wiley Periodicals, Inc.
引用
收藏
页码:409 / 449
页数:41
相关论文
共 200 条
[1]   Aligning gene expression time series with time warping algorithms [J].
Aach, J ;
Church, GM .
BIOINFORMATICS, 2001, 17 (06) :495-508
[2]  
Adam BL, 2002, CANCER RES, V62, P3609
[3]   Comprehensive proteomic analysis of breast cancer cell membranes reveals unique proteins with potential roles in clinical cancer [J].
Adam, PJ ;
Boyd, R ;
Tyson, KL ;
Fletcher, GC ;
Stamps, A ;
Hudson, L ;
Poyser, HR ;
Redpath, N ;
Griffiths, M ;
Steers, G ;
Harris, AL ;
Patel, S ;
Berry, J ;
Loader, JA ;
Townsend, RR ;
Daviet, L ;
Legrain, P ;
Parekh, R ;
Terrett, JA .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2003, 278 (08) :6482-6489
[4]   Mass spectrometry-based proteomics [J].
Aebersold, R ;
Mann, M .
NATURE, 2003, 422 (6928) :198-207
[5]   Ovarian cancer detection by logical analysis of proteomic data [J].
Alexe, G ;
Alexe, S ;
Liotta, LA ;
Petricoin, E ;
Reiss, M ;
Hammer, PL .
PROTEOMICS, 2004, 4 (03) :766-783
[6]   ApoC-I and ApoC-III as potential plasmatic markers to distinguish between ischemic and hemorrhagic stroke [J].
Allard, L ;
Lescuyer, P ;
Burgess, J ;
Leung, KY ;
Ward, M ;
Walter, N ;
Burkhard, PR ;
Corthals, G ;
Hochstrasser, DF ;
Sanchez, JC .
PROTEOMICS, 2004, 4 (08) :2242-2251
[7]   Selection bias in gene extraction on the basis of microarray gene-expression data [J].
Ambroise, C ;
McLachlan, GJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) :6562-6566
[8]   Quantifying reproducibility for differential proteomics: noise analysis for protein liquid chromatography-mass spectrometry of human serum [J].
Anderle, M ;
Roy, S ;
Lin, H ;
Becker, C ;
Joho, K .
BIOINFORMATICS, 2004, 20 (18) :3575-3582
[9]   A universal denoising and peak picking algorithm for LC-MS based on matched filtration in the chromatographic time domain [J].
Andreev, VP ;
Rejtar, T ;
Chen, HS ;
Moskovets, EV ;
Ivanov, AR ;
Karger, BL .
ANALYTICAL CHEMISTRY, 2003, 75 (22) :6314-6326
[10]  
[Anonymous], P 14 INT JOINT C AI