An intensity-region driven multi-classifier scheme for improving the classification accuracy of proteomic MS-spectra

被引:3
作者
Bougioukos, Panagiotis [1 ]
Glotsos, Dimitris [2 ]
Cavouras, Dionisis [2 ]
Daskalakis, Antonis [1 ]
Kalatzis, Ioannis [2 ]
Kostopoulos, Spiros [1 ]
Nikiforidis, George [1 ]
Bezerianos, Anastasios [1 ]
机构
[1] Univ Patras, Sch Med, Dept Med Phys, GR-26504 Patras, Rio, Greece
[2] Technol Educ Inst Athens, Dept Med Instruments Technol, Med Signal & Image Proc Lab, Athens, Greece
关键词
Classification; Ovarian cancer; Pre-processing; MASS-SPECTROMETRY; FEATURE-EXTRACTION; CANCER-DETECTION; SERUM; QUANTIFICATION; SELECTION; PATTERNS; MARKERS; BIAS;
D O I
10.1016/j.cmpb.2009.11.003
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this study, a pattern recognition system is presented for improving the classification accuracy of MS-spectra by means of gathering information from different MS-spectra intensity regions using a majority vote ensemble combination. The method starts by automatically breaking down all MS-spectra into common intensity regions. Subsequently, the most informative features (m/z values), which might constitute potential significant biomarkers, are extracted from each common intensity region over all the MS-spectra and, finally, normal from ovarian cancer MS-spectra are discriminated using a multi-classifier scheme, with members the Support Vector Machine, the Probabilistic Neural Network and the k-Nearest Neighbour classifiers. Clinical material was obtained from the publicly available ovarian proteomic dataset (8-7-02). To ensure robust and reliable estimates, the proposed pattern recognition system was evaluated using an external cross-validation process. The average overall performance of the system in discriminating normal from cancer ovarian MS-spectra was 97.18% with 98.52% mean sensitivity and 94.84% mean specificity values. (C) 2009 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:147 / 153
页数:7
相关论文
共 29 条
[1]   Ovarian cancer detection by logical analysis of proteomic data [J].
Alexe, G ;
Alexe, S ;
Liotta, LA ;
Petricoin, E ;
Reiss, M ;
Hammer, PL .
PROTEOMICS, 2004, 4 (03) :766-783
[2]   Pattern-based feature selection in genomics and proteomics [J].
Alexe, Gabriela ;
Alexe, Sorin ;
Hammer, Peter L. ;
Vizvari, Bela .
ANNALS OF OPERATIONS RESEARCH, 2006, 148 (01) :189-201
[3]   Selection bias in gene extraction on the basis of microarray gene-expression data [J].
Ambroise, C ;
McLachlan, GJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) :6562-6566
[4]   Signal background estimation and baseline correction algorithms for accurate DNA sequencing [J].
Andrade, L ;
Manolakos, ES .
JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2003, 35 (03) :229-243
[5]  
[Anonymous], 2004, COMBINING PATTERN CL, DOI DOI 10.1002/0471660264
[6]   Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments [J].
Baggerly, KA ;
Morris, JS ;
Coombes, KR .
BIOINFORMATICS, 2004, 20 (05) :777-U710
[7]  
Baggerly KA, 2005, CANCER INFORM, V1, P9
[8]  
BARLA A, 2006, COMP BAS MED SYST SA
[9]  
CHRISTANINI N, 2000, INTRO SUPPORT VECTOR
[10]   ROBUST LOCALLY WEIGHTED REGRESSION AND SMOOTHING SCATTERPLOTS [J].
CLEVELAND, WS .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (368) :829-836