VPMCD: Variable interaction modeling approach for class discrimination in biological systems

被引:78
作者
Raghuraj, Rao [1 ]
Lakshminarayanan, Samavedham [1 ]
机构
[1] Natl Univ Singapore, Dept Chem & Biomol Engn, Singapore 117576, Singapore
关键词
data classification; variable predictive models; discriminant analysis; computational biology; machine learning; multivariate statistics;
D O I
10.1016/j.febslet.2007.01.052
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 [生物化学与分子生物学]; 081704 [应用化学];
摘要
Data classification algorithms applied for class prediction in computational biology literature are data specific and have shown varying degrees of performance. Different classes cannot be distinguished solely based on interclass distances or decision boundaries. We propose that inter-relations among the features be exploited for separating observations into specific classes. A new variable predictive model based class discrimination (VPMCD) method is described here. Three well established and proven data sets of varying statistical and biological significance are utilized as benchmark. The performance of the new method is compared with advanced classification algorithms. The new method performs better during different tests and shows higher stability and robustness. The VPMCD is observed to be a potentially strong classification approach and can be effectively extended to other data mining applications involving biological systems. (c) 2007 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:826 / 830
页数:5
相关论文
共 20 条
[1]
Beck J.V., 1977, PARAMETER ESTIMATION
[2]
BREIMAN L, 1984, CLASSIFICIATION REGR
[3]
CMLACHLAN GJ, 1992, DISCRIMINANT ANAL ST
[4]
Classification and regression trees-studies of HIV reverse transcriptase inhibitors [J].
Daszykowski, M ;
Walczak, B ;
Xu, QS ;
Daeyaert, F ;
de Jonge, MR ;
Heeres, J ;
Koymans, LMH ;
Lewi, PJ ;
Vinkers, HM ;
Janssen, PA ;
Massart, DL .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (02) :716-726
[5]
Duda R. O., 1973, Pattern Classification
[6]
Multi-class cancer subtype classification based on gene expression signatures with reliability analysis [J].
Fu, LM ;
Fu-Liu, CS .
FEBS LETTERS, 2004, 561 (1-3) :186-190
[7]
PSORT-B:: improving protein subcellular localization prediction for Gram-negative bacteria [J].
Gardy, JL ;
Spencer, C ;
Wang, K ;
Ester, M ;
Tusnády, GE ;
Simon, I ;
Hua, S ;
deFays, K ;
Lambert, C ;
Nakai, K ;
Brinkman, FSL .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3613-3617
[8]
Gene selection for cancer classification using support vector machines [J].
Guyon, I ;
Weston, J ;
Barnhill, S ;
Vapnik, V .
MACHINE LEARNING, 2002, 46 (1-3) :389-422
[9]
Support vector machine approach for protein subcellular localization prediction [J].
Hua, SJ ;
Sun, ZR .
BIOINFORMATICS, 2001, 17 (08) :721-728
[10]
Koza JR, 1992, GENETIC PROGRAMMING